IIIT Hyderabad unveils self-driving car equipped with 3D LIDAR

  • Industry News
  • Nov 18,24
Proffessor Madhava Krishna , Professor and Head- Robotics Research Centre and Kohli Center for Intelligent Systems (KCIS), IIIT Hyderabad, explains the works of the self-driving car.
 IIIT Hyderabad unveils self-driving car equipped with 3D LIDAR

IIIT Hyderabad’s self-driving car is an electric vehicle that performs point to point autonomous driving with collision avoidance capabilities over a wide area. Equipped with 3D LIDAR, depth cameras, GPS systems and AHRS (Attitude and Heading Reference System which essentially means sensors on three axes to estimate it’s orientation in space), the car can also accept open set natural language commands and follow those commands to reach a desired destination. SLAM-based point cloud mapping is used to map the campus environment and a LIDAR-guided real-time state estimation allows for localisation while driving. Trajectory optimisation frameworks a-la model predictive control provides for real-time rollout of optimal trajectories. Such trajectories can be initialised with data-driven models for faster inference time optimisation. A number of publications in high-profile venues and highly competitive conferences decorate the research landscape of autonomous driving research at the International Institute of Information Technology, Hyderabad, Gachibowli campus.
 
Open set navigation
To better understand open set navigation commands; let’s first consider how humans often navigate – with minimal map usage, relying on contextual cues and verbal instructions. Many navigation directions are exchanged based on recognising specific environmental landmarks or features, for example, “Take right next to the white building” or “Drop off near the entrance”. In a parallel context, autonomous driving agents need precise knowledge of their pose (or localisation) within the environment for effective navigation. These are typically achieved using high-resolution GPS or pre-built High-Definition (HD) Maps like Point Cloud, which are compute and memory-intensive.
 
Alternatively, many efforts utilise open-source topological maps (GPS-like Maps) for geolocalisation, like OpenStreetMaps (OSM), as a lightweight solution. However, they are metrically inaccurate (6-8 meters) and suffer localisation errors when navigating to arbitrary destinations within the map. Moreover, some environmental landmarks, such as open parking spaces, may not be marked in OSM maps due to their dynamic nature. Thus, at IIITH, we are interested in exploring a feasible and scalable method to localise using solely real-world landmarks, akin to human navigation.
 
IIITH’s efforts
It is intended to address it by exploiting foundational models that have a generic semantic understanding of the world which can be distilled for downstream localisation and navigation tasks. This has been achieved by augmenting open-source topological maps (like OSM) with language landmarks, such as “a bench”, “an underpass”, “football field”, etc. which resemble the cognitive localisation process employed by humans. These enable an “open-vocabulary” nature that allows navigation to places for which the model is not explicitly trained, leading to a zero-shot generalisation to new scenes. The RRC Autonomous Driving group- AutoDP at IIITH demonstrates an attempt to integrate classical methods with post-modern solutions in open-world understanding, bringing the best of both worlds to solve the ever-challenging task of precise localisation and navigation with real-world deployment through the in-house developed prototype.

Differential planning
Mapping, localisation, and planning form the key components in the autonomous navigation stack. While both the modular pipeline and end-to-end architectures have been the traditional driving paradigms, the integration of language modality is slowly becoming a defacto approach to enhance the explainability of autonomous driving systems. A natural extension of these systems in the vision-language context is their ability to follow navigation instructions given in natural language – for example, “Take a right turn and stop near the food stall.”

The primary objective is to ensure reliable collision-free planning. Traditionally, upstream predictions and perception are customised for improving the downstream tasks, which is typical in current Vision-Language-Action models and other existing end-to-end architectures. However, prediction and perception components are often tuned with their own objectives, rather than the overall navigation goal. In such a pipeline, the planning module heavily depends on the perception abilities of these models, making them vulnerable to prediction errors. Thus, end-to-end training with downstream planning tasks becomes crucial, ensuring feasibility even with arbitrary predictions from upstream perception and prediction components.
 
Our USP: NLP+VLM
To achieve this capability, IIITH has developed a lightweight vision-language model that combines visual scene understanding with natural language processing. The model processes the vehicle’s perspective view alongside encoded language commands to predict goal locations in one-shot. However, these predictions can sometimes conflict with real-world constraints. For example, when instructed to “park behind the red car,” the system might suggest a location in a non-road area or overlapping with the red car itself. To overcome this challenge, we augment the perception module with a custom planner within a neural network framework. This requires the planner to be differentiable, enabling gradient flow throughout the entire architecture during training which eventually improves both prediction accuracy and the planning quality. This end-to-end training approach with a differentiable planner serves as the key sauce of our work.

Related Stories

Automation & Robotics
Autodesk signs MoU with IIT Bombay to strengthen India’s manufacturing workforce

Autodesk signs MoU with IIT Bombay to strengthen India’s manufacturing workforce

Currently, about 5 million students in over 14,500 schools across India use Autodesk software to build skills relevant to the future workforce.

Read more
Electrical & Electronics
Bharat FIH shuts R&D and supply chain subsidiary amid revenue decline

Bharat FIH shuts R&D and supply chain subsidiary amid revenue decline

In 2021, Bharat FIH had established two subsidiaries—Bharat Taiwan Corporation and Rising Stars Hi-Tech—to enhance research, product development, and engineering capabilities.

Read more
Electrical & Electronics
PG Electroplast enters electric vehicle manufacturing for Spiro Mobility

PG Electroplast enters electric vehicle manufacturing for Spiro Mobility

Spiro Mobility, which recently opened a Global Technology Office in Pune, India, is planning to develop green mobility solutions for the Indian market.

Read more

Related Products

Tata Motors unveils facilities for development of Hydrogen propulsion tech

AUTO COMPONENTS & ACCESSORIES

Tata Motors, India?s largest automobile company, unveiled two state-of-the-art & new-age R&D facilities for meeting its mission of offering sustainable mobility solutions. The unveilings constitute of Read more

Request a Quote

Tata Motors plans petrol powertrain for Harrier and Safari SUVs

AUTO COMPONENTS & ACCESSORIES

Tata Motors is in the process of developing a new petrol powertrain for its premium sports utility vehicles, the Harrier and Safari, as confirmed by a senior company official. Currently, these models Read more

Request a Quote

Electric Vehicle Charger

AUTO COMPONENTS & ACCESSORIES

RRT Electro is engaged in manufacturing of customized Power Electronic Products over two decades having capability to Design, Develop, Prototyping, Regulatory Compliance testing & Certification, Manuf Read more

Request a Quote

Hi There!

Now get regular updates from IPF Magazine on WhatsApp!

Click on link below, message us with a simple hi, and SAVE our number

You will have subscribed to our Industrial News on Whatsapp! Enjoy

+91 84228 74016