Customize Consent Preferences

We use cookies to help you navigate efficiently and perform certain functions. You will find detailed information about all cookies under each consent category below.

The cookies that are categorized as "Necessary" are stored on your browser as they are essential for enabling the basic functionalities of the site. ... 

Always Active

Necessary cookies are required to enable the basic features of this site, such as providing secure log-in or adjusting your consent preferences. These cookies do not store any personally identifiable data.

No cookies to display.

Functional cookies help perform certain functionalities like sharing the content of the website on social media platforms, collecting feedback, and other third-party features.

No cookies to display.

Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics such as the number of visitors, bounce rate, traffic source, etc.

No cookies to display.

Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.

No cookies to display.

Advertisement cookies are used to provide visitors with customized advertisements based on the pages you visited previously and to analyze the effectiveness of the ad campaigns.

No cookies to display.

上海的陆家嘴
0

Beijing, China – March 23, 2025 – Horizon Robotics, a leading provider of advanced driver-assistance systems (ADAS) and autonomous driving (AD) solutions, today announced AlphaDrive, a novel framework leveraging reinforcement learning and planning-inference for large language models (LLMs) in autonomous driving. This breakthrough aims to address the limitations of existing end-to-end models in handling complex, long-tail scenarios.

The development of AlphaDrive comes at a time when advancements in artificial intelligence are rapidly transforming various fields. Models like OpenAI’s o1 and DeepSeek’s R1 have demonstrated superhuman performance in mathematics and science, largely attributed to their sophisticated reinforcement learning training and inference techniques. While end-to-end models have significantly improved planning and control in autonomous driving, they often struggle with situations requiring common sense reasoning and long-term planning.

Previous attempts to integrate vision-language models (VLMs) into autonomous driving have primarily relied on pre-trained models fine-tuned with supervised learning on driving data. However, these approaches often lack targeted training strategies optimized for the ultimate goal of decision-making and planning.

To overcome these challenges, Horizon Robotics developed AlphaDrive, a reinforcement learning and planning-inference training framework specifically designed for VLMs in autonomous driving. The project is open-sourced and accessible on GitHub: https://github.com/hustvl/AlphaDrive. The corresponding research paper is available on arXiv: https://arxiv.org/abs/2503.07608.

Key Innovations of AlphaDrive:

  • GRPO Rewards: AlphaDrive introduces four novel reinforcement learning rewards tailored for planning, referred to as GRPO rewards. The specific details of these rewards are outlined in the research paper.
  • Two-Stage Training Strategy: The framework employs a two-stage training strategy based on supervised fine-tuning (SFT) and reinforcement learning (RL). This approach allows the model to first learn from human-labeled data and then refine its decision-making capabilities through interaction with the environment.

We believe AlphaDrive represents a significant step forward in the development of robust and reliable autonomous driving systems, said a spokesperson for Horizon Robotics. The emergent multi-modal planning capabilities exhibited by AlphaDrive during the reinforcement learning phase are reminiscent of the ‘Aha Moment’ observed in DeepSeek R1, further validating the power of reinforcement learning in complex reasoning tasks.

The introduction of AlphaDrive highlights the growing importance of reinforcement learning and advanced AI techniques in the pursuit of truly autonomous vehicles. By combining the strengths of VLMs with targeted reinforcement learning strategies, Horizon Robotics is paving the way for autonomous driving systems capable of navigating the complexities of the real world with greater safety and efficiency.

Looking Ahead:

The development of AlphaDrive opens up new avenues for research and development in autonomous driving. Future work will focus on further refining the GRPO rewards, exploring different reinforcement learning algorithms, and evaluating the performance of AlphaDrive in real-world driving scenarios. The open-source nature of the project encourages collaboration and innovation within the autonomous driving community, accelerating the development of safer and more reliable self-driving technologies.

References:


>>> Read more <<<

Views: 0

0

发表回复

您的邮箱地址不会被公开。 必填项已用 * 标注