Skip to content

Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning

License

Notifications You must be signed in to change notification settings

hustvl/AlphaDrive

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

🌌 AlphaDrive: Unleashing the Power of VLMs in Autonomous

Driving via Reinforcement Learning and Reasoning

Bo Jiang1, Shaoyu Chen1,2, Qian Zhang2, Wenyu Liu1, Xinggang Wang1,📧

1 Huazhong University of Science and Technology, 2 Horizon Robotics, 📧 corresponding author

arxiv paper License

vis.mp4

✨ Highlights

  • To the best of our knowledge, AlphaDrive is the first to integrate GRPO-based RL with planning reasoning to autonomous driving, significantly boosting both performance and training efficiency.

  • We are excited to discover that, following RL training, AlphaDrive exhibits some emergent multimodal planning capabilities, which is promising for improving driving safety and efficiency.

📋 News

[2025-3-11]: AlphaDrive arXiv paper released. Code are coming soon. Please stay tuned! ☕️

📊 Qualitative Results

❤️ Acknowledgements

This repo is built on open-r1 and R1-V. We sincerely thank the contributors for their great work!

📚 Citation

If you find AlphaDrive useful in your research or applications, please consider giving us a star 🌟 and citing it by the following BibTeX entry.

@article{jiang2025alphadrive,
      title={AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning}, 
      author={Bo Jiang and Shaoyu Chen and Qian Zhang and Wenyu Liu and Xinggang Wang},
      year={2025},
      eprint={2503.07608},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2503.07608}, 
}

🥰 Related Projects

Check out our other awesome projects:

VAD & VADv2: Vectorized Scene Representation for Efficient Autonomous Driving.

Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving.

DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving.

RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning.

MapTR: An End-to-End Framework for Online Vectorized HD Map Construction.

Releases

No releases published

Packages

No packages published