华南理工大学学报(自然科学版) ›› 2026, Vol. 54 ›› Issue (3): 31-51.doi: 10.12141/j.issn.1000-565X.250188

• 智慧交通系统 • 上一篇    下一篇

自动驾驶汽车交互场景学习型生成方法研究进展

熊璐, 冯浩杰, 张培志, 田梦杰, 张心睿   

  1. 同济大学 汽车与能源学院/新能源汽车工程中心,上海 201804
  • 收稿日期:2025-06-30 出版日期:2026-03-25 发布日期:2025-09-12
  • 通信作者: 张培志(1988—),男,博士,工程师,主要从事智能网联汽车测试、智能车辆控制研究。 E-mail:zhangpeizhi@tongji.edu.cn
  • 作者简介:熊璐(1978—),男,教授,博士生导师,主要从事新能源汽车底盘动力学控制、智能网联汽车测试研究。E-mail: xiong_lu@tongji.edu.cn
  • 基金资助:
    国家重点研发计划项目(2024YFB2505704)

Research Advances in Learning-Based Generative Methods for Interactive Scenarios of Autonomous Vehicles

XIONG Lu, FENG Haojie, ZHANG Peizhi, TIAN Mengjie, ZHANG Xinrui   

  1. College of Automotive and Energy/Clean Energy Automotive Engineering Centre,Tongji University,Shanghai 201804,China
  • Received:2025-06-30 Online:2026-03-25 Published:2025-09-12
  • Contact: 张培志(1988—),男,博士,工程师,主要从事智能网联汽车测试、智能车辆控制研究。 E-mail:zhangpeizhi@tongji.edu.cn
  • About author:熊璐(1978—),男,教授,博士生导师,主要从事新能源汽车底盘动力学控制、智能网联汽车测试研究。E-mail: xiong_lu@tongji.edu.cn
  • Supported by:
    the National Key R & D Program of China(2024YFB2505704)

摘要:

交互场景因其高维复杂性和高风险性,成为自动驾驶汽车测试与运行中的关键安全挑战之一。近年来,以数据生成、对抗生成、知识生成及大语言模型为代表的学习型场景生成方法凭借较高的真实性与覆盖广度,在提升交互场景生成质量与测试验证效率方面展现出显著优势。为系统梳理该领域的研究进展、核心技术与瓶颈问题,该文对自动驾驶交互场景的学习型生成方法进行全面综述,可为后续研究提供清晰的技术脉络与发展方向。首先,概述交互场景的基本特征,并对主流公开数据集进行对比分析,探讨其对学习型生成方法的支撑作用;同时,总结交互相关的标准规范、场景描述与衡量指标,分析其对交互行为表达能力的影响;其次,从传统学习方法与大语言模型方法两个方向出发,归纳各类方法的技术路线与生成效果;最后,结合现有成果,总结当前学习型生成方法在数据和方法等方面面临的挑战,展望未来的研究方向。研究发现,传统学习方法在真实性与多样性的平衡、安全关键场景的高效发现与生成方面存在固有挑战;大语言模型方法虽在场景语义理解和复杂逻辑构建上展现出卓越潜力,但普遍面临推理速度慢、信息虚构、生成内容与物理世界对齐困难以及决策过程“黑盒”特性导致的可解释性不足等瓶颈。此外,当前研究仍共同受制于现有数据集缺乏交互关键信息、场景严重性缺少统一衡量标准等基础性问题。该文认为,尽管已有方法在交互建模上取得了初步成果,但仍需进一步提升生成质量与泛化能力,以满足自动驾驶汽车实际需求。

关键词: 自动驾驶汽车, 交互场景, 交互数据集, 交互场景描述, 数据生成, 对抗生成, 知识生成, 大语言模型

Abstract:

Interactive traffic scenarios, characterized by high-dimensional complexity and inherent risk, constitute a critical safety challenge in the testing and operation of autonomous vehicles (AVs). In recent years, learning-based generative method—exemplified by data generation, adversarial generation knowledge-driven generation, and large language models (LLMs)—has demonstrated significant advantages in improving the quality of interactive scenario generation and the efficiency of testing and validation, owing to their superior realism and coverage breadth. To systematically chart the advances, core techniques, and bottleneck issues in this domain, this paper presents a comprehensive review of learning-based generative methods for autonomous vehicle interactive scenarios, aiming to offer a clear technical roadmap and development direction for subsequent research. First, the fundamental attributes of interactive scenarios are outlined, and mainstream public datasets are comparatively analyzed to discuss their supportive role for learning-based generative methods. Concurrently, relevant standards, scenario description frameworks, and evaluation metrics for interaction are summarized, with an analysis of their impact on the expressive capacity of intertive behaviors. Second, the technical approaches and generation outcomes of various methods are categorized along two main directions: traditional learning-based methods and large language model-based me-thods. Finally, building upon existing findings, the article summarizes the current challenges faced by learning-based generative methods, both in terms of data and methodology, and proposes future research directions. The study finds that traditional learning-based methods face inherent challenges in balancing realism and diversity, as well as in the efficient discovery and generation of safety-critical scenarios. While LLM-based approaches show exceptional potential in semantic understanding and constructing complex logical narratives, they commonly encounter bottlenecks such as slow inference speeds, information hallucination, difficulties in aligning generated content with physical world constraints, and a lack of interpretability due to the “black-box” nature of their decision processes. Moreover, current research is still constrained by fundamental issues, including the scarcity of interaction-critical information in existing datasets and the absence of a unified metric for quantifying scenario criticality. This article concludes that although existing methods have achieved preliminary success in interaction modeling, significant improvements in generation quality and generalization capability are still required to meet the practical demands of autonomous vehicle development.

Key words: autonomous vehicles, interactive scenario, interactive dataset, interactive scenario description, data generation, adversarial generation, knowledge generation, large language model

中图分类号: