자율주행 연구노트

To the Next Clever Move

Motion Planning · VLA · World Model — 자율주행 연구 노트

To the Next Clever Move

HybridStack 1

[논문 리뷰] "DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models"

VLM은 scene understanding과 planning을 hybrid stack에 어떻게 넣는가?본 포스팅은 DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models 논문을 읽고 정리한 글입니다.DriveLM을 읽으면 VLM이 perception–prediction–planning reasoning을 Graph VQA로 구조화할 수 있다는 점을 봤다. GPT-Driver는 trajectory를 language modeling으로 바꾸는 방향을 보여줬다.DriveVLM은 그 사이에서 다른 질문을 던진다.VLM이 장면 이해(scene understanding) 와 계층적 planning 을 language CoT로..

논문 리뷰 ( Paper Review)/[VLA] Vision Language Action 2026.06.14

자율주행 연구노트

Motion Planning을 중심으로 자율주행, VLA, World Model, RL, Research Engineering을 공부하고 구현하며 기록하는 연구 노트입니다.

Motion Planning, vlm, VLA, 논문리뷰, imitation learning, Robot Learning, Robotics, RT-1, OpenVLA, HierarchicalPlanning, Vision-Language-Action, 자율주행, CoRL2024, DualSystem, world model, autonomous driving, RT-X, RT-2, GPT-Driver, DriveLM,

일	월	화	수	목	금	토
	1	2	3	4	5	6
7	8	9	10	11	12	13
14	15	16	17	18	19	20
21	22	23	24	25	26	27
28	29	30

HybridStack 1

티스토리툴바