자율주행 연구노트

To the Next Clever Move

Motion Planning · VLA · World Model — 자율주행 연구 노트

To the Next Clever Move

imitation learning 2

[논문 리뷰] RT-H: Action Hierarchies Using Language

VLA는 바로 Action을 출력해야 할까, 아니면 중간에 Language Motion이 필요할까?본 포스팅은 RT-H: Action Hierarchies Using Language 논문을 읽고 정리한 글입니다.RT-1은 이미지와 자연어 명령을 입력받아 robot action을 출력하는 language-conditioned robot policy를 제안했고, RT-2는 Vision-Language Model, VLM을 robot action까지 출력하는 Vision-Language-Action, VLA 모델로 확장했습니다.그런데 RT-1과 RT-2를 읽고 나면 자연스럽게 이런 질문이 생깁니다.고수준 task instruction에서 바로 low-level robot action을 예측하는 것이 정말 좋은..

논문 리뷰 ( Paper Review)/[VLA] Vision Language Action 2026.05.28

[논문 리뷰] RT-1: Robotics Transformer for Real-World Control at Scale

RT-2 이전, Robot Policy Transformer는 어떻게 만들어졌는가?본 포스팅은 Google Research의 RT-1: Robotics Transformer for Real-World Control at Scale 논문을 읽고 정리한 글입니다.RT-1은 RT-2 이전에 발표된 robotics transformer 계열 논문으로, 이미지와 자연어 명령을 입력받아 실제 로봇 action을 출력하는 multi-task robot policy를 제안합니다. 이 글에서는 RT-1의 데이터 구축 방식, 모델 구조, action representation, 실험 결과, 한계, 그리고 자율주행 Motion Planning 관점에서의 의미를 정리합니다.Paper LinksTypeLinkPaperarXiv..

논문 리뷰 ( Paper Review)/[VLA] Vision Language Action 2026.05.25

자율주행 연구노트

Motion Planning을 중심으로 자율주행, VLA, World Model, RL, Research Engineering을 공부하고 구현하며 기록하는 연구 노트입니다.

HierarchicalPlanning, vlm, Robotics, RT-X, VLA, 자율주행, DualSystem, autonomous driving, OpenVLA, Vision-Language-Action, RT-1, 논문리뷰, DriveLM, imitation learning, CoRL2024, world model, RT-2, GPT-Driver, Motion Planning, Robot Learning,

일	월	화	수	목	금	토
	1	2	3	4	5	6
7	8	9	10	11	12	13
14	15	16	17	18	19	20
21	22	23	24	25	26	27
28	29	30

imitation learning 2

티스토리툴바