Latent motion tokens as a bridging language for learning robot manipulation from videos. Accepted at ICCV 2025 as Oral.

Paper

arXiv: 2412.04445

Venue: ICCV 2025

embodiedvideoresearch