InternVL 2.5 | Lab Index

First open-source multimodal LLM to surpass 70% on MMMU. Investigates scaling of vision encoders, language models, datasets, and test-time Chain-of-Thought reasoning. Also retroactively documents InternVL 2.0.
