Xiaomi MiMo-VL-7B | New Vision Language AI Model
Xiaomi has released MiMo-VL-7B, a powerful open-source vision-language AI model with only seven billion parameters that outperforms much larger models like Claude Sonnet and Qwen72B.
2025-06-07 02:00:00 - AI Revolution
It can analyze images, videos, and text with high accuracy, handle complex reasoning, and run on consumer-level hardware.
Trained on over two trillion tokens and refined with reinforcement learning, MiMo-VL-7B marks a major leap in efficient, multimodal AI technology from China.
https://github.com/XiaomiMiMo/MiMo-VL
https://huggingface.co/XiaomiMiMo