Most people know Xiaomi for phones and scooters. Not for breaking AI inference records. That changes today. Working with inference partner TileRT, Xiaomi has hit over 1,000 tokens per second on a ...
Xiaomi's MiMo-V2.5-Pro-UltraSpeed hits over 1,000 tokens per second on commodity GPUs, 15x faster than ChatGPT and Claude.