
NVIDIA open-weight multilingual streaming ASR: 600M, 40 locales from one model. Cache-aware encoder recomputes nothing; 17x more concurrent streams vs Parakeet RNNT 1.1B at 80ms. Native punctuation + capitalization, auto language detect. 80ms to 1.12s latency. NeMo. OpenMDW-1.1.
9Jun 6, 2026, 7:10 PM