Post #2161

@huggingface

Hugging Face

Views: 25

Published: 7 Jan 2026, 01:06

Post content


Hugging Face (Twitter)

RT @NVIDIAAIDev: Just launched at #CES2026: the new open-source NVIDIA Nemotron Speech ASR model is here to solve latency drift and redundant compute. Its cache-aware streaming architecture eliminates the need for buffered inference, giving you stable, sub-100ms latency (24ms median T-T-F) and up to 3x more throughput on your GPU. 🤗 Read the technical blog with real-world results from @trydaily and @modal on @HuggingFace: nvda.ws/3Lt8m3Q
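
For readers wondering what "redundant compute" means here, a toy sketch may help. It only counts how many audio frames an encoder would process under two strategies: buffered inference, where each new chunk is re-run together with its left context, and cache-aware streaming, where earlier context is reused from a cache. The function names and the chunk and context sizes are hypothetical illustrations. They are not the Nemotron Speech ASR API, and the resulting ratio is not one of NVIDIA's measurements.

```python
def buffered_frame_computations(num_frames: int, chunk: int, left_context: int) -> int:
    """Frames processed when every chunk is re-encoded along with its left context."""
    total = 0
    for start in range(0, num_frames, chunk):
        end = min(start + chunk, num_frames)
        window_start = max(0, start - left_context)  # context frames are recomputed
        total += end - window_start
    return total


def cached_frame_computations(num_frames: int, chunk: int) -> int:
    """Frames processed when past context is read from a cache instead of recomputed."""
    total = 0
    for start in range(0, num_frames, chunk):
        end = min(start + chunk, num_frames)
        total += end - start  # only the new frames are encoded
    return total


# Hypothetical sizes chosen for illustration only.
NUM_FRAMES, CHUNK, LEFT_CONTEXT = 1000, 10, 70

buffered = buffered_frame_computations(NUM_FRAMES, CHUNK, LEFT_CONTEXT)
cached = cached_frame_computations(NUM_FRAMES, CHUNK)
print(f"buffered: {buffered} frames, cached: {cached} frames")
```

With these made-up sizes the buffered strategy touches each frame several times, while the cached one touches each frame once. That is the general idea behind the throughput claim in the post, though the real gain depends on the actual model and configuration.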