Content
Hugging Face (Twitter) RT @kwindla: Smart Turn v3.1.

Smart Turn is a completely open source, open data, open training code turn detection model for voice AI, trained on audio data across 23 languages. The model operates on the input audio in a voice agent pipeline. Each time the user pauses briefly, this model runs and returns a binary decision about whether the user has finished speaking or not.

The 3.1 release has two big improvements:

1. New data sets for English and Spanish, collected and labeled by contributors Liva AI, Midcentury, and MundoAI. The majority of the training data for the Smart Turn model is synthetically generated. Using synthetic data makes it possible to scale up training for a model like this. We've done a lot of work on the synthetic data pipeline to emulate as much of the natural variability of human speech as possible. But accurately labeled human data is very valuable and has a measurable impact on model quality. The 3.1 training run...
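
The pause-triggered decision described in the post can be sketched roughly as follows. This is only an illustration of the control flow, not Smart Turn's actual API: `TurnDetector`, `toy_model`, the 0.5 threshold, and the buffer handling are all hypothetical, and the stand-in model uses a trivial length rule where the real model would analyze the audio itself.

```python
from dataclasses import dataclass, field
from typing import Callable, List

# Hypothetical model interface: buffered audio samples in,
# probability that the user has finished speaking out.
TurnModel = Callable[[List[float]], float]


def toy_model(audio: List[float]) -> float:
    """Placeholder scorer (assumption): NOT how a real turn detection model works."""
    return 0.9 if len(audio) >= 4 else 0.1


@dataclass
class TurnDetector:
    model: TurnModel
    threshold: float = 0.5  # assumed cutoff for the binary decision
    buffer: List[float] = field(default_factory=list)

    def add_audio(self, samples: List[float]) -> None:
        """Accumulate input audio for the current user turn."""
        self.buffer.extend(samples)

    def on_pause(self) -> bool:
        """Run the model when the user pauses briefly.

        Returns True if the turn is judged complete, False if the
        agent should keep listening.
        """
        complete = self.model(self.buffer) >= self.threshold
        if complete:
            # Start a fresh buffer for the next user turn.
            self.buffer.clear()
        return complete


if __name__ == "__main__":
    detector = TurnDetector(model=toy_model)
    detector.add_audio([0.1, 0.2])
    print(detector.on_pause())  # short buffer: toy model says "not finished"
    detector.add_audio([0.3, 0.4])
    print(detector.on_pause())  # longer buffer: toy model says "finished"
```

In a voice agent pipeline, a `True` result would be the point at which the agent starts responding; a `False` result means it keeps listening through the pause.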