GitHub Trends

@githubtrending

See what the GitHub community is most excited about today. A bot automatically fetches new repositories from https://github.com/trending and sends them to the channel. Author and maintainer: https://github.com/katursis

Subscribers: about 10,000 (current channel subscribers)

Tracked posts: 1,000 (indexed post count)

Recent reach: 1,757 (sum of recent post views)

Recent posts

Tag: #speech_recognition · 3 posts

Posted Jan 2

#python #deep_learning #inference #openai #quantization #speech_recognition #speech_to_text #transformer #whisper

Faster-Whisper is a reimplementation of OpenAI's Whisper that transcribes audio up to 4x faster at the same accuracy while using less memory, on CPU or GPU. In the project's GPU benchmark, a 13-minute audio file takes 1m03s versus 2m23s with the original Whisper. Install it with `pip install faster-whisper`; FFmpeg does not need to be installed on the system. In Python, `WhisperModel("large-v3").transcribe("audio.mp3")` returns the transcribed segments with start and end timestamps, along with information about the detected language. The speed and memory savings matter most for long files, batch jobs, and real-time applications. https://github.com/SYSTRAN/faster-whisper

746 views
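
As a rough illustration of the usage the Faster-Whisper post describes, the sketch below loads a model, transcribes a file, and prints each segment with its timestamps. The file name `audio.mp3` and the helper names `format_timestamp` and `transcribe` are hypothetical, and the CPU/int8 settings are an assumption chosen for portability rather than a recommendation from the project.

```python
def format_timestamp(seconds: float) -> str:
    """Render a time offset in seconds as HH:MM:SS.mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}.{ms:03d}"


def transcribe(path: str, model_size: str = "large-v3") -> list[str]:
    """Transcribe an audio file and return one formatted line per segment."""
    # Imported lazily so format_timestamp works without the package installed.
    from faster_whisper import WhisperModel

    # Assumption: CPU with int8 quantization. On a GPU, device="cuda" with
    # compute_type="float16" is a common alternative.
    model = WhisperModel(model_size, device="cpu", compute_type="int8")

    # transcribe() returns a lazy generator of segments plus an info object;
    # the audio is only decoded as the generator is consumed.
    segments, info = model.transcribe(path, beam_size=5)
    print(f"Detected language: {info.language} "
          f"(probability {info.language_probability:.2f})")

    lines = []
    for seg in segments:
        start = format_timestamp(seg.start)
        end = format_timestamp(seg.end)
        lines.append(f"[{start} -> {end}] {seg.text.strip()}")
    return lines
```

Calling `transcribe("audio.mp3")` downloads the model on first use, so the first run is slower than later ones. Because the segments come from a generator, lines can also be printed as they are produced instead of being collected into a list.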

Posted Jun 7

#jupyter_notebook #android #asr #deep_learning #deep_neural_networks #deepspeech #google_speech_to_text #ios #kaldi #offline #privacy #python #raspberry_pi #speaker_identification #speaker_verification #speech_recognition #speech_to_text #speech_to_text_android #stt #voice_recognition #vosk

Vosk is an offline speech recognition toolkit: it transcribes speech without an internet connection. It supports more than 20 languages and dialects, and it is small and efficient enough to run on devices such as smartphones and the Raspberry Pi. Typical uses include chatbots, smart home devices, and subtitle generation for videos. Because audio never leaves the device, recognition stays private and fast, which is especially helpful where internet access is limited. https://github.com/alphacep/vosk-api

470 views

Posted May 8

#python #apple_silicon #audio_processing #mlx #multimodal #speech_recognition #speech_synthesis #speech_to_text #text_to_speech #transformers

MLX-Audio is a library for text-to-speech and speech-to-speech generation. It is built for Apple Silicon (M-series chips), where it runs fast and efficiently. You can choose among different languages and voices and adjust the speech speed. It also includes a web interface with 3D audio visualization where you can play your own files. This makes it a good fit for audiobooks, interactive media, and personal projects that need high-quality audio quickly. https://github.com/Blaizzy/mlx-audio

541 views
