TGINSIGHT SIMILAR POSTS

Find similar content

Source channel @githubtrending · Post #15421 · Jan 18

#python #audio #deeplearning #minicpm #pytorch #speech #speech_synthesis #text_to_speech #tts #tts_model #voice_cloning

VoxCPM is a free, open-source, tokenizer-free TTS tool that turns text into realistic speech. It produces expressive audio that matches the context of the text and can clone a voice from just 3-10 seconds of sample audio. Download VoxCPM1.5 (800M parameters) from Hugging Face, install it via pip, and use simple Python or CLI commands for fast synthesis (RTF 0.15 on an RTX 4090) or for fine-tuning on your own voices. This makes it easy to produce natural-sounding audiobooks, podcasts, voice clones, or apps with professional-quality sound, saving time and cost on voice work. https://github.com/OpenBMB/VoxCPM
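To put the quoted RTF figure in concrete terms: the real-time factor is conventionally the time spent synthesizing divided by the duration of the audio produced, so values below 1 mean faster than real time. The short sketch below is only arithmetic on that definition. The function names are hypothetical and are not part of VoxCPM.

```python
def real_time_factor(synthesis_seconds: float, audio_seconds: float) -> float:
    """RTF = time spent synthesizing / duration of the audio produced."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return synthesis_seconds / audio_seconds


def estimated_synthesis_seconds(audio_seconds: float, rtf: float = 0.15) -> float:
    """Estimate compute time for a clip, assuming the RTF quoted in the post."""
    return audio_seconds * rtf


# A one-hour audiobook chapter at RTF 0.15 would take roughly nine minutes.
one_hour_estimate = estimated_synthesis_seconds(3600)
```

Under this assumption, 10 seconds of speech would take about 1.5 seconds to generate on the quoted hardware; actual speed depends on the GPU, text length, and settings.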

Results

3 similar posts found

Search: #sounds

Interesting Planet 🌍

@interesting_planet_facts · Post #1053 · 11/19/2025, 06:11 PM

🌎 In 1982, the Soviet Venera 14 probe recorded mysterious low-frequency, thunder-like sounds on Venus. Scientists now attribute these to seismic activity or to wind interacting with the planet's dense atmosphere. Venus's surface winds move slowly, but the thick air carries sound much farther than on Earth. ✨ #Venus ⚡ #sounds ⚡ #space

djangoproject

@djangoproject · Post #255 · 02/02/2017, 06:57 PM

https://github.com/tyiannak/pyAudioAnalysis

#pyAudioAnalysis is a Python library covering a wide range of audio analysis tasks. Through pyAudioAnalysis you can:

- Extract #audio features and representations (e.g. MFCCs, spectrogram, chromagram)
- Classify unknown #sounds
- Train, parameter-tune, and evaluate classifiers of audio segments
- Detect audio events and exclude silence periods from long recordings
- Perform supervised segmentation (joint segmentation and classification)
- Perform unsupervised segmentation (e.g. speaker diarization)
- Extract audio thumbnails
- Train and use audio regression models (example application: emotion recognition)
- Apply dimensionality reduction to visualize audio data and content similarities
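As a rough illustration of what "exclude silence periods" means, the sketch below marks non-silent stretches of a signal with a plain short-term energy threshold. It uses only the standard library and does not call pyAudioAnalysis or reproduce its method. The function names, the 50 ms frame size, and the 0.01 threshold are all assumptions chosen for the example.

```python
import math


def frame_energies(samples, frame_len):
    """Mean squared amplitude of each non-overlapping frame."""
    return [
        sum(s * s for s in samples[i:i + frame_len]) / frame_len
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]


def non_silent_segments(samples, sample_rate, frame_seconds=0.05, threshold=0.01):
    """Return (start, end) times in seconds for runs of frames above the threshold."""
    frame_len = max(1, round(sample_rate * frame_seconds))
    energies = frame_energies(samples, frame_len)
    segments, start = [], None
    for idx, energy in enumerate(energies):
        t = idx * frame_len / sample_rate
        if energy >= threshold and start is None:
            start = t  # a loud run begins
        elif energy < threshold and start is not None:
            segments.append((start, t))  # the loud run ends
            start = None
    if start is not None:  # signal ended while still loud
        segments.append((start, len(energies) * frame_len / sample_rate))
    return segments


# Synthetic signal: 0.5 s of silence, 0.5 s of a 440 Hz tone, 0.5 s of silence.
rate = 8000
tone = [0.5 * math.sin(2 * math.pi * 440 * n / rate) for n in range(rate // 2)]
signal = [0.0] * (rate // 2) + tone + [0.0] * (rate // 2)
segments = non_silent_segments(signal, rate)
```

On real recordings a fixed threshold is fragile, since background noise levels vary, which is one reason a dedicated library is usually preferable to a hand-rolled detector like this one.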