TGTGInsighttelegram intelligenceLIVE / telegram public index
← GitHub Trends

TGINSIGHT SIMILAR POSTS

Find similar content

Source channel @githubtrending · Post #15099 · Aug 27

#python#audiobooks#epub#kokoro#python#tts You can easily turn e-books in .epub format into high-quality audiobooks using Audiblez, a free tool that uses Kokoro's natural-sounding text-to-speech voices in many languages. It works on Windows, Mac, and Linux, with options for command line or a simple graphical interface. You can choose different voices, adjust reading speed, and even pick specific chapters to convert. Using a GPU speeds up the process significantly. The final audiobook is saved as an .m4b file, playable on most audiobook apps. This saves you time and money compared to hiring narrators and lets you listen to books hands-free anywhere. https://github.com/santinic/audiblez

Results

6 similar posts found

Search: #recognition

当前筛选 #recognition清除筛选
Libreware

@libreware · Post #1084 · 05/04/2022, 09:32 AM

Vosk Speech Recognition Toolkit Vosk is an offline open source #speech#recognition toolkit. It enables speech recognition for 20+ languages and dialects - English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino, Ukrainian, Kazakh, Swedish, Japanese, Esperanto, Hindi, Czech. More to come. Vosk models are small (50 Mb) but provide continuous large vocabulary transcription, zero-latency response with streaming API, reconfigurable vocabulary and speaker identification. Speech recognition bindings implemented for various programming languages like Python, Java, Node.JS, C#, C++ and others. Vosk supplies speech recognition for chatbots, smart home appliances, virtual assistants. It can also create subtitles for movies, transcription for lectures and interviews. Vosk scales from small devices like Raspberry Pi or Android smartphone to big clusters. https://t.me/speech_recognition https://alphacephei.com/vosk https://github.com/alphacep/vosk-api

Libreware

@libreware · Post #1021 · 01/09/2022, 02:56 PM

SongRec An open-source Shazam client for Linux, written in Rust. Features: • Recognize audio from an audio file. • Recognize audio from the microphone. • Usage from both GUI and command line. • Provide an history of the recognized songs. • Continuous song detection. • Ability to recognize songs from your speakers rather than your microphone. Download: https://github.com/marin-m/SongRec#installation https://github.com/marin-m/SongRec @foss_desktop #music#shazam#recognition

Libreware

@libreware · Post #1192 · 10/06/2023, 11:18 AM

#Linux Desktop application that provides live #captioning FUTO Fellowship program interview; linux captions software 👉 Live Captions github: https://github.com/abb128/LiveCaptions 🔵 Q&A w/ billionaire alt-tech investor/philanthropist Eron Wolf https://www.youtube.com/watch?v=OJPmbcU-Vzo 🔵 FUTO Fellows program: https://futo.org/fellows/ 🔵 FUTO Youtube channel - @futotech ⚠️ Google's breaches of privacy have gone TOO FAR! https://www.youtube.com/watch?v=_vWAF13KigI #speech#recognition#stt#voice

Libreware

@libreware · Post #1114 · 03/09/2023, 10:58 PM

https://writeout.ai #Transcribe and #translate any #audio file. 100% free to use. This website with source code available (it can be hosted locally) allows you to upload any audio file and receive a transcription and/or text translation. It uses OpenAI's Whisper API on the back end. Source on GitHub: https://github.com/beyondcode/writeout.ai #writeout#ai#speech#recognition