TGINSIGHT CHAT
Libreware
@libreware
TechnologiesLibreware Software Library 📡t.me/Libreware ★ Send us your suggestions and menaces here: https://t.me/joinchat/nMOOE4YJPDFhZjZk
Recent posts
Tag: #stt · 5 posts
Posted Apr 19
Speech Note #Linux desktop and #Sailfish OS app for note taking, reading and translating with offline #Speech to Text #stt, Text to Speech #tts and Machine #Translation https://github.com/mkiol/dsnote MPL-2.0 license https://github.com/mkiol/dsnote#how-to-install Speech Note let you take, read and translate notes in multiple languages. It uses Speech to Text, Text to Speech and Machine Translation to do so. Text and voice processing take place entirely offline, locally on your computer, without using a network connection. Your privacy is always respected. No data is sent to the Internet. Speech Note uses many different processing engines to do its job. Currently these are used: Speech to Text (STT) Coqui STT (a fork of Mozilla DeepSpeech) Vosk whisper.cpp Faster Whisper april-asr Text to Speech (TTS) espeak-ng MBROLA Piper RHVoice Coqui TTS Mimic 3 WhisperSpeech Kokoro Parler-TTS F5-TTS S.A.M. Machine Translation (MT) Bergamot Translator
Posted Apr 19
Speed of Sound #Voice#typing for the #Linux desktop: Features Offline, on-device transcription powered by Whisper, Parakeet, Canary, and more. No data leaves your machine. Multiple activation options: click the in-app button or use a global keyboard shortcut. Types the result directly into any focused application using Portals for wide desktop support (X11, Wayland). Multi-language support with switchable primary and secondary languages on the fly. Works out of the box with a built-in multilingual Whisper model. Download additional models from within the app to improve accuracy and language coverage. Optional text polishing with LLMs (Anthropic, Google, OpenAI), with support for a custom context and vocabulary. Supports self-hosted services like vLLM, Ollama, and llama.cpp (cloud services supported but not required). Getting Started The easiest and recommended way to install Speed of Sound is from Flathub or from the Snap Store. Alternatively, AppImage, Deb, and RPM packages are also available from the releases page. For initial configuration, troubleshooting, and other resources, visit speedofsound.io #stt
Posted Sep 19
Dicio assistant Dicio is a free and open source#voice#assistant running on #Android. It supports many different skills and input/output methods, and it provides both speech and graphical feedback to a question. It interprets user input and (when possible) generates user output entirely on-device, providing privacy by design. It has multilanguage support, and is currently available in these languages: Czech, English, French, German, Greek, Italian, Polish, Russian, Slovenian, Spanish, Swedish and Ukrainian. Open to contributions :-D https://github.com/Stypox/dicio-android Download https://f-droid.org/packages/org.stypox.dicio https://github.com/Stypox/dicio-android/releases https://play.google.com/store/apps/details?id=org.stypox.dicio Skills Currently Dicio answers questions about: search: looks up information on DuckDuckGo (and in the future more engines) - Search for Dicio weather: collects weather information from OpenWeatherMap - What's the weather like? lyrics: shows Genius lyrics for songs - What's the song that goes we will we will rock you? open: opens an app on your device - Open NewPipe calculator: evaluates basic calculations - What is four thousand and two times three minus a million divided by three hundred? telephone: view and call contacts - Call Tom timer: set, query and cancel timers - Set a timer for five minutes current time: query current time - What time is it? navigation: opens the navigation app at the requested position - Take me to New York, fifteenth avenue media: play, pause, previous, next song Speech to text Dicio uses Vosk as its speech to text (#STT) engine. In order to be able to run on every phone small models are employed, weighing ~50MB. The download from here starts automatically whenever needed, so the app language can be changed seamlessly.
Hashtags
Posted Aug 7
WhisperTux Simple #voice#dictation application for #Linux. Uses whisper.cpp for offline speech-to-text transcription. No fancy GPUs are required although whisper.cpp is capable of using them if available. Once your speech is transcribed, it is sent to a ydotool daemon that will write the text into the focused application. Features Local speech-to-text processing via whisper.cpp (no cloud dependencies) No expensive hardware required (works well on a plain x86 laptop with AVX instructions) Global keyboard shortcuts for system-wide operation Automatic text injection into focused applications Configurable whisper models and shortcuts https://github.com/cjams/whispertux #assistant#speech#stt
Posted Oct 6
#Linux Desktop application that provides live #captioning FUTO Fellowship program interview; linux captions software 👉 Live Captions github: https://github.com/abb128/LiveCaptions 🔵 Q&A w/ billionaire alt-tech investor/philanthropist Eron Wolf https://www.youtube.com/watch?v=OJPmbcU-Vzo 🔵 FUTO Fellows program: https://futo.org/fellows/ 🔵 FUTO Youtube channel - @futotech ⚠️ Google's breaches of privacy have gone TOO FAR! https://www.youtube.com/watch?v=_vWAF13KigI #speech#recognition#stt#voice