Libreware

@libreware

Libreware Software Library 📡t.me/Libreware ★ Send us your suggestions and menaces here: https://t.me/joinchat/nMOOE4YJPDFhZjZk

Subscribers: 6,880

Tracked posts: 590

Recent reach: 23,470 (sum of recent post views)

Recent posts

Tag: #stt · 5 posts

Posted Apr 19

Speech Note

#Linux desktop and #Sailfish OS app for note taking, reading and translating with offline #Speech to Text #stt, Text to Speech #tts and Machine #Translation

https://github.com/mkiol/dsnote
MPL-2.0 license
How to install: https://github.com/mkiol/dsnote#how-to-install

Speech Note lets you take, read and translate notes in multiple languages. It uses Speech to Text, Text to Speech and Machine Translation to do so. Text and voice processing take place entirely offline, locally on your computer, without using a network connection. Your privacy is always respected. No data is sent to the Internet.

Speech Note uses many different processing engines to do its job. Currently these are used:

Speech to Text (STT): Coqui STT (a fork of Mozilla DeepSpeech), Vosk, whisper.cpp, Faster Whisper, april-asr
Text to Speech (TTS): espeak-ng, MBROLA, Piper, RHVoice, Coqui TTS, Mimic 3, WhisperSpeech, Kokoro, Parler-TTS, F5-TTS, S.A.M.
Machine Translation (MT): Bergamot Translator

3,580 views

Hashtags

#linux #sailfish #speech #stt #tts #translation

Posted Apr 19

Speed of Sound

#Voice #typing for the #Linux desktop

Features

Offline, on-device transcription powered by Whisper, Parakeet, Canary, and more. No data leaves your machine.
Multiple activation options: click the in-app button or use a global keyboard shortcut.
Types the result directly into any focused application using Portals for wide desktop support (X11, Wayland).
Multi-language support with primary and secondary languages that can be switched on the fly.
Works out of the box with a built-in multilingual Whisper model.
Download additional models from within the app to improve accuracy and language coverage.
Optional text polishing with LLMs (Anthropic, Google, OpenAI), with support for a custom context and vocabulary.
Supports self-hosted services like vLLM, Ollama, and llama.cpp (cloud services supported but not required).

Getting Started

The easiest and recommended way to install Speed of Sound is from Flathub or from the Snap Store. Alternatively, AppImage, Deb, and RPM packages are also available from the releases page.

For initial configuration, troubleshooting, and other resources, visit speedofsound.io

#stt

3,610 views

Hashtags

#voice #typing #linux #stt

Posted Sep 19

Dicio assistant

Dicio is a free and open source #voice #assistant running on #Android. It supports many different skills and input/output methods, and it provides both speech and graphical feedback to a question. It interprets user input and (when possible) generates user output entirely on-device, providing privacy by design. It has multilanguage support, and is currently available in these languages: Czech, English, French, German, Greek, Italian, Polish, Russian, Slovenian, Spanish, Swedish and Ukrainian. Open to contributions :-D

https://github.com/Stypox/dicio-android

Download
https://f-droid.org/packages/org.stypox.dicio
https://github.com/Stypox/dicio-android/releases
https://play.google.com/store/apps/details?id=org.stypox.dicio

Skills

Currently Dicio answers questions about:

search: looks up information on DuckDuckGo (and in the future more engines) - "Search for Dicio"
weather: collects weather information from OpenWeatherMap - "What's the weather like?"
lyrics: shows Genius lyrics for songs - "What's the song that goes we will we will rock you?"
open: opens an app on your device - "Open NewPipe"
calculator: evaluates basic calculations - "What is four thousand and two times three minus a million divided by three hundred?"
telephone: view and call contacts - "Call Tom"
timer: set, query and cancel timers - "Set a timer for five minutes"
current time: query current time - "What time is it?"
navigation: opens the navigation app at the requested position - "Take me to New York, fifteenth avenue"
media: play, pause, previous, next song

Speech to text

Dicio uses Vosk as its speech to text (#STT) engine. To run on every phone, it employs small models weighing about 50 MB. The model download starts automatically whenever needed, so the app language can be changed seamlessly.

5,410 views

Hashtags

#voice #assistant #android #stt

Posted Aug 7

WhisperTux

Simple #voice #dictation application for #Linux. Uses whisper.cpp for offline speech-to-text transcription. No fancy GPUs are required, although whisper.cpp is capable of using them if available. Once your speech is transcribed, it is sent to a ydotool daemon that will write the text into the focused application.

Features

Local speech-to-text processing via whisper.cpp (no cloud dependencies)
No expensive hardware required (works well on a plain x86 laptop with AVX instructions)
Global keyboard shortcuts for system-wide operation
Automatic text injection into focused applications
Configurable whisper models and shortcuts

https://github.com/cjams/whispertux

#assistant #speech #stt
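
The transcribe-then-type flow described above can be pictured roughly as follows. This is only an illustrative sketch, not WhisperTux's actual code: the `whisper-cli` binary name, the model file name and the helper function names are assumptions made for the example, and the script only builds and prints the command lines rather than running them.

```python
import shlex
from typing import List


def build_transcribe_command(model_path: str, wav_path: str,
                             binary: str = "whisper-cli") -> List[str]:
    """Build a whisper.cpp command line (binary name is an assumption).

    -m selects the model, -f the input WAV file, -nt disables timestamps.
    """
    return [binary, "-m", model_path, "-f", wav_path, "-nt"]


def clean_transcript(raw: str) -> str:
    """Collapse newlines and repeated spaces in the transcript into one line."""
    return " ".join(raw.split())


def build_type_command(text: str) -> List[str]:
    """Build a ydotool command line that types text into the focused window."""
    return ["ydotool", "type", text]


if __name__ == "__main__":
    # Hypothetical transcript, standing in for the transcriber's stdout.
    raw = "  Hello from\n whisper.cpp  \n"
    print(shlex.join(build_transcribe_command("ggml-base.en.bin", "note.wav")))
    print(shlex.join(build_type_command(clean_transcript(raw))))
```

To actually run the pipeline, each list could be passed to `subprocess.run`, provided whisper.cpp is installed and the ydotool daemon is running.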

5,320 views

Hashtags

#voice #dictation #linux #assistant #speech #stt

Posted Oct 6

#Linux desktop application that provides live #captioning

FUTO Fellowship program interview; Linux captions software

👉 Live Captions GitHub: https://github.com/abb128/LiveCaptions

🔵 Q&A w/ billionaire alt-tech investor/philanthropist Eron Wolf
https://www.youtube.com/watch?v=OJPmbcU-Vzo

🔵 FUTO Fellows program: https://futo.org/fellows/

🔵 FUTO YouTube channel - @futotech

⚠️ Google's breaches of privacy have gone TOO FAR!
https://www.youtube.com/watch?v=_vWAF13KigI

#speech #recognition #stt #voice

5,550 views

Hashtags

#linux #captioning #speech #recognition #stt #voice