TGTGInsighttelegram intelligenceLIVE / telegram public index
← GitHub Trends

TGINSIGHT SIMILAR POSTS

Find similar content

Source channel @githubtrending · Post #15445 · Jan 28

#python#agentic_ai#agents#ai#ai_agents#realtime#stt#tts#video_agents#video_ai#vision_ai#voice_ai Vision Agents is an open-source Python framework by Stream to build real-time AI agents that watch video, listen to audio, and respond instantly with low latency under 30ms. It integrates YOLO, Roboflow, OpenAI, Gemini, and 25+ tools for apps like golf coaching, security cameras detecting theft, or phone assistants. Install easily with `uv add vision-agents`, use free Stream credits, and deploy on any video network. You benefit by quickly creating smart video AI for gaming, safety, or coaching without vendor lock-in, saving time and costs on custom builds. https://github.com/GetStream/Vision-Agents

Results

1 similar post found

Search: #sailfish

当前筛选 #sailfish清除筛选
Libreware

@libreware · Post #1579 · 04/19/2026, 09:15 PM

Speech Note #Linux desktop and #Sailfish OS app for note taking, reading and translating with offline #Speech to Text #stt, Text to Speech #tts and Machine #Translation https://github.com/mkiol/dsnote MPL-2.0 license https://github.com/mkiol/dsnote#how-to-install Speech Note let you take, read and translate notes in multiple languages. It uses Speech to Text, Text to Speech and Machine Translation to do so. Text and voice processing take place entirely offline, locally on your computer, without using a network connection. Your privacy is always respected. No data is sent to the Internet. Speech Note uses many different processing engines to do its job. Currently these are used: Speech to Text (STT) Coqui STT (a fork of Mozilla DeepSpeech) Vosk whisper.cpp Faster Whisper april-asr Text to Speech (TTS) espeak-ng MBROLA Piper RHVoice Coqui TTS Mimic 3 WhisperSpeech Kokoro Parler-TTS F5-TTS S.A.M. Machine Translation (MT) Bergamot Translator