#python#agentic_ai#agents#ai#ai_agents#realtime#stt#tts#video_agents#video_ai#vision_ai#voice_ai
Vision Agents is an open-source Python framework by Stream to build real-time AI agents that watch video, listen to audio, and respond instantly with low latency under 30ms. It integrates YOLO, Roboflow, OpenAI, Gemini, and 25+ tools for apps like golf coaching, security cameras detecting theft, or phone assistants. Install easily with `uv add vision-agents`, use free Stream credits, and deploy on any video network. You benefit by quickly creating smart video AI for gaming, safety, or coaching without vendor lock-in, saving time and costs on custom builds.
https://github.com/GetStream/Vision-Agents
kitty
The fast, feature-rich, GPU based #terminal emulator
Uses GPU and SIMD vector CPU instructions for best in class
Uses threaded rendering for absolutely minimal latency
Performance tradeoffs can be tuned
Capable Scriptable Composable Cross-platform Innovative
To get started see Quickstart.
https://sw.kovidgoyal.net/kitty/
With software, such as "Kitty", it is possible to read documents, images and other graphical formats with these software:
https://github.com/dsanson/termpdf.py
http://www.kraxel.org/blog/linux/fbida/
https://github.com/itsjunetime/tdf
It means, that we no longer need GTK+ (which dropped support for X11), Qt and other graphical toolkits in order to have a full operational computer.
qrcp, transfer files over wifi from your computer to your mobile device by scanning a QR code without leaving the terminal.
#tools#terminal
@thedevs
https://kutt.it/fycGgm
eDEX-UI, a science fiction terminal emulator designed for large touchscreens that runs on all major OSs.
#tools#terminal
@thedevs
https://kutt.it/IaR1au