TGInsight · telegram intelligence · LIVE / telegram public index

TGINSIGHT CHAT

GitHub Trends

@githubtrending

Technologies

See what the GitHub community is most excited about today. A bot automatically fetches new repositories from https://github.com/trending and sends them to the channel. Author and maintainer: https://github.com/katursis

Subscribers: approx. 10,000 (current channel subscribers)
Tracked posts: 1,000 (indexed post count)
Recent reach: 1,515 (sum of recent post views)
Recent posts


Tag: #mllm · 3 posts


Posted Feb 28

#python #agent #android #app #automation #copilot #gui #mllm #mobile #mobile_agents #multimodal #multimodal_agent #multimodal_large_language_models Mobile-Agent-v3.5 is a GUI agent family from Alibaba built on GUI-Owl 1.5 models (sizes from 2B to 235B). It automates desktop, mobile, and browser tasks such as stock checks, bookings, and document creation, using planning, reflection, and memory. Free online demos are available on ModelScope and Bailian, and limited-time APIs require no setup. The project reports leading results on 20+ benchmarks and targets repetitive, multi-step operations across devices. https://github.com/X-PLUG/MobileAgent

558 views

Posted Aug 15

#python #mllm #point_clouds #scene_understanding #spatial_intelligence SpatialLM is a 3D language model that turns point cloud data from videos, RGBD images, or LiDAR into structured 3D scene layouts with labeled walls, doors, windows, and objects. It does not require special equipment and can detect user-specified object categories. This makes it useful for indoor scene analysis in robotics, navigation, and 3D design. You can run it on your own data, visualize the results, and customize the detection tasks. https://github.com/manycore-research/SpatialLM

418 views

Posted Apr 27

#python #agent_computer_interface #ai_agents #computer_automation #computer_use #grounding #gui_agents #in_context_reinforcement_learning #memory #mllm #planning #retrieval_augmented_generation Agent S2 is an AI agent that handles computer tasks by breaking them into smaller steps and using a specialized tool for each step, which lets it adapt across systems such as Windows and Android. According to the project, it outperforms other AI tools on complex tasks, learns from experience, and adjusts its plans as needed, helping users automate digital work more reliably. https://github.com/simular-ai/Agent-S

539 views