Find similar content

Source channel @githubtrending · Post #15539 · Mar 5

AReaL is a free, open-source system for fast, asynchronous reinforcement learning, used to train large AI models for math, coding, search, and agent tasks. It decouples generation from training for up to a 2.77x speedup with stable performance, and it runs on anything from a single GPU to 1000+ GPUs with algorithms such as GRPO and PPO. Install it via git/pip and run examples such as GSM8K math. Shared data and models make results reproducible, and the asynchronous design saves time and cost compared with slower synchronous tools. https://github.com/inclusionAI/AReaL

Hashtags

#python #agent #llm #llm_agent #llm_reasoning #machine_learning_systems #mlsys #reinforcement_learning #rl

Results

1 similar post found

Search: #localllama

是芙莉莲

@ireallyhatetheworld · Post #1459 · 03/16/2026, 01:23 PM

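Qwen3.5-9B-Claude-4.6-Opus-Uncensored-Distilled-GGUF: a lightweight creative and reasoning model for local deployment
🔞 Suitable for local NSFW use, among other scenarios
• Based on Qwen 3.5 9B and incorporating a Claude Opus 4.6 distillation approach, it targets stronger creative expression, conversational quality, and role-play
• Available as GGUF, including a low-VRAM-friendly Q4_K_M quantized version; the author reports about 38 tok/s on an RTX 3060 12 GB, which suits local chat, game NPCs, and home lab deployment
• Thinking is disabled by default to improve the general chat experience and can be enabled manually in LM Studio when needed; the model is released under the Apache 2.0 license, which eases community testing and downstream integration
https://www.reddit.com/r/LocalLLaMA/comments/1runlpf/qwen359bclaude46opusuncensoreddistilledgguf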

Hashtags

#ai #uncensored #本地大模型 #模型蒸馏 #gguf #qwen #claude #lmstudio #量化模型 #低显存部署 #角色扮演 #localllama