Find similar content

Source channel @githubtrending · Post #14958 · Jul 14

ART is a tool for training agents on real-world tasks with reinforcement learning, specifically the GRPO method. Its standout feature is RULER, which removes the need to hand-design reward functions: you describe the task, and a large language model automatically scores how well the agent is doing. This makes building and improving agents faster, applies to any task, and often performs as well as or better than hand-crafted rewards. ART installs with a single command, and training can run on your own computer or on cloud resources. https://github.com/OpenPipe/ART

Hashtags

#python #agent #agentic_ai #grpo #kimi_ai #llms #lora #qwen #qwen3 #reinforcement_learning #rl

Results

1 similar post found

Search: #20b


科技&趣闻&杂记 (Tech, Fun Facts & Miscellany)

@kejiqu · Post #3730 · 08/27/2025, 02:50 AM


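World's first AI ransomware appears: it abuses OpenAI's open-source model to search, steal, and encrypt files across platforms

An ESET report identifies ransomware named PromptLock as the first known case of ransomware that uses AI to generate its malicious code. PromptLock calls the gpt-oss:20b model to generate malicious Lua code locally on the infected device, then uses that code to search, exfiltrate, and encrypt files on Windows, Linux, and macOS. The model can run locally on a device with 16 GB of VRAM and may be freely modified and used. Attackers can also set up an internal proxy or tunnel that connects the victim network to a model running on an external server, which removes the need for local VRAM. Experts believe PromptLock may be a proof of concept, but see it as an early sign of threat actors making use of local or private AI. OpenAI says it has taken measures to reduce the risk of the model being used maliciously. Source: IT之家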

Hashtags

#promptlock #ai #勒索软件 #gpt #oss #20b