@dumbledorerambling · Post #4605 · 02/25/2026, 07:05 PM
#read
Hashtags
TGINSIGHT SIMILAR POSTS
Source channel @githubtrending · Post #14958 · Jul 14
#python#agent#agentic_ai#grpo#kimi_ai#llms#lora#qwen#qwen3#reinforcement_learning#rl ART is a tool that helps you train smart agents for real-world tasks using reinforcement learning, especially with the GRPO method. The standout feature is RULER, which lets you skip the hard work of designing reward functions by using a large language model to automatically score how well your agent is doing—just describe your task, and RULER takes care of the rest. This makes building and improving agents much faster and easier, works for any task, and often performs as well as or better than hand-crafted rewards. You can install ART with a simple command and start training agents right away, even on your own computer or with cloud resources. https://github.com/OpenPipe/ART
Search: #read
@dumbledorerambling · Post #4605 · 02/25/2026, 07:05 PM
#read
Hashtags
@dumbledorerambling · Post #4604 · 02/25/2026, 04:08 PM
哈哈哈哈哈哈哈 #read
Hashtags
@dumbledorerambling · Post #4603 · 02/24/2026, 03:03 PM
#read
Hashtags
@dumbledorerambling · Post #4592 · 02/12/2026, 03:18 PM
爆笑🤣以及又有一些新角度~ #read
Hashtags
@dumbledorerambling · Post #4590 · 02/11/2026, 08:53 PM
哇从来没有从这个角度思考建筑的重要性~当然怎样的社会结构也就会催生出怎样的建筑形式 #read
Hashtags
@dumbledorerambling · Post #4449 · 10/24/2025, 04:19 PM
☝️跟上面这歌词 对上了?🤣 #read
Hashtags
@dumbledorerambling · Post #4447 · 10/23/2025, 08:35 PM
#read
Hashtags
@dumbledorerambling · Post #4441 · 10/21/2025, 09:17 PM
写得好精彩~两条旋律交织,“危险”逐步逼近hhh #read
Hashtags
@dumbledorerambling · Post #4434 · 10/19/2025, 12:44 PM
大脑的脑补能力既是希望的来源也是痛苦的来源 (btw感觉小时候看这本书完全没看懂啊 #read
Hashtags
@dumbledorerambling · Post #4426 · 10/17/2025, 05:02 PM
(香油看的书算不算 #read
Hashtags
@dumbledorerambling · Post #4402 · 10/06/2025, 07:17 PM
哈哈哈哈炒肉丝·炒干丝 #read
Hashtags
@dumbledorerambling · Post #4337 · 06/12/2025, 02:13 AM
#read
Hashtags