Find similar content

Source channel @githubtrending · Post #14958 · Jul 14

ART is a tool for training agents on real-world tasks with reinforcement learning, in particular the GRPO method. Its standout feature is RULER, which removes the need to hand-design a reward function: you describe the task, and a large language model scores how well the agent is doing. This makes building and improving agents faster, is meant to work for any task, and often performs as well as or better than hand-crafted rewards. ART installs with a single command, and training can run on your own computer or on cloud resources. https://github.com/OpenPipe/ART
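To make the GRPO idea concrete, here is a minimal sketch of the group-relative step: several attempts at the same task are scored, and each score is normalized against the group's mean and standard deviation. This is not ART's or RULER's API. The function name `group_relative_advantages` and the scores are hypothetical, and the scores merely stand in for whatever an LLM judge would return.

```python
from statistics import mean, pstdev


def group_relative_advantages(scores, eps=1e-8):
    """Normalize scores within one group of attempts at the same task.

    Attempts that beat the group average get a positive advantage,
    attempts below it get a negative one.
    """
    mu = mean(scores)
    sigma = pstdev(scores)
    # eps avoids division by zero when every attempt scores the same.
    return [(s - mu) / (sigma + eps) for s in scores]


# Hypothetical judge scores for four attempts at one task (assumed values).
scores = [0.2, 0.9, 0.5, 0.4]
advantages = group_relative_advantages(scores)
```

Because only the relative ordering within a group matters here, the judge's scores do not need to be calibrated across tasks, which is one reason an LLM-assigned score can plausibly replace a hand-written reward.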

Hashtags

#python #agent #agentic_ai #grpo #kimi_ai #llms #lora #qwen #qwen3 #reinforcement_learning #rl

Results

1 similar post found

Search: #tfdeploy

djangoproject

@djangoproject · Post #274 · 03/18/2017, 01:48 AM

https://github.com/riga/tfdeploy Google's TensorFlow framework is taking off big-time now that it's at a full 1.0 release. One common question about it: How can I make use of the models I train in TensorFlow without using TensorFlow itself? #Tfdeploy is a partial answer to that question. It exports a trained TensorFlow model to "a simple #NumPy-based callable," meaning the model can be used in Python with Tfdeploy and the NumPy math-and-stats library as the only dependencies. Most of the operations you can perform in TensorFlow can also be performed in Tfdeploy, and you can extend the behaviors of the library by way of standard Python metaphors (such as overloading a class). Now the bad news: Tfdeploy doesn't support GPU acceleration, if only because NumPy doesn't do that. Tfdeploy's creator suggests using the gNumPy project as a possible replacement. #Machine_learning
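To illustrate what a "NumPy-based callable" means in practice, here is a rough sketch of a model that holds only its trained weights and runs inference with NumPy alone. It does not use Tfdeploy's actual API. The class `DenseModel` and the weight values are hypothetical stand-ins for an exported single-layer classifier.

```python
import numpy as np


class DenseModel:
    """Hypothetical exported model: stored weights plus a NumPy forward pass."""

    def __init__(self, weights, bias):
        self.weights = np.asarray(weights, dtype=np.float64)
        self.bias = np.asarray(bias, dtype=np.float64)

    def __call__(self, x):
        # Linear layer: one row of x per input example.
        logits = np.asarray(x, dtype=np.float64) @ self.weights + self.bias
        # Softmax, shifted by the row maximum for numerical stability.
        shifted = logits - logits.max(axis=-1, keepdims=True)
        exp = np.exp(shifted)
        return exp / exp.sum(axis=-1, keepdims=True)


# Assumed weights for a 2-input, 2-class layer (illustrative values only).
model = DenseModel(weights=[[1.0, -1.0], [0.5, 0.5]], bias=[0.0, 0.1])
probs = model([[2.0, 1.0]])
```

Everything above runs on the CPU, which matches the limitation noted in the post: a NumPy-only forward pass has no GPU path.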

Hashtags

#tfdeploy #numpy #machine_learning