#python #agent #agentic_ai #grpo #kimi_ai #llms #lora #qwen #qwen3 #reinforcement_learning #rl
ART is a framework for training agents on real-world tasks with reinforcement learning, in particular the GRPO method. Its standout feature is RULER, which removes the need to hand-design reward functions: you describe the task, and a large language model scores how well the agent did. This makes building and improving agents faster, is not tied to any particular task, and according to the project often performs as well as or better than hand-crafted rewards. ART installs with a single command, and training can run on your own computer or on cloud resources.
https://github.com/OpenPipe/ART
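To see why an LLM judge can stand in for a hand-written reward, it helps to look at how GRPO uses scores: in its common formulation, several attempts at the same task are compared against each other, so only their relative scores within the group matter. The sketch below shows that normalization step in plain Python. It is not ART's or RULER's API; the function name, the `eps` constant, and the example scores are all hypothetical and chosen only for illustration.

```python
from statistics import mean, pstdev


def group_relative_advantages(scores, eps=1e-8):
    """Rescale one group's scores to zero mean and roughly unit spread.

    `scores` holds one number per attempt at the same task, for example
    scores assigned by an LLM judge (hypothetical values here).
    """
    mu = mean(scores)
    sigma = pstdev(scores)  # population standard deviation of the group
    # eps avoids division by zero when every attempt gets the same score.
    return [(s - mu) / (sigma + eps) for s in scores]


# Hypothetical judge scores in [0, 1] for four attempts at one task.
judge_scores = [0.9, 0.4, 0.4, 0.1]
advantages = group_relative_advantages(judge_scores)

for score, adv in zip(judge_scores, advantages):
    print(f"score={score:.1f}  advantage={adv:+.2f}")
```

Attempts scored above the group average get a positive advantage and are reinforced, while those below it get a negative one. Because of this, the judge mainly needs to rank attempts consistently within a group rather than produce perfectly calibrated absolute scores.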
Choc Chip Banana Bread ✨🍌🤎
Ingredients:
* 3 large ripe bananas (360–380 g)
* 130 g unsalted butter (will reduce to \~115 g once browned)
* 180 g dark brown sugar (+ 1–2 tbsp for topping)
* 2 large eggs
* 250 g plain flour
* 1 tsp baking soda
* ½ tsp salt
* ½ tsp cinnamon
* 70–100 g dark chocolate, chopped
* Optional: chopped nuts
#dinner
@dishes
My Kind of Girl Dinner 😮‍💨🍚🥒
Ingredients:
For the salmon:
* 2 salmon fillets (skin removed, cut into cubes)
* 2 tbsp soy sauce
* 1 tbsp oyster sauce
* 1 tsp brown sugar
* 2 tsp minced garlic
* 1 tbsp sriracha
* ¼ tsp chili flakes
For the rice & toppings:
* 100 g sushi rice (cooked)
* 1 tsp apple cider vinegar
* ½ tsp salt
* ½ tsp sugar
* Pinch of chili flakes
* ¼ cucumber, thinly sliced
* ¼ small onion, thinly sliced
* Handful of edamame (cooked)
* Black & white sesame seeds, for topping
#dinner
@dishes