#python#agent#llm#llm_agent#llm_reasoning#machine_learning_systems#mlsys#reinforcement_learning#rl
AReaL is a free, open-source system for fast asynchronous reinforcement learning to train large AI models in math, coding, search, and agents. It decouples generation and training for up to 2.77x speedup, stable performance, and easy setup on single or 1000+ GPUs with algorithms like GRPO/PPO. Install via git/pip, run examples like GSM8K math instantly. You benefit by building top AI agents affordably and quickly, reproducing results with shared data/models, saving time/money vs. slow synchronous tools.
https://github.com/inclusionAI/AReaL
#tvOS#TestFlight
Surge 5 5.100.0 (3433) is ready to test on tvOS.
What to Test:
- 修正在特定网络情况下,Surge Ponte 转发某一个请求有低概率失败的问题
Official Channel: @SurgeTestFlightFeed
#tvOS#TestFlight
Surge 5 5.100.0 (3426) is ready to test on tvOS.
What to Test:
- 同步 iOS 版本更新
- 优化 Ponte Server 的重试机制,现在在偶发出现 NAT 类型失败后,会定期自动重试尝试恢复了
Official Channel: @SurgeTestFlightFeed