CharlesQ9 / Self-Evolving-AgentsLinks
☆343Updated 2 weeks ago
Alternatives and similar repositories for Self-Evolving-Agents
Users that are interested in Self-Evolving-Agents are comparing it to the libraries listed below
Sorting:
- ☆187Updated last week
- ☆311Updated 2 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆236Updated last week
- ☆271Updated 2 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆118Updated 6 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆232Updated 3 months ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆156Updated 2 months ago
- Awesome Agent Training☆210Updated 2 weeks ago
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆285Updated 2 weeks ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆122Updated 5 months ago
- ☆187Updated 2 months ago
- A version of verl to support tool use☆333Updated this week
- MiroThinker is open-source agentic models trained for deep research and complex tool use scenarios.☆220Updated this week
- The official code of “Agentic Reinforced Policy Optimization”, an agentic RL algorithm optimization.☆482Updated this week
- ☆329Updated last week
- ReasonFlux Series - A family of LLM post-training algorithms focusing on data selection, reinforcement learning, and inference scaling☆479Updated 2 weeks ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆245Updated 3 months ago
- ☆204Updated last week
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆558Updated 4 months ago
- ☆155Updated 7 months ago
- A Comprehensive Survey on Long Context Language Modeling☆172Updated last month
- ☆325Updated 3 weeks ago
- MiroMind-M1 is a fully open-source series of reasoning language models built on Qwen-2.5, focused on advancing mathematical reasoning.☆202Updated last week
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆188Updated 5 months ago
- 🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.☆228Updated 3 weeks ago
- ☆160Updated 3 months ago
- A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.☆244Updated 4 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆188Updated this week
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆81Updated 2 months ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆203Updated 4 months ago