bytedance / FTRL
Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments
☆32 Updated last month
Alternatives and similar repositories for FTRL
Users who are interested in FTRL are comparing it to the libraries listed below.
- ☆35 Updated last month
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation". ☆20 Updated last month
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent ☆64 Updated 4 months ago
- ☆30 Updated 2 months ago
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling ☆172 Updated last month
- ☆29 Updated 2 weeks ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories ☆37 Updated last month
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso… ☆125 Updated 5 months ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning". ☆81 Updated 3 months ago
- ☆22 Updated this week
- MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models ☆48 Updated last month
- Code for paper: Optimizing Length Compression in Large Reasoning Models ☆25 Updated 2 months ago
- The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond ☆164 Updated 2 months ago
- MPO: Boosting LLM Agents with Meta Plan Optimization (EMNLP 2025 Findings) ☆69 Updated 3 weeks ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions ☆22 Updated last month
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning ☆252 Updated last week
- The demo, code and data of FollowRAG ☆74 Updated 2 months ago
- Towards a Unified View of Large Language Model Post-Training ☆111 Updated last week
- This is the code repo for the paper "Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning". ☆25 Updated 3 weeks ago
- HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches ☆34 Updated last month
- Scaling Preference Data Curation via Human-AI Synergy ☆106 Updated 2 months ago
- ☆59 Updated 3 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search ☆56 Updated 2 months ago
- A comprehensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model… ☆56 Updated 3 months ago
- ☆127 Updated 2 weeks ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents ☆252 Updated 2 weeks ago
- ☆205 Updated last month
- A Comprehensive Library for Memory of LLM-based Agents. ☆74 Updated 4 months ago
- SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis ☆102 Updated 3 months ago
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression ☆96 Updated 5 months ago