bytedance / FTRLLinks
Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments
☆41Updated last month
Alternatives and similar repositories for FTRL
Users that are interested in FTRL are comparing it to the libraries listed below
Sorting:
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆127Updated 7 months ago
- ☆31Updated 3 months ago
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling☆177Updated 2 months ago
- Scaling Preference Data Curation via Human-AI Synergy☆113Updated 3 months ago
- Code for paper: Optimizing Length Compression in Large Reasoning Models☆26Updated 3 months ago
- FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning☆53Updated last week
- Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.☆158Updated 3 weeks ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆37Updated 2 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆270Updated this week
- MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models☆51Updated 2 months ago
- ☆89Updated 5 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆76Updated last month
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations☆135Updated 6 months ago
- rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking☆39Updated 9 months ago
- ☆73Updated 3 months ago
- ☆50Updated 2 months ago
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆25Updated 4 months ago
- ☆147Updated last week
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆66Updated 5 months ago
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents☆464Updated last week
- This is the code repo for the paper "Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning".☆25Updated last month
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆96Updated last week
- ☆105Updated 4 months ago
- ☆43Updated last week
- [NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model☆56Updated last week
- A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architect…☆114Updated 3 weeks ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆33Updated last year
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆59Updated 3 months ago
- ☆214Updated 2 months ago
- ☆49Updated last year