bytedance/FTRL

[ACL 2026] Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments

☆52

Alternatives and similar repositories for FTRL

Users who are interested in FTRL are comparing it to the repositories listed below.

Shellorley0513 / CriticTool
[EMNLP 2025] Official Implementation of "CRITICTOOL: Evaluating Self-Critique Capabilities of Large Language Models in Tool-Calling Error Scen…
☆18 · Sep 2, 2025 · Updated 10 months ago
zjunlp / OneEdit
OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.
☆20 · Oct 14, 2024 · Updated last year
inclusionAI / GroveMoE
☆24 · Aug 20, 2025 · Updated 11 months ago
Mizersy / RepoDeepSearch
☆44 · Oct 28, 2025 · Updated 8 months ago
Rainier-rq / verl-if
Official implementation of the paper "Instructions are all you need: Self-supervised Reinforcement Learning for Instruction Following"
☆40 · Jan 11, 2026 · Updated 6 months ago
Junjie-Ye / RoTBench
[EMNLP 2024] RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning
☆15 · May 13, 2025 · Updated last year
Junjie-Ye / MulDimIF
[ACL 2026] A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models
☆23 · Jul 10, 2026 · Updated 2 weeks ago
Junjie-Ye / ToolSword
[ACL 2024] ToolSword: Unveiling Safety Issues of Large Language Models in Tool Learning Across Three Stages
☆15 · Sep 12, 2024 · Updated last year
zzwkk / MUA-RL
MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE
☆65 · Nov 5, 2025 · Updated 8 months ago
hkust-nlp / deepsearch-tts
Pushing Test-Time Scaling Limits of Deep Search with Asymmetric Verification
☆21 · Oct 8, 2025 · Updated 9 months ago
EachSheep / RAGSynth
The implementation of RAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimization
☆21 · May 26, 2025 · Updated last year
init0xyz / AdaCQR
Implementation of AdaCQR (COLING 2025)
☆15 · Dec 30, 2024 · Updated last year
quchangle1 / MatchTIR
The implementation for ACL 2026: MatchTIR: Fine-Grained Supervision for Tool-Integrated Reasoning via Bipartite Matching.
☆20 · Apr 18, 2026 · Updated 3 months ago
OpenIXCLab / CODA
CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning
☆37 · Aug 28, 2025 · Updated 10 months ago
syncdoth / Chain-of-Hindsight-PyTorch
Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using PyTorch and Hugging Face Trainers.
☆11 · Apr 5, 2023 · Updated 3 years ago
zjunlp / ReCode
[AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates
☆25 · Jul 1, 2025 · Updated last year
princeton-nlp / ELIZA-Transformer
[NAACL 2025] Representing Rule-based Chatbots with Transformers
☆23 · Feb 9, 2025 · Updated last year
amodaresi / MemLLM
☆13 · Aug 13, 2024 · Updated last year
Junjie-Ye / ToolEyes
[COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios
☆74 · May 13, 2025 · Updated last year
ByteDance-Seed / WideSearch
WideSearch: Benchmarking Agentic Broad Info-Seeking
☆148 · Oct 9, 2025 · Updated 9 months ago
CharlesPikachu / ToolBridge
ToolBridge: An Open-Source Dataset to Equip LLMs with External Tool Capabilities
☆14 · Feb 11, 2025 · Updated last year
RUC-NLPIR / HiRA
The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search [SIGIR 2026]
☆65 · Jul 4, 2025 · Updated last year
LianjiaTech / astra
ASTRA is an end-to-end system for synthesizing agentic trajectories and rule-verifiable environments for SFT and RL training, developed b…
☆148 · Jan 30, 2026 · Updated 5 months ago
hrwise-nlp / ToolsMeetLLMs
☆33 · May 8, 2025 · Updated last year
SkyworkAI / Skywork-DeepResearch
☆27 · Aug 13, 2025 · Updated 11 months ago
NIL-zhuang / EfficientRAG-official
Code Repo for EfficientRAG: Efficient Retriever for Multi-Hop Question Answering
☆69 · Mar 4, 2025 · Updated last year
qiancheng0 / ToolRL
☆513 · Oct 16, 2025 · Updated 9 months ago
casetext / r-and-r
Code for the "Long Context Needs Some R&R" paper.
☆12 · Mar 11, 2024 · Updated 2 years ago
RUC-NLPIR / Tool-Star
🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning
☆403 · Apr 3, 2026 · Updated 3 months ago
alon-albalak / TLiDB
Transfer Learning in Dialogue Benchmarking Toolkit
☆14 · Mar 31, 2023 · Updated 3 years ago
THUDM / TreeRL
TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25
☆97 · Jun 16, 2025 · Updated last year
multimodal-art-projection / OProver
☆23 · May 17, 2026 · Updated 2 months ago
thunlp / S3Delta
Code for the paper "Sparse Structure Search for Delta Tuning"
☆11 · Oct 16, 2022 · Updated 3 years ago
Jiuzhouh / Uncertainty-Aware-Language-Agent
This is the official repo for Towards Uncertainty-Aware Language Agent.
☆31 · Aug 15, 2024 · Updated last year
MiroMindAI / MiroRL
MiroRL is an MCP-first reinforcement learning framework for deep research agents.
☆246 · Aug 27, 2025 · Updated 10 months ago
Alibaba-NLP / MaskSearch
Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"
☆155 · May 27, 2025 · Updated last year
yuyq18 / StepTool
☆36 · May 24, 2025 · Updated last year
mlwu22 / RED
Implementation code for ACL 2024: Advancing Parameter Efficiency in Fine-tuning via Representation Editing
☆15 · Apr 20, 2024 · Updated 2 years ago
OPPO-PersonalAI / OAgents
Implementation for OAgents: An Empirical Study of Building Effective Agents
☆327 · Oct 13, 2025 · Updated 9 months ago