[ACL 2026] Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments
☆52Apr 6, 2026Updated last month
Alternatives and similar repositories for FTRL
Users that are interested in FTRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.☆11Apr 5, 2023Updated 3 years ago
- ToolBridge: An Open-Source Dataset to Equip LLMs with External Tool Capabilities☆15Feb 11, 2025Updated last year
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- The implementation of RAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimization☆21May 26, 2025Updated 11 months ago
- This is the official repo for Towards Uncertainty-Aware Language Agent.☆30Aug 15, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆136Oct 9, 2025Updated 7 months ago
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE☆60Nov 5, 2025Updated 6 months ago
- ☆45May 3, 2026Updated 3 weeks ago
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection☆54Oct 29, 2024Updated last year
- Implementation code for ACL2024:Advancing Parameter Efficiency in Fine-tuning via Representation Editing☆15Apr 20, 2024Updated 2 years ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆60May 28, 2024Updated last year
- ☆30May 24, 2025Updated last year
- Official implementation of Browse-Master, a tool-augmented web-search agent.☆31Aug 22, 2025Updated 9 months ago
- ☆53Oct 10, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆22May 3, 2025Updated last year
- A simple 2D ball collision engine.☆12Jun 15, 2023Updated 2 years ago
- VeriWeb: Verifiable Long-Chain Web Benchmark for Agentic Information-Seeking☆88Jan 21, 2026Updated 4 months ago
- Repo for Anonymous purpose, pls don't distribute☆10Oct 2, 2024Updated last year
- A framework for camera-controllable image editing using unified geometric guidance and video models.☆65Apr 28, 2026Updated 3 weeks ago
- Official repository of Graph RAG-Tool Fusion and ToolLinkOS dataset.☆23Feb 13, 2025Updated last year
- [EMNLP 2025] Official codebase for Rearank: Reasoning Re-ranking Agent☆36Aug 20, 2025Updated 9 months ago
- The official github repo for the open online courses: "Dive into LLMs".☆10Mar 15, 2024Updated 2 years ago
- The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.☆51Oct 18, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆39Jun 18, 2025Updated 11 months ago
- A modified version of the cart-pole OpenAI Gym environment for testing different control policies☆13May 4, 2026Updated 3 weeks ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆385Apr 3, 2026Updated last month
- Model-Agnostic Meta-Learning in PyTorch☆12Jul 31, 2020Updated 5 years ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆32Oct 9, 2025Updated 7 months ago
- [IROS 2021] ADD: A Fine-grained Dynamic Inference Architecture for Semantic Image Segmentation☆10May 3, 2022Updated 4 years ago
- F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electr…☆36Jul 3, 2025Updated 10 months ago
- ☆15May 4, 2024Updated 2 years ago
- ☆18Apr 6, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent (ACL 2026 Main)☆274Dec 11, 2025Updated 5 months ago
- Developed a high-performance trading engine using Rust, leveraging its powerful features for low-level systems programming. Engineered to…☆23Nov 9, 2024Updated last year
- "DeepResearch-Eval: An End-to-End Evaluation Framework for DeepResearch Systems"☆45Oct 16, 2025Updated 7 months ago
- ☆16Dec 10, 2023Updated 2 years ago
- The code and data of We-Math 2.0.☆170Aug 30, 2025Updated 8 months ago
- Beer Game implemented as an OpenAI gym environment.☆17Aug 4, 2019Updated 6 years ago
- This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs).☆11May 24, 2024Updated 2 years ago