[NAACL 2025] The official implementation of the paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agents"
☆28 · Mar 14, 2024 · Updated last year
Alternatives and similar repositories for NAT
Users who are interested in NAT are comparing it to the repositories listed below
- ☆13 · Feb 17, 2025 · Updated last year
- A symbolic benchmark for verifiable chain-of-thought financial reasoning. Includes executable templates, 58 topics across 12 domains, and… ☆25 · Dec 26, 2025 · Updated 2 months ago
- (ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning" ☆25 · Feb 12, 2026 · Updated 2 weeks ago
- ☆18 · May 30, 2023 · Updated 2 years ago
- Complexity Based Prompting for Multi-Step Reasoning ☆17 · Mar 10, 2023 · Updated 2 years ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios ☆73 · May 13, 2025 · Updated 9 months ago
- ☆21 · Feb 26, 2024 · Updated 2 years ago
- Code for our EMNLP-2023 paper: "Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks" ☆25 · Nov 16, 2023 · Updated 2 years ago
- FireAct: Toward Language Agent Fine-tuning ☆292 · Oct 22, 2023 · Updated 2 years ago
- ☆23 · Sep 19, 2024 · Updated last year
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning ☆234 · Jan 13, 2025 · Updated last year
- [NAACL 2025] KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents ☆257 · Jan 29, 2025 · Updated last year
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference) ☆66 · Oct 18, 2024 · Updated last year
- ☆31 · Jun 24, 2024 · Updated last year
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025) ☆30 · Dec 12, 2024 · Updated last year
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories ☆40 · Aug 7, 2025 · Updated 6 months ago
- Code for paper Empowering Large Language Model Agents through Action Learning ☆33 · Aug 8, 2024 · Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆149 · Oct 27, 2024 · Updated last year
- mReasoner is a unified computational implementation of the model theory of thinking and reasoning ☆13 · Aug 17, 2023 · Updated 2 years ago
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution" ☆67 · Sep 13, 2025 · Updated 5 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction ☆87 · Mar 23, 2025 · Updated 11 months ago
- ☆11 · Dec 5, 2024 · Updated last year
- Fake NEWS detector using LIAR dataset. ☆11 · Aug 19, 2019 · Updated 6 years ago
- the datasets of our paper ☆11 · Feb 26, 2024 · Updated 2 years ago
- OPUS-Rota4: A Gradient-Based Protein Side-Chain Modeling Framework Assisted by Deep Learning-Based Predictors ☆11 · Apr 14, 2022 · Updated 3 years ago
- A Light, Concise and Powerful Hexo's theme ☆11 · Jul 15, 2022 · Updated 3 years ago
- C4RepSet: Representative Subset from C4 data for Training Pre-trained LMs ☆11 · Jan 13, 2023 · Updated 3 years ago
- FinanceGPT-B ☆10 · Mar 26, 2024 · Updated last year
- ☆13 · Nov 5, 2024 · Updated last year
- [EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning ☆75 · Nov 4, 2025 · Updated 3 months ago
- ☆10 · Aug 15, 2022 · Updated 3 years ago
- ☆11 · Jun 7, 2023 · Updated 2 years ago
- ☆10 · Jul 6, 2023 · Updated 2 years ago
- DREEM Relates Every Entities' Motion (DREEM). Global Tracking Transformers for biological multi-object tracking. ☆13 · Updated this week
- ☆13 · Aug 3, 2024 · Updated last year
- A lightweight repository for exploring and experimenting with AI agents ☆14 · Jul 22, 2025 · Updated 7 months ago
- ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral) ☆270 · Apr 18, 2024 · Updated last year
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification ☆11 · Aug 12, 2023 · Updated 2 years ago
- ☆11 · May 6, 2025 · Updated 9 months ago