Reason-Wang/NAT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Reason-Wang/NAT)

Reason-Wang / NAT

[NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agents"

☆28

Alternatives and similar repositories for NAT

Users that are interested in NAT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LibrAIResearch / libra-eval
View on GitHub
☆23May 20, 2025Updated last year
WangHanLinHenry / STeCa
View on GitHub
(ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning"
☆29Mar 2, 2026Updated 4 months ago
yao8839836 / cp
View on GitHub
☆13Feb 17, 2025Updated last year
FranxYao / Complexity-Based-Prompting
View on GitHub
Complexity Based Prompting for Multi-Step Reasoning
☆17Mar 10, 2023Updated 3 years ago
Yifan-Song793 / ETO
View on GitHub
Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)
☆168Oct 30, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Junjie-Ye / ToolEyes
View on GitHub
[COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios
☆74May 13, 2025Updated last year
mbzuai-nlp / finchain
View on GitHub
A symbolic benchmark for verifiable chain-of-thought financial reasoning. Includes executable templates, 58 topics across 12 domains, and…
☆28Dec 26, 2025Updated 6 months ago
anchen1011 / FireAct
View on GitHub
FireAct: Toward Language Agent Fine-tuning
☆296Oct 22, 2023Updated 2 years ago
zjunlp / AutoAct
View on GitHub
[ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning
☆238Jan 13, 2025Updated last year
zjunlp / KnowAgent
View on GitHub
[NAACL 2025] KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents
☆260Jan 29, 2025Updated last year
Kguo-cs / ccil
View on GitHub
☆17Aug 2, 2023Updated 2 years ago
snw2021 / LLM_Unlearning_Papers
View on GitHub
☆26Nov 25, 2023Updated 2 years ago
Atrix256 / FrequencySpaceImages
View on GitHub
Working with images in frequency space
☆10Nov 5, 2020Updated 5 years ago
TIGER-AI-Lab / MAmmoTH2
View on GitHub
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
☆146Oct 27, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
BitSecret / HyperGNet
View on GitHub
Geometric Problem Solving Integrating FormalGeo Symbolic System and Hypergraph Neural Network.
☆16Sep 23, 2025Updated 9 months ago
RUCAIBox / JiuZhang3.0
View on GitHub
The code and data for the paper JiuZhang3.0
☆49May 26, 2024Updated 2 years ago
LAMDA-RL / ACT
View on GitHub
Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)
☆17Feb 10, 2024Updated 2 years ago
SWE-Gym / SWE-Bench-Fork
View on GitHub
☆13Mar 5, 2025Updated last year
Linzwcs / AFT
View on GitHub
☆13Jan 22, 2025Updated last year
MetaCopilot / dseval
View on GitHub
☆33Jun 24, 2024Updated 2 years ago
Agent-One-Lab / AgentFly
View on GitHub
Scalable and extensible reinforcement learning for LM agents.
☆122May 6, 2026Updated 2 months ago
dinobby / MAgICoRE
View on GitHub
☆23Sep 19, 2024Updated last year
s2e-lab / Code-Smell-Code-Generation
View on GitHub
Source code for "An Empirical Study of Code Smells in Transformer-based Code Generation Techniques".
☆11Oct 4, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
JSJeong-me / GPT-Table
View on GitHub
GPT Table Semantic Parsing with complex & non-intuitive structure.
☆17Jul 16, 2025Updated last year
sparkle-reasoning / sparkle
View on GitHub
[NeurIPS'25] Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning
☆16Dec 12, 2025Updated 7 months ago
wujwyi / CMC
View on GitHub
[NeurIPS 2024 poster] Cross-model Control: Improving Multiple Large Language Models in One-time Training
☆14Oct 25, 2024Updated last year
Fu-Dayuan / PreAct
View on GitHub
PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)
☆31Dec 12, 2024Updated last year
VinAIResearch / RecGPT
View on GitHub
RecGPT: Generative Pre-training for Text-based Recommendation (ACL 2024)
☆42Sep 22, 2024Updated last year
Jl-wei / guing
View on GitHub
A mobile GUI search engine using a vision-language model
☆15May 5, 2025Updated last year
wssun / PromptCS
View on GitHub
A Prompt Learning Framework for Source Code Summarization
☆14Dec 26, 2023Updated 2 years ago
dair-iitd / TourismQA
View on GitHub
☆13Nov 8, 2021Updated 4 years ago
TianjinYellow / UGTs-LoG
View on GitHub
This is the official code for UGTs.
☆13Feb 8, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Data-reindeer / MolScaling
View on GitHub
Source code for the paper 'Uncovering Neural Scaling Laws in Molecular Representation Learning' (NeurIPS 2023 Datasets and Benchmarks).
☆14Dec 2, 2023Updated 2 years ago
Cydia2018 / AS-ViT
View on GitHub
Adaptive Sparse ViT
☆16Aug 1, 2023Updated 2 years ago
YangLing0818 / SuperCorrect-llm
View on GitHub
[ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction
☆90Mar 23, 2025Updated last year
habedi / Snowball-Sampling
View on GitHub
Snowball Sampling in NetworX
☆14Nov 9, 2018Updated 7 years ago
zhao-ht / LearnAct
View on GitHub
Code for paper Empowering Large Language Model Agents through Action Learning
☆34Aug 8, 2024Updated last year
XueruiSu / Trust-Region-Preference-Approximation
View on GitHub
Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning
☆15Jun 28, 2025Updated last year
alecwangcq / f-divergence-dpo
View on GitHub
Direct preference optimization with f-divergences.
☆17Nov 3, 2024Updated last year