weiyifan1023/AutoTIR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/weiyifan1023/AutoTIR)

weiyifan1023 / AutoTIR

Code and Data for Paper "AutoTIR: Autonomous Tools Integrated Reasoning via Reinforcement Learning"

☆54

Alternatives and similar repositories for AutoTIR

Users that are interested in AutoTIR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

weiyifan1023 / senator
View on GitHub
NeurIPS 2025: Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMs
☆66Nov 21, 2025Updated 8 months ago
weiyifan1023 / Neeko
View on GitHub
Code and Data for EMNLP 2024 Paper "Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent"
☆140Jul 23, 2025Updated last year
yongchao98 / R1-Code-Interpreter
View on GitHub
R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning
☆45Feb 9, 2026Updated 5 months ago
lfy79001 / Awesome-Table-QA
View on GitHub
A comprehensive paper list of Table-based Question Answering.
☆40Sep 1, 2023Updated 2 years ago
tongxuluo / LeaP
View on GitHub
Code, Data and Model for Paper "Learning from Peers in Reasoning Models"
☆26May 13, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
jinzhuoran / RAG-RewardBench
View on GitHub
RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
☆18Dec 19, 2024Updated last year
hzy312 / knowledge-r1
View on GitHub
IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent
☆70May 13, 2025Updated last year
MasterVito / SwS
View on GitHub
Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning
☆42Nov 11, 2025Updated 8 months ago
lfy79001 / TableQAKit
View on GitHub
A Toolkit for Table-based Question Answering
☆117Oct 19, 2023Updated 2 years ago
WENGSYX / LMTuner
View on GitHub
LMTuner: Make the LLM Better for Everyone
☆38Sep 21, 2023Updated 2 years ago
Kwai-Klear / RLEP
View on GitHub
RL with Experience Replay
☆58Jul 27, 2025Updated 11 months ago
RUCKBReasoning / CodeRM
View on GitHub
Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of Unit Tests for Code Reward Modeling'
☆27May 16, 2025Updated last year
lukahhcm / Awesome_Environment_Scaling
View on GitHub
Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …
☆72Jan 28, 2026Updated 5 months ago
qhjqhj00 / MetaAgent
View on GitHub
MetaAgent: Toward Self-Evolving Agent via Tool Meta-Learning
☆47Sep 3, 2025Updated 10 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
wzhouad / context-faithful-llm
View on GitHub
Code and data for paper "Context-faithful Prompting for Large Language Models".
☆41Mar 23, 2023Updated 3 years ago
ltzheng / SimpleTIR
View on GitHub
[ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
☆401Mar 30, 2026Updated 3 months ago
YujunZhou / EVOL-RL
View on GitHub
Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).
☆51Mar 31, 2026Updated 3 months ago
DynaMath / DynaMath
View on GitHub
A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models
☆30Nov 25, 2024Updated last year
quanshr / AugCon
View on GitHub
[AAAI 2025]Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity
☆30Mar 17, 2025Updated last year
RUCAIBox / CIR
View on GitHub
☆16Nov 11, 2025Updated 8 months ago
chtmp223 / suri
View on GitHub
Suri: Multi-constraint instruction following for long-form text generation [EMNLP’24]
☆27Oct 3, 2025Updated 9 months ago
BaohaoLiao / SAGE
View on GitHub
Self-Hinting Language Models Enhance Reinforcement Learning
☆26Mar 28, 2026Updated 3 months ago
Xnhyacinth / NesyCD
View on GitHub
[AAAI 2025] Neural-Symbolic Collaborative Distillation: Advancing Small Language Models for Complex Reasoning Tasks
☆12Jun 19, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
benjaminzwhite / reasoning-models
View on GitHub
Experiments with reasoning models, training techniques, papers
☆30Updated this week
Trae1ounG / Pretrain_Space_RLVR
View on GitHub
[arxiv: 2604.14142] From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space
☆17Apr 16, 2026Updated 3 months ago
SharkSpicy-NLP / SR-KI
View on GitHub
SR-KI: Scalable and Real-Time Knowledge Integration into LLMs via Supervised Attention
☆56Dec 6, 2025Updated 7 months ago
QingFei1 / R-Search
View on GitHub
[ACL 2026] R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning
☆35Jan 4, 2026Updated 6 months ago
JingMog / THOR
View on GitHub
[ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".
☆33Feb 26, 2026Updated 5 months ago
Liac-li / MM-self-improve-qwen2vl
View on GitHub
☆13Dec 9, 2024Updated last year
RUC-NLPIR / Tool-Light
View on GitHub
Toward Effective Tool-Integrated Reasoning via Self-Evolved Preference Learning
☆34Sep 30, 2025Updated 9 months ago
luyaojie / E3C
View on GitHub
End-to-End Neural Event Coreference Resolution
☆11Jun 18, 2023Updated 3 years ago
KnowledgeXLab / O2-Searcher
View on GitHub
[TMLR 2026] A Searching-based Agent Model for Open-Domain Open-Ended Question Answering
☆39Jun 20, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
RUCAIBox / R1-Searcher-plus
View on GitHub
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning
☆82May 25, 2025Updated last year
dependentsign / Awesome-LLM-based-Evaluators
View on GitHub
✨✨Latest Papers about LLM-based Evaluators
☆32Feb 26, 2026Updated 5 months ago
MurrayTom / SG-Bench
View on GitHub
SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types
☆26Nov 29, 2024Updated last year
ethz-spylab / rlhf-poisoning
View on GitHub
Code for paper "Universal Jailbreak Backdoors from Poisoned Human Feedback"
☆67Apr 24, 2024Updated 2 years ago
lemon0830 / promptCSE
View on GitHub
code for promptCSE, emnlp 2022
☆11Apr 10, 2023Updated 3 years ago
GAIR-NLP / OctoThinker
View on GitHub
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
☆189Jul 23, 2025Updated last year
StarDewXXX / AdaR1
View on GitHub
The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"
☆24May 6, 2026Updated 2 months ago