hzy312/knowledge-r1


IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent

☆70

Alternatives and similar repositories for knowledge-r1

Users who are interested in knowledge-r1 are comparing it to the repositories listed below.

jinzhuoran / RAG-RewardBench
RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
☆18 · Dec 19, 2024 · Updated last year

weiyifan1023 / Neeko
Code and Data for EMNLP 2024 Paper "Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent"
☆140 · Jul 23, 2025 · Updated last year

huangyuxiang03 / Locret
☆14 · Oct 3, 2024 · Updated last year

namespace-Pt / UltraGist
☆18 · Dec 2, 2024 · Updated last year

weiyifan1023 / AutoTIR
Code and Data for Paper "AutoTIR: Autonomous Tools Integrated Reasoning via Reinforcement Learning"
☆54 · Sep 4, 2025 · Updated 10 months ago

ChengpengLi1003 / Awesome-Long-Chain-of-Thought-Reasoning-with-tools
A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.
☆46 · Dec 17, 2025 · Updated 7 months ago

SparkJiao / StructTest
☆19 · Jul 24, 2025 · Updated last year

amazon-science / irgr
☆12 · Jul 6, 2023 · Updated 3 years ago

facebookresearch / UniK-QA
Unified Representations of Structured and Unstructured Knowledge for Open-Domain Question Answering
☆51 · Aug 2, 2022 · Updated 3 years ago

Fu-Dayuan / AgentRefine
(ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning
☆20 · Nov 22, 2025 · Updated 8 months ago

RUCAIBox / R1-Searcher
R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning
☆720 · Aug 5, 2025 · Updated 11 months ago

lfy79001 / S3Eval
[NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models
☆33 · Jun 10, 2024 · Updated 2 years ago

chenlong-clock / RULE-Unlearn
[NeurIPS25] RULE: Reinforcement UnLEarning Achieves Forget-retain Pareto Optimality
☆20 · Oct 22, 2025 · Updated 9 months ago

HansiZeng / scaling-retriever
[SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"
☆22 · Mar 31, 2025 · Updated last year

HongbangYuan / OmniReward
☆47 · Dec 16, 2025 · Updated 7 months ago

wizardlancet / diagnosis_zero
diagnosis_zero, R1 Zero reproduce on disease diagnosis
☆32 · Jul 24, 2025 · Updated last year

FutureComputing4AI / Hrrformer
Hrrformer: A Neuro-symbolic Self-attention Model (ICML23)
☆64 · Oct 8, 2025 · Updated 9 months ago

DA-Open / DV-World
[ICML 2026] DV-World: Benchmarking Data Visualization Agents in Real-World Scenarios
☆69 · Apr 29, 2026 · Updated 2 months ago

NVIDIA / When2Call
A dataset for training and evaluating LLMs on decision making about "when (not) to call" functions
☆67 · Apr 29, 2025 · Updated last year

ADaM-BJTU / OpenRFT
OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning
☆157 · Dec 24, 2024 · Updated last year

RAG-Gym / RAG-Gym
Official repository for RAG-Gym
☆124 · Jul 14, 2026 · Updated last week

reddy-lab-code-research / PPOCoder
Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"
☆116 · Jan 9, 2024 · Updated 2 years ago

Trae1ounG / BuPO
[arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies
☆60 · Feb 6, 2026 · Updated 5 months ago

dinobby / MAgICoRE
☆23 · Sep 19, 2024 · Updated last year

RUC-NLPIR / iAgent
Including 12+ cutting-edge agent systems across multiple research directions
☆35 · Nov 10, 2025 · Updated 8 months ago

EduardTalianu / EntropixLab
entropix style sampling + GUI
☆27 · Oct 30, 2024 · Updated last year

Lyun0912-wu / LongAttn
LongAttn: Selecting Long-context Training Data via Token-level Attention
☆15 · Jul 16, 2025 · Updated last year

facebookresearch / sweet_rl
Benchmark and research code for the paper "SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks"
☆271 · May 5, 2025 · Updated last year

tongxuluo / LeaP
Code, Data and Model for Paper "Learning from Peers in Reasoning Models"
☆26 · May 13, 2025 · Updated last year

GAIR-NLP / OctoThinker
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
☆189 · Jul 23, 2025 · Updated last year

HKUNLP / SymGen
[EMNLP'23] Code for Generating Data for Symbolic Language with Large Language Models
☆18 · Oct 21, 2023 · Updated 2 years ago

StarDewXXX / AdaR1
The official repository of NeurIPS'25 paper "Ada-R1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"
☆24 · May 6, 2026 · Updated 2 months ago

Job-Bench / job-bench-eval
Official eval scripts for JobBench
☆29 · Jul 18, 2026 · Updated last week

ZNLP / Language-Imbalance-Driven-Rewarding
[ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving
☆25 · Apr 6, 2026 · Updated 3 months ago

amazon-science / wikiwiki-dataset
☆11 · May 11, 2022 · Updated 4 years ago

allenai / feb
Code associated with the paper: "Few-Shot Self-Rationalization with Natural Language Prompts"
☆12 · Apr 27, 2022 · Updated 4 years ago

INK-USC / FiD-ICL
"FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)
☆15 · Jul 24, 2023 · Updated 3 years ago

Zhitao-He / AgentsCourt
AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation (EMNLP 2024 Findings)
☆18 · Dec 30, 2024 · Updated last year

eric-haibin-lin / verl-data
☆14 · May 12, 2025 · Updated last year