zjunlp/KnowRL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zjunlp/KnowRL)

zjunlp / KnowRL

KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality

☆48

Alternatives and similar repositories for KnowRL

Users that are interested in KnowRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zjunlp / WorldMind
View on GitHub
Aligning Agentic World Models via Knowledgeable Experience Learning
☆37May 15, 2026Updated 2 months ago
nusnlp / FSPO
View on GitHub
Official code for our paper "Reasoning Models Hallucinate More: Factuality-Aware Reinforcement Learning for Large Reasoning Models"
☆26Oct 31, 2025Updated 8 months ago
HKUNLP / critic-rl
View on GitHub
[ICML 2025] Teaching Language Models to Critique via Reinforcement Learning
☆126May 6, 2025Updated last year
byronBBL / Context-DPO
View on GitHub
Official repository of paper "Context-DPO: Aligning Language Models for Context-Faithfulness"
☆23Feb 17, 2025Updated last year
zjunlp / InnoEval
View on GitHub
[ICML 2026] InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem
☆28Jun 21, 2026Updated 3 weeks ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zjunlp / LightThinker
View on GitHub
[EMNLP 2025] LightThinker: Thinking Step-by-Step Compression
☆165Jun 22, 2026Updated 3 weeks ago
zjunlp / InstructCell
View on GitHub
A Multi-Modal AI Copilot for Single-Cell Analysis with Instruction Following
☆33Jan 15, 2025Updated last year
zjunlp / LabVLA
View on GitHub
LabVLA: Grounding Vision–Language–Action Models in Scientific Laboratories
☆90Jul 4, 2026Updated 2 weeks ago
liushulinle / UloRL
View on GitHub
An Ultra-Long Output Reinforcement Learning Approach
☆23Jul 31, 2025Updated 11 months ago
zjunlp / ReCode
View on GitHub
[AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates
☆25Jul 1, 2025Updated last year
hengzzzhou / ReSo
View on GitHub
☆25Jan 29, 2026Updated 5 months ago
uw-nsl / safechain
View on GitHub
[ACL 25] SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities
☆30Apr 2, 2025Updated last year
zjunlp / SemEval2021Task4
View on GitHub
The 4th rank system of the SemEval 2021 Task4.
☆10May 7, 2022Updated 4 years ago
satrams / rent-rl
View on GitHub
RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs.
☆42Oct 31, 2025Updated 8 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
LiaoMengqi / E3-RL4LLMs
View on GitHub
[ EMNLP 2025 Main ] Enhancing Efficiency and Exploration in Reinforcement Learning for LLMs
☆17Nov 7, 2025Updated 8 months ago
OceanGPT / OceanGym
View on GitHub
OceanGym: A Benchmark Environment for Underwater Embodied Agents
☆133Jul 3, 2026Updated 2 weeks ago
zjukg / RTQA
View on GitHub
[Paper][EMNLP 2025] RTQA : Recursive Thinking for Complex Temporal Knowledge Graph Question Answering with Large Language Models
☆17Jan 29, 2026Updated 5 months ago
simonucl / PolySkill
View on GitHub
Official implementation of PolySkill, a framework that enables web agents to learn generalizable and compositional skills through polymor…
☆15Jul 6, 2026Updated 2 weeks ago
HanNight / AdaCAD
View on GitHub
Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"
☆16Mar 2, 2026Updated 4 months ago
javiferran / sae_entities
View on GitHub
☆78Mar 6, 2025Updated last year
ZJUFanLab / KANO
View on GitHub
Code and data for the Nature Machine Intelligence paper "Knowledge graph-enhanced molecular contrastive learning with functional prompt".
☆11May 16, 2023Updated 3 years ago
zjunlp / LookAheadTuning
View on GitHub
[WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews
☆17Dec 14, 2025Updated 7 months ago
zjunlp / BiasEdit
View on GitHub
[TrustNLP@NAACL 2025] BiasEdit: Debiasing Stereotyped Language Models via Model Editing
☆18Sep 30, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
JDing0521 / GraphOTTER
View on GitHub
☆21Dec 7, 2024Updated last year
MasterVito / DAC-RL
View on GitHub
Official Repo for DAC-RL: Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability
☆16Feb 26, 2026Updated 4 months ago
zjunlp / KnowUnDo
View on GitHub
[EMNLP 2024] To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models
☆48Jan 23, 2025Updated last year
RUCAIBox / R1-Searcher-plus
View on GitHub
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning
☆81May 25, 2025Updated last year
zjunlp / MemBase
View on GitHub
A Comprehensive Benchmarking Framework for Long-Term Conversational Memory Layers
☆42Jun 29, 2026Updated 3 weeks ago
uservan / ThinkPO
View on GitHub
☆17Aug 1, 2025Updated 11 months ago
janphilippfranken / sami
View on GitHub
Self-Supervised Alignment with Mutual Information
☆20May 24, 2024Updated 2 years ago
zepingyu0512 / awesome-LLM-neuron
View on GitHub
☆36Jun 13, 2025Updated last year
jlko / long_hallucinations
View on GitHub
Codebase for reproducing the experiments of the semantic uncertainty paper (paragraph-length experiments).
☆83Apr 12, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
yueyu1030 / actune
View on GitHub
[NAACL 2022] This is the code repo for our paper `ACTUNE: Uncertainty-based Active Self-Training for Active Fine-Tuning of Pretrained Lan…
☆15Nov 16, 2022Updated 3 years ago
ShuheSH / A-Survey-of-the-Reasoning-Abilities-of-LLMs
View on GitHub
☆28Mar 4, 2025Updated last year
microsoft / ConstrainedReasoner
View on GitHub
☆13Aug 26, 2024Updated last year
xyliu-cs / StateLM
View on GitHub
[ICLR'26] Official Open-source Implementation of StateLM
☆20Feb 13, 2026Updated 5 months ago
The-Swarm-Corporation / AgentParse
View on GitHub
AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…
☆18Oct 13, 2025Updated 9 months ago
THUNLP-MT / CODIS
View on GitHub
Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".
☆13Oct 14, 2024Updated last year
tfmortie / uaml
View on GitHub
Uncertainty-aware classification.
☆17Jun 28, 2022Updated 4 years ago