bigai-nlco/RuleReasoner

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bigai-nlco/RuleReasoner)

bigai-nlco / RuleReasoner

[ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling

☆39

Alternatives and similar repositories for RuleReasoner

Users that are interested in RuleReasoner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bigai-nlco / ReflectEvo
View on GitHub
Official Repo for ReflectEvo
☆21Jun 16, 2025Updated last year
bigai-nlco / RouterLens
View on GitHub
[EMNLP 2025] RouterLens
☆29Sep 15, 2025Updated 10 months ago
bigai-nlco / Awesome-AI-Memory
View on GitHub
TMLR | This survey presents a comprehensive and structured synthesis of memory in LLMs and MLLMs, organizing the literature into a cohesi…
☆37Jan 28, 2026Updated 5 months ago
bigai-nlco / CREAM
View on GitHub
[NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding
☆22Oct 10, 2024Updated last year
bigai-nlco / TokenSwift
View on GitHub
[ICML 2025] |TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation
☆126May 19, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
bigai-nlco / Native-Parallel-Reasoner
View on GitHub
[ICML 2026] Reasoning in Parallelism via Self-Distilled RL
☆113Jun 28, 2026Updated 3 weeks ago
OmniMMI / OmniMMI
View on GitHub
[CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts
☆23Jul 14, 2026Updated last week
bigai-nlco / UltraVoice
View on GitHub
Official Repository of UltraVoice
☆63Oct 28, 2025Updated 8 months ago
SLIT-AI / FuseChat-3.0
View on GitHub
☆18Apr 18, 2025Updated last year
humanlaya / OneMillion-Bench
View on GitHub
Evals Harness for $OneMillion-Bench
☆48Apr 21, 2026Updated 3 months ago
bigai-nlco / LatentSeek
View on GitHub
Official Repository of LatentSeek
☆85Jun 6, 2025Updated last year
OmniMMI / M4
View on GitHub
[CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts
☆18Apr 2, 2025Updated last year
psunlpgroup / FoVer
View on GitHub
This repository includes code and materials for the paper "Efficient PRM Training Data Synthesis via Formal Verification" (ACL 2026 Findi…
☆19Apr 7, 2026Updated 3 months ago
bigai-nlco / VideoLLaMB
View on GitHub
[ICCV 2025] Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges
☆87Feb 27, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
rhyang2021 / ARIA
View on GitHub
Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".
☆30Aug 9, 2025Updated 11 months ago
yuleiqin / RAIF
View on GitHub
A Recipe for Building LLM Reasoners to Solve Complex Instructions
☆32Oct 9, 2025Updated 9 months ago
bigai-ai / ICE
View on GitHub
【ICLR 2025 🔥】The code for Consistent In-Context Editing, an approach for tuning language models through contextual distributions, overco…
☆56Apr 2, 2025Updated last year
SLIT-AI / WRPO
View on GitHub
[ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion
☆14Mar 17, 2025Updated last year
patrick-tssn / LM-Research-Hub
View on GitHub
Language Modeling Research Hub, a comprehensive compendium for enthusiasts and scholars delving into the fascinating realm of language mo…
☆19Mar 19, 2025Updated last year
bigai-nlco / VideoTGB
View on GitHub
[EMNLP 2024] A Video Chat Agent with Temporal Prior
☆33Mar 2, 2025Updated last year
liushulinle / MarsRL
View on GitHub
MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism
☆18Nov 18, 2025Updated 8 months ago
Henry839 / PaperMaster
View on GitHub
☆15Apr 14, 2026Updated 3 months ago
shangshang-wang / Resa
View on GitHub
Resa: Transparent Reasoning Models via SAEs
☆50Sep 23, 2025Updated 10 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
THU-KEG / PairJudgeRM
View on GitHub
☆15Apr 14, 2025Updated last year
QingFei1 / R-Search
View on GitHub
[ACL 2026] R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning
☆35Jan 4, 2026Updated 6 months ago
liujch1998 / memo-trap
View on GitHub
☆23Jan 25, 2023Updated 3 years ago
GAIR-NLP / OctoThinker
View on GitHub
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
☆189Jul 23, 2025Updated last year
HY-SpongeBob / HY-SpongeBob
View on GitHub
☆26May 26, 2026Updated 2 months ago
CraftJarvis / MC-TextWorld
View on GitHub
Text world based on Minecraft rules.
☆18May 13, 2024Updated 2 years ago
DualityRL / multi-attempt
View on GitHub
☆19Mar 10, 2025Updated last year
patrick-tssn / Awesome-Colorful-LLM
View on GitHub
Recent advancements propelled by large language models (LLMs), encompassing an array of domains including Vision, Audio, Agent, Robotics,…
☆128May 7, 2026Updated 2 months ago
RM-R1-UIUC / RM-R1
View on GitHub
[ICLR'26] RM-R1: Unleashing the Reasoning Potential of Reward Models
☆167Jun 26, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
test-time-interaction / TTI
View on GitHub
☆76Jun 10, 2025Updated last year
cyzus / thoughtsculpt
View on GitHub
THOUGHTSCULPT, a general reasoning and search method for complex tasks
☆13Dec 13, 2024Updated last year
sail-sg / ActivePRM
View on GitHub
☆21Apr 16, 2025Updated last year
Leey21 / A-Data-Centric-Study
View on GitHub
☆18Mar 2, 2026Updated 4 months ago
AIGeeksGroup / PresentAgent-2
View on GitHub
PresentAgent-2: Towards Generalist Multimodal Presentation Agents
☆17Jun 5, 2026Updated last month
bigai-ai / tong-geometry
View on GitHub
☆45Feb 4, 2026Updated 5 months ago
ALT-JS / OthelloSAE
View on GitHub
CS194-196 Course Project
☆14Feb 20, 2025Updated last year