tongxuluo/LeaP


tongxuluo / LeaP

Code, Data and Model for Paper "Learning from Peers in Reasoning Models"

☆26

Alternatives and similar repositories for LeaP

Users who are interested in LeaP are comparing it to the repositories listed below.


lfy79001 / RegHNT
Code for COLING 2022 long paper: Answering Numerical Reasoning Questions in Table-Text Hybrid Contents with Graph-based Encoder and Tree-…
☆22 · Dec 15, 2022 · Updated 3 years ago
tongxuluo / prts
Code and Model for NeurIPS 2024 Spotlight Paper "Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training…
☆44 · Oct 16, 2024 · Updated last year
kyegomez / SelfExtend
Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta
☆13 · Nov 11, 2024 · Updated last year
weiyifan1023 / AutoTIR
Code and Data for Paper "AutoTIR: Autonomous Tools Integrated Reasoning via Reinforcement Learning"
☆54 · Sep 4, 2025 · Updated 10 months ago
shenao-zhang / reward-augmented-preference
The official implementation of Preference Data Reward-Augmentation.
☆18 · May 1, 2025 · Updated last year
YJiangcm / BMC
[ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization
☆12 · Jan 26, 2025 · Updated last year
shengliu66 / FractionalReason
Official github repo for "Fractional Reasoning via Latent Steering Vectors Improves Inference Time Compute"
☆17 · Jun 30, 2025 · Updated last year
yuexy / ST-AR
☆14 · Sep 22, 2025 · Updated 10 months ago
hanningzhang / ER-PRM
☆20 · Dec 14, 2024 · Updated last year
uq-project / UQ
UQ: Assessing Language Models on Unsolved Questions
☆30 · Aug 26, 2025 · Updated 11 months ago
amy-77 / ParisKV
[ICML'26] ParisKV: Fast and Drift-Robust KV-Cache Retrieval for Long-Context LLMs
☆30 · Jun 29, 2026 · Updated 3 weeks ago
weiyifan1023 / senator
NeurIPS 2025: Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMs
☆66 · Nov 21, 2025 · Updated 8 months ago
AxelSorensenDev / Eevee
An Easy Annotation Tool for Natural Language Processing
☆12 · May 17, 2024 · Updated 2 years ago
zhu-minjun / SafetyLock
Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!
☆11 · Oct 16, 2024 · Updated last year
DualityRL / multi-attempt
☆19 · Mar 10, 2025 · Updated last year
hkust-nlp / RL-Verifier-Robustness
From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.
☆24 · Oct 7, 2025 · Updated 9 months ago
weiyifan1023 / MenatQA
Code and Data for EMNLP 2023 Paper "MenatQA: A New Dataset for Testing the Temporal Comprehension and Reasoning Abilities of Large Langu…
☆14 · Apr 7, 2025 · Updated last year
uservan / speculative_thinking
☆34 · Oct 13, 2025 · Updated 9 months ago
RUCAIBox / LSVCR
☆14 · Apr 1, 2024 · Updated 2 years ago
kobayashikanna01 / Chain-of-Discussion
☆11 · May 28, 2024 · Updated 2 years ago
abdelfattah-lab / SplitReason
☆20 · Mar 18, 2026 · Updated 4 months ago
TrustedLLM / UnKE
☆24 · Feb 18, 2025 · Updated last year
ASTRAL-Group / LoRe
When Reasoning Meets Its Laws
☆38 · Jan 2, 2026 · Updated 6 months ago
scalable-model-editing / unified-model-editing
We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT.
☆29 · Dec 16, 2024 · Updated last year
GAIR-NLP / self-improvement-reversal
☆13 · Jul 14, 2024 · Updated 2 years ago
w-yibo / VTC-R1
VTC-R1: Vision-Text Compression for Efficient Long-Context Reasoning.
☆26 · Jul 20, 2026 · Updated last week
seoneun / T5-Question-Generation
SQuAD Question Generation module based on T5-large
☆18 · Aug 26, 2022 · Updated 3 years ago
thunlp / ACDiT
ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer
☆42 · Jan 29, 2026 · Updated 6 months ago
MozerWang / DEMO
[ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling
☆22 · Dec 16, 2024 · Updated last year
hkust-nlp / deepsearch-tts
Pushing Test-Time Scaling Limits of Deep Search with Asymmetric Verification
☆21 · Oct 8, 2025 · Updated 9 months ago
CLUEbenchmark / Math24o
Math24o: High School Olympiad Mathematics Chinese Benchmark
☆14 · Mar 27, 2025 · Updated last year
hzy312 / knowledge-r1
IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent
☆70 · May 13, 2025 · Updated last year
DavidHerel / semantics-preserving-encoder
Python library providing a simple, fully supervised sentence embedding technique for textual adversarial attacks.
☆13 · Dec 13, 2023 · Updated 2 years ago
LivingFutureLab / DeltaBench
☆46 · Mar 4, 2025 · Updated last year
ritzz-ai / PACS
☆31 · Sep 12, 2025 · Updated 10 months ago
LCM-Lab / L-CITEEVAL
Evaluating the faithfulness of long-context language models
☆30 · Oct 21, 2024 · Updated last year
Kwai-Klear / RLEP
RL with Experience Replay
☆58 · Jul 27, 2025 · Updated last year
dataSnail / RSpapers
Papers about recommender systems.
☆10 · May 18, 2021 · Updated 5 years ago
PhoneHarness / PhoneHarness
PhoneHarness runtime harness for mixed-action phone agents
☆35 · Jun 17, 2026 · Updated last month