aadityasingh / HARPLinks

☆22

Alternatives and similar repositories for HARP

Users that are interested in HARP are comparing it to the libraries listed below

Sorting:

sail-sg / VeriFree
Reinforcing General Reasoning without Verifiers
☆90Updated 3 months ago
sail-sg / SkyLadder
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆35Updated last month
shenao-zhang / SELM
The official implementation of Self-Exploring Language Models (SELM)
☆64Updated last year
RobertCsordas / moeut
☆85Updated last year
LAMDASZ-ML / Self-Backtracking
☆50Updated 8 months ago
ScalingIntelligence / large_language_monkeys
☆106Updated last year
imagination-research / lbt
[NeurIPS 2024] Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Study
☆55Updated 10 months ago
Leooyii / LCEG
Long Context Extension and Generalization in LLMs
☆61Updated last year
sail-sg / feedback-conditional-policy
Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"
☆46Updated 2 weeks ago
hamishivi / automated-instruction-selection
Exploration of automated dataset selection approaches at large scales.
☆47Updated 7 months ago
hamishivi / EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…
☆75Updated last year
complex-reasoning / RPG
Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)
☆51Updated last week
test-time-interaction / TTI
☆62Updated 4 months ago
wmn-231314 / diffusion-data-constraint
Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…
☆101Updated last month
TsinghuaC3I / SSRL
SSRL: Self-Search Reinforcement Learning
☆147Updated last month
haozheji / exact-optimization
ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment
☆57Updated last year
Infini-AI-Lab / gsm_infinite
☆55Updated 4 months ago
yuleiqin / RAIF
A Recipe for Building LLM Reasoners to Solve Complex Instructions
☆26Updated last week
sail-sg / scaling-with-vocab
[NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623
☆87Updated last year
katiekang1998 / reasoning_generalization
☆33Updated 9 months ago
SynthLabsAI / big-math
A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models
☆65Updated 7 months ago
IBM / ColPret
Efficient Scaling laws and collaborative pretraining.
☆18Updated last month
HKUNLP / critic-rl
[ICML 2025] Teaching Language Models to Critique via Reinforcement Learning
☆114Updated 5 months ago
MLE-Dojo / MLE-Dojo
☆74Updated last month
hkust-nlp / B-STaR
B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
☆85Updated 4 months ago
SalesforceAIResearch / LaTRO
☆122Updated 7 months ago
sail-sg / dice
Official implementation of Bootstrapping Language Models via DPO Implicit Rewards
☆44Updated 6 months ago
SalesforceAIResearch / GemFilter
☆85Updated 9 months ago
justinlovelace / Diffusion-Guided-LM
☆28Updated last year
allenai / IFBench
☆80Updated 3 weeks ago