AlignInc / aligner-replication

A reproduction of the paper "Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction"

☆22

Alternatives and similar repositories for aligner-replication

Users who are interested in aligner-replication are comparing it to the repositories listed below.

Zyphra / Zyda_processing
☆39Updated last year
dvlab-research / MR-GSM8K
Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs
☆51Updated last year
scottlogic-alex / prm800k-denorm
Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format
☆27Updated 2 years ago
mathllm / MathCoder2
☆70Updated last year
bradhilton / o1-chain-of-thought
o1 Chain of Thought Examples
☆33Updated last year
GAIR-NLP / ReAlign
Reformatted Alignment
☆113Updated last year
schauppi / Self-Rewarding-Language-Models
☆48Updated last year
recursal / GoldFinch-paper
GoldFinch and other hybrid transformer components
☆45Updated last year
18907305772 / FuseAI
FuseAI Project
☆87Updated 10 months ago
fangyuan-ksgk / Evolutionary-Model-Merge
Unofficial Implementation of Evolutionary Model Merging
☆41Updated last year
xufangzhi / phi-Decoding
[ACL 2025] An inference-time decoding strategy with adaptive foresight sampling
☆106Updated 6 months ago
allenai / easy-to-hard-generalization
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆48Updated last year
TRI-ML / linear_open_lm
A repository for research on medium sized language models.
☆78Updated last year
thomasgauthier / LLM-self-play
Minimal implementation of the paper "Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models" (arXiv:2401.01335)
☆29Updated last year
kyegomez / LM-Infinite
Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆40Updated last year
wuhy68 / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks (EMNLP'24)
☆147Updated last year
snu-mllab / Context-Memory
PyTorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24)
☆63Updated last year
SkyworkAI / MindLink
☆98Updated 4 months ago
casmlab / NPHardEval
Repository for NPHardEval, a quantified-dynamic benchmark of LLMs
☆61Updated last year
sanyalsunny111 / LLM-Inheritune
This is the official repository for Inheritune.
☆115Updated 9 months ago
RobertCsordas / moe_attention
Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention"
☆102Updated last year
open-compass / GPassK
[ACL 2025] Are Your LLMs Capable of Stable Reasoning?
☆31Updated 4 months ago
WENGSYX / LMTuner
LMTuner: Make the LLM Better for Everyone
☆37Updated 2 years ago
BBuf / RWKV-World-HF-Tokenizer
☆34Updated last year
microsoft / simulated-trial-and-error
☆122Updated last year
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite.
☆35Updated last year
GAIR-NLP / Entropy-ABF
Official implementation for 'Extending LLMs’ Context Window with 100 Samples'
☆81Updated last year
NormXU / Consistent-DynamicNTKRoPE
An Experiment on Dynamic NTK Scaling RoPE
☆64Updated 2 years ago
martin-wey / CodeUltraFeedback
CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)
☆72Updated last year
GeneZC / MiniMA
Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"
☆102Updated last year