LCM-Lab / LCM_StackLinks

Code for paper: Long cOntext aliGnment via efficient preference Optimization

☆13

Alternatives and similar repositories for LCM_Stack

Users that are interested in LCM_Stack are comparing it to the libraries listed below

Sorting:

yale-nlp / refdpo
☆15Updated last year
john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆35Updated last year
DualityRL / multi-attempt
☆19Updated 6 months ago
yuleiqin / RAIF
A Recipe for Building LLM Reasoners to Solve Complex Instructions
☆24Updated 2 months ago
open-compass / GPassK
[ACL 2025] Are Your LLMs Capable of Stable Reasoning?
☆30Updated 2 months ago
DAMO-NLP-SG / LongPO
[ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization
☆40Updated 7 months ago
Infini-AI-Lab / S2FT
☆19Updated 9 months ago
jinzhuoran / RAG-RewardBench
RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
☆16Updated 9 months ago
HypherX / Evolution-Analysis
☆23Updated 9 months ago
alessiodevoto / l2compress
Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."
☆16Updated 9 months ago
mathllm / Step-Controlled_DPO
☆22Updated last year
dvlab-research / Q-LLM
This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"
☆55Updated last year
jiwonsong-dev / ReasoningPathCompression
Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"
☆22Updated 4 months ago
linkedin / ControlLLM
Control LLM
☆19Updated 6 months ago
wwxu21 / CUT
Source code of "Reasons to Reject? Aligning Language Models with Judgments"
☆58Updated last year
uservan / ThinkPO
☆18Updated 2 months ago
leezythu / FocusLLM
FocusLLM: Scaling LLM’s Context by Parallel Decoding
☆43Updated 9 months ago
MikaStars39 / StableMask
PyTorch implementation of StableMask (ICML'24)
☆14Updated last year
SalesforceAIResearch / GemFilter
☆86Updated 8 months ago
xufangzhi / phi-Decoding
[ACL 2025] An inference-time decoding strategy with adaptive foresight sampling
☆104Updated 4 months ago
sail-sg / SkyLadder
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆34Updated last month
thunlp / SparsingLaw
The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".
☆26Updated 10 months ago
tianyi-lab / C3PO
Code for "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"
☆18Updated 5 months ago
Linking-ai / SCOPE
(ACL 2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation
☆34Updated 4 months ago
yayayacc / MUR
☆45Updated last week
LAMDASZ-ML / Self-Backtracking
☆48Updated 7 months ago
bigai-nlco / CREAM
[NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding
☆18Updated 11 months ago
SjJ1017 / CiteLab
☆16Updated 2 months ago
allenai / easy-to-hard-generalization
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆48Updated last year
Kwai-Klear / KlearReasoner
Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization
☆71Updated last week