ZetangForward / LCM_Stack

Code for paper: Long cOntext aliGnment via efficient preference Optimization

☆13

Alternatives and similar repositories for LCM_Stack

Users who are interested in LCM_Stack are comparing it to the repositories listed below.

john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆34 Updated 7 months ago
jinzhuoran / RAG-RewardBench
RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
☆16 Updated 4 months ago
alessiodevoto / l2compress
Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."
☆12 Updated 5 months ago
yale-nlp / refdpo
☆16 Updated 9 months ago
Infini-AI-Lab / S2FT
☆17 Updated 4 months ago
open-compass / GPassK
Official Repository of Are Your LLMs Capable of Stable Reasoning?
☆25 Updated last month
RM-R1-UIUC / RM-R1
☆63 Updated last week
uservan / ThinkPO
☆20 Updated 2 months ago
DAMO-NLP-SG / LongPO
[ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization
☆36 Updated 2 months ago
DualityRL / multi-attempt
☆18 Updated 2 months ago
sail-sg / SkyLadder
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆29 Updated last month
dvlab-research / Q-LLM
This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"
☆50 Updated 10 months ago
VITA-Group / Ms-PoE
"Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding" Zhenyu Zhang, Runjin Chen, Shiw…
☆29 Updated last year
chtmp223 / suri
Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)
☆22 Updated 6 months ago
bigai-nlco / CREAM
[NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding
☆17 Updated 7 months ago
NuoJohnChen / JudgeLRM
☆24 Updated last month
JayZhang42 / SLED
SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433
☆25 Updated 5 months ago
Tomorrowdawn / top_nsigma
The official code repo and data hub of top_nsigma sampling strategy for LLMs.
☆24 Updated 3 months ago
HypherX / Evolution-Analysis
☆22 Updated 5 months ago
UCSB-NLP-Chang / KVLink
☆14 Updated last month
chenllliang / MMEvalPro
[NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs
☆24 Updated 7 months ago
RUCKBReasoning / CodeRM
The code of arXiv paper: "Dynamic Scaling of Unit Tests for Code Reward Modeling"
☆19 Updated 4 months ago
wwxu21 / CUT
Source code of "Reasons to Reject? Aligning Language Models with Judgments"
☆58 Updated last year
PKU-ML / LongPPL
Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"
☆72 Updated last month
LAMDASZ-ML / Self-Backtracking
☆45 Updated 3 months ago
Zanette-Labs / SpeculativeRejection
[NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection
☆44 Updated 6 months ago
linkedin / ControlLLM
Control LLM
☆14 Updated last month
ZetangForward / L-CITEEVAL
L-CITEEVAL: DO LONG-CONTEXT MODELS TRULY LEVERAGE CONTEXT FOR RESPONDING?
☆23 Updated 6 months ago
thunlp / SparsingLaw
The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".
☆21 Updated 6 months ago
MozerWang / AMPO
[arxiv: 2505.02156] Think on your Feet: Adaptive Thinking via Reinforcement Learning for Social Agents
☆16 Updated last week