HKUDS / SepLLMLinks

[ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator"

☆532

Alternatives and similar repositories for SepLLM

Users that are interested in SepLLM are comparing it to the libraries listed below

Sorting:

LZY-the-boys / Twin-Merging
[NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging
☆136Updated 4 months ago
Facico / GOAT-PEFT
[ICML2025] Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment
☆121Updated last month
FanbinLu / STEVE-R1
R1-like Computer-use Agent
☆80Updated 4 months ago
RLHFlow / Online-DPO-R1
Codebase for Iterative DPO Using Rule-based Rewards
☆255Updated 3 months ago
cmriat / l0
A scalable, end-to-end training pipeline for general-purpose agents
☆349Updated last month
dle666 / R-CoT
Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models
☆177Updated 9 months ago
URSA-MATH / URSA-MATH
☆63Updated 4 months ago
yfzhang114 / r1_reward
✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning
☆246Updated 2 months ago
gersteinlab / ML-Bench
ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.098…
☆302Updated this week
HJYao00 / Mulberry
Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS
☆1,208Updated 4 months ago
S1s-Z / NOVA
[ACL'25] Code for "Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering"
☆20Updated 2 weeks ago
trestad / Noisy-Rewards-in-Learning-to-Reason
☆103Updated 2 months ago
Wuyxin / collabllm
(ICML'25 Outstanding) CollabLLM: From Passive Responders to Active Collaborators
☆182Updated 2 weeks ago
xinghaow99 / BitStack
[ICLR 2025] BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments
☆36Updated 5 months ago
mlpod / OpenSFT
☆45Updated 4 months ago
KodCode-AI / kodcode
✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framwork
☆251Updated 2 weeks ago
HKUST-KnowComp / CoT-ICL-Eval
Official Repository for Paper: The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning
☆51Updated 3 months ago
ZJU-REAL / GUI-G2
A Gaussian dense reward framework for GUI grounding training
☆204Updated 2 weeks ago
S1s-Z / CANOE
Code for "Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning"
☆33Updated 3 weeks ago
rllm-team / rllm
Pytorch Library for Relational Table Learning with LLMs.
☆432Updated last month
Tencent-Hunyuan / ArtifactsBenchmark
☆221Updated this week
HKUST-KnowComp / Awesome-LLM-Scientific-Discovery
From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery
☆211Updated this week
WisconsinAIVision / YoChameleon
🦎 Yo'Chameleon: Your Personalized Chameleon (CVPR 2025)
☆143Updated 2 months ago
ByteDance-Seed / EvaLearn
EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in chall…
☆422Updated 3 weeks ago
AlgRUC / JittorGeometric
JittorGeometric is a Jittor-based graph machine learning library.
☆160Updated this week
OpenDCAI / RARE
Official implementation of RARE: Retrieval-Augmented Reasoning Modeling
☆183Updated 2 months ago
Alpha-Innovator / InternAgent
When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification
☆478Updated last week
dvlab-research / VisionThink
Efficient Reasoning Vision Language Models
☆337Updated 2 weeks ago
pat-jj / DeepRetrieval
[COLM'25] DeepRetrieval - 🔥 Training Search Agent with Retrieval Outcomes via Reinforcement Learning
☆601Updated last month
Zefan-Cai / R-KV
R-KV: Redundancy-aware KV Cache Compression for Reasoning Models
☆1,097Updated last month