thunlp/Delta-CoMe

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/thunlp/Delta-CoMe)

thunlp / Delta-CoMe

Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024

☆59

Alternatives and similar repositories for Delta-CoMe

Users that are interested in Delta-CoMe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

thunlp / LoRAFlow
View on GitHub
ACL 2024: LoRA-Flow Dynamic LoRA Fusion for Large Language Models in Generative Tasks
☆25Oct 9, 2024Updated last year
MrZilinXiao / ProxyThinker
View on GitHub
[ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.
☆22Sep 24, 2025Updated 10 months ago
delyan-boychev / imaginet
View on GitHub
☆11Apr 25, 2026Updated 3 months ago
yale-nlp / refdpo
View on GitHub
☆16Jul 23, 2024Updated 2 years ago
harveyhuang18 / EMR_Merging
View on GitHub
[NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging
☆82Mar 1, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Miaow-Lab / RLVR-Linearity
View on GitHub
[arXiv] "Linear Dynamics in the RLVR Training of Large Language Models"
☆17May 25, 2026Updated 2 months ago
YihongT / LLMSynthor
View on GitHub
☆21Jul 3, 2025Updated last year
bzantium / pytorch-PKD-for-BERT-compression
View on GitHub
☆15Sep 10, 2019Updated 6 years ago
hahahawu / Long-to-Short-via-Model-Merging
View on GitHub
Model merging is a highly efficient approach for long-to-short reasoning.
☆103Oct 15, 2025Updated 9 months ago
tanganke / opcm
View on GitHub
official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"
☆25Oct 11, 2025Updated 9 months ago
zwhong714 / Hybrid-Policy-Distillation
View on GitHub
[ICML 2026] Hybrid Policy Distillation (HPD) is a practical distillation framework for reasoning-oriented language models. This repositor…
☆24Apr 24, 2026Updated 3 months ago
yule-BUAA / MergeLLM
View on GitHub
Codes for Merging Large Language Models
☆37Aug 7, 2024Updated last year
chiefovoavicii / MAD-OPD
View on GitHub
Official code for "Breaking the Ceiling in On-Policy Distillation via Multi-Agent Debate" (arXiv:2605.01347).
☆33May 7, 2026Updated 2 months ago
leoli646 / Adapter-X
View on GitHub
Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision
☆11Jul 22, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
DamienIrving / ocean-analysis
View on GitHub
Code used for analysis and visualiation of ocean model data during my postdoc
☆12Mar 1, 2023Updated 3 years ago
evanthebouncy / 20Q-selfplay
View on GitHub
LLM play 20questions with itself
☆13Mar 31, 2023Updated 3 years ago
SparksJoe / Prism
View on GitHub
A Framework for Decoupling and Assessing the Capabilities of VLMs
☆44Jun 28, 2024Updated 2 years ago
wzhuang-xmu / LoSA
View on GitHub
[ICLR 2025] Official implementation of paper "Dynamic Low-Rank Sparse Adaptation for Large Language Models".
☆25Mar 16, 2025Updated last year
Qcompiler / MIXQ
View on GitHub
MIXQ: Taming Dynamic Outliers in Mixed-Precision Quantization by Online Prediction
☆94Oct 29, 2024Updated last year
princeton-nlp / ELIZA-Transformer
View on GitHub
[NAACL 2025] Representing Rule-based Chatbots with Transformers
☆23Feb 9, 2025Updated last year
Adaxry / Unified_Layer_Skipping
View on GitHub
☆15Apr 11, 2024Updated 2 years ago
bzhangGo / st_from_scratch
View on GitHub
Revisiting End-to-End Speech-to-Text Translation From Scratch
☆13Feb 21, 2023Updated 3 years ago
mutonix / pyramidinfer
View on GitHub
☆47Nov 25, 2024Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
SWE-Gym / SWE-Bench-Fork
View on GitHub
☆13Mar 5, 2025Updated last year
InternLM / Condor
View on GitHub
[ACL 2025] An official pytorch implement of the paper: Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement
☆40May 28, 2025Updated last year
qualidea1217 / HiPRAG
View on GitHub
HiPRAG (Hierarchical Process Rewards for Efficient Agentic Retrieval Augmented Generation) is a reinforcement learning method designed fo…
☆26Oct 10, 2025Updated 9 months ago
CLUEbenchmark / SuperCLUE-Llama3-Chinese
View on GitHub
Llama3开源模型中文版-全方位测评，基于SuperCLUE基准 | Llama3 Chinese Evaluation with SuperCLUE
☆16Apr 21, 2024Updated 2 years ago
RSG-Group / hyper-background
View on GitHub
Easily change the background of your Hyper terminal!
☆11Nov 29, 2024Updated last year
MengRes / Uncertain-Label
View on GitHub
This repository includes the introduction to uncertain label in Chest X-Ray diagnosis.
☆10Oct 20, 2024Updated last year
EnnengYang / Efficient-WEMoE
View on GitHub
Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.
☆16Oct 28, 2024Updated last year
radi-cho / RSTOD
View on GitHub
Auxiliary tasks for task-oriented dialogue systems. Published in ICNLSP'22 and indexed in the ACL Anthology.
☆17Feb 27, 2023Updated 3 years ago
Hesse73 / RLVR-Directions
View on GitHub
Source Code for our ICLR'26 paper
☆17Feb 22, 2026Updated 5 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Unit26Y21 / UrbanLab
View on GitHub
Spitzers Architecture School Urban Lab for Unit 26. This repository explores designing and codifying urban systems from the bottom up in …
☆14Mar 29, 2022Updated 4 years ago
ZKI-PH-ImageAnalysis / Next-Generation-Loss
View on GitHub
☆12Jan 8, 2025Updated last year
Cornell-RL / drpo
View on GitHub
Dateset Reset Policy Optimization
☆30Apr 12, 2024Updated 2 years ago
sustech-nlp / SPPO
View on GitHub
[ACL 2026 Oral] SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks official repos.
☆26May 18, 2026Updated 2 months ago
FAVOR-Bench / FAVOR-Bench
View on GitHub
Accepted By The 39th Annual Conference on Neural Information Processing Systems Datasets and Benchmarks Track
☆25Nov 17, 2025Updated 8 months ago
xverse-ai / XVERSE-V-13B
View on GitHub
☆78May 6, 2024Updated 2 years ago
thunlp / ChartCoder
View on GitHub
[ACL'25 Main] ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation
☆79Dec 8, 2025Updated 7 months ago