OrangeInSouth/DeePEn

A method of ensemble learning for heterogeneous large language models.

☆62

Alternatives and similar repositories for DeePEn

Users interested in DeePEn are comparing it to the repositories listed below.

OrangeInSouth / LSSD
☆28 · Oct 19, 2022 · Updated 3 years ago
starrYYxuan / UniTE
☆17 · Nov 20, 2024 · Updated last year
zkzhou126 / AI-for-Research
From Hypothesis to Publication: A Comprehensive Survey of AI-Driven Research Support Systems
☆19 · Jun 29, 2026 · Updated 3 weeks ago
liutianlin0121 / decoding-time-realignment
Implementation of "Decoding-time Realignment of Language Models", ICML 2024.
☆21 · Jun 17, 2024 · Updated 2 years ago
whongzhong / MMHalSnowball
Official resource for paper Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models (ACL 20…
☆18 · Aug 12, 2024 · Updated last year
xydaytoy / EVA
☆14 · Apr 16, 2024 · Updated 2 years ago
xwgeng / RewriteNAT
Learning to Rewrite for Non-Autoregressive Neural Machine Translation
☆21 · Dec 23, 2021 · Updated 4 years ago
USTC-StarTeam / ZIP
arXiv 2024 | ZIP: entropy-law data selection for efficient LLM alignment.
☆28 · Jun 10, 2026 · Updated last month
LLMkvsys / rethink-kv-compression
☆24 · Mar 7, 2025 · Updated last year
NJUDeepEngine / CAEF
Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"
☆11 · Oct 11, 2024 · Updated last year
ritikamangla / QSalience
https://arxiv.org/abs/2404.10917
☆14 · Mar 18, 2025 · Updated last year
xcfcode / Summarization-Papers
Summarization Papers
☆1,008 · Jul 15, 2023 · Updated 3 years ago
cindyxinyiwang / multiDDS
Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"
☆23 · May 26, 2021 · Updated 5 years ago
ucfnlp / joint-parse-n-summarize
(AAAI'20) The source code for the paper "Joint Parsing and Generation for Abstractive Summarization".
☆15 · Apr 3, 2020 · Updated 6 years ago
hahahawu / Long-to-Short-via-Model-Merging
Model merging is a highly efficient approach for long-to-short reasoning.
☆103 · Oct 15, 2025 · Updated 9 months ago
LuckyyySTA / GOLF
☆18 · Mar 16, 2026 · Updated 4 months ago
ucfnlp / sent-fusion-transformers
Code, data, and models for the EMNLP 2020 paper "Learning to Fuse Sentences with Transformers for Summarization"
☆16 · Nov 2, 2022 · Updated 3 years ago
LHL3341 / MetaLadder
MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)
☆12 · Apr 18, 2025 · Updated last year
tanganke / weight-ensembling_MoE
Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"
☆32 · Jun 7, 2024 · Updated 2 years ago
jiazhihao / attention_superoptimizer
An Attention Superoptimizer
☆22 · Jan 20, 2025 · Updated last year
john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆36 · Sep 24, 2024 · Updated last year
yaoching0 / GaC
☆53 · Oct 8, 2024 · Updated last year
r-three / fib
☆26 · Nov 21, 2022 · Updated 3 years ago
LARK-AI-Lab / EnvFactory
The official paper for EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL.
☆82 · Jun 5, 2026 · Updated last month
skolouri / TopoTrans
TopoTrans: Optimal Transport meets Topological Data Analysis
☆14 · Apr 20, 2023 · Updated 3 years ago
NingMiao / InstaAug
☆15 · Dec 28, 2022 · Updated 3 years ago
rbawden / mt-bigscience
Evaluation results for Machine Translation within the BigScience project
☆11 · May 15, 2023 · Updated 3 years ago
ZhangShiyue / extractive_is_not_faithful
☆17 · May 19, 2023 · Updated 3 years ago
shawnkx / Fully-NAT
☆17 · Jul 5, 2022 · Updated 4 years ago
muirbench / MuirBench
A Comprehensive Benchmark for Robust Multi-image Understanding
☆21 · Sep 4, 2024 · Updated last year
starrYYxuan / LeCo
This is the implementation of LeCo
☆33 · Jan 20, 2025 · Updated last year
TsinghuaC3I / CoGenesis
[ACL 2024, Main Conference] CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Fol…
☆15 · Aug 7, 2024 · Updated last year
jinhaoduan / SAR
[ACL 2024] Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models
☆62 · Sep 4, 2024 · Updated last year
cambridge-mlg / neural_diffusion_processes
☆12 · Jun 13, 2023 · Updated 3 years ago
songmzhang / DSKDv2
The official implementation of the paper "A Dual-Space Framework for General Knowledge Distillation of Large Language Models".
☆18 · Jan 4, 2026 · Updated 6 months ago
yegcjs / mixinglaws
☆113 · Jul 15, 2025 · Updated last year
clinicalml / co-llm
Co-LLM: Learning to Decode Collaboratively with Multiple Language Models
☆128 · May 7, 2024 · Updated 2 years ago
Arvid-pku / ATOKE
[AAAI 2024] History Matters: Temporal Knowledge Editing in Large Language Model
☆13 · Dec 17, 2023 · Updated 2 years ago
BaiTheBest / SparseLLM
Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)
☆70 · Mar 27, 2025 · Updated last year