A method of ensemble learning for heterogeneous large language models.
☆64 Aug 7, 2024 Updated last year
Alternatives and similar repositories for DeePEn
Users who are interested in DeePEn are comparing it to the libraries listed below.
- ☆28 Oct 19, 2022 Updated 3 years ago
- ☆17 Nov 20, 2024 Updated last year
- From Hypothesis to Publication: A Comprehensive Survey of AI-Driven Research Support Systems ☆17 Nov 23, 2025 Updated 3 months ago
- Implementation of "Decoding-time Realignment of Language Models", ICML 2024. ☆21 Jun 17, 2024 Updated last year
- ☆27 Jul 11, 2024 Updated last year
- Official resource for paper Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models (ACL 20… ☆15 Aug 12, 2024 Updated last year
- ☆22 Mar 7, 2025 Updated 11 months ago
- ☆26 Nov 21, 2022 Updated 3 years ago
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025) ☆11 Apr 18, 2025 Updated 10 months ago
- ☆24 Oct 31, 2025 Updated 4 months ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines" ☆11 Oct 11, 2024 Updated last year
- Learning to Rewrite for Non-Autoregressive Neural Machine Translation ☆21 Dec 23, 2021 Updated 4 years ago
- TopoTrans: Optimal Transport meets Topological Data Analysis ☆14 Apr 20, 2023 Updated 2 years ago
- https://arxiv.org/abs/2404.10917 ☆14 Mar 18, 2025 Updated 11 months ago
- ☆14 Dec 28, 2022 Updated 3 years ago
- ☆13 Mar 26, 2019 Updated 6 years ago
- An Attention Superoptimizer ☆22 Jan 20, 2025 Updated last year
- ☆13 Jun 13, 2023 Updated 2 years ago
- Python3 implementation of the paper [Large-scale optimal transport map estimation using projection pursuit] ☆15 Feb 24, 2021 Updated 5 years ago
- ☆17 Jul 5, 2022 Updated 3 years ago
- A Comprehensive Benchmark for Robust Multi-image Understanding ☆19 Sep 4, 2024 Updated last year
- Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024) ☆67 Mar 27, 2025 Updated 11 months ago
- ☆17 May 19, 2023 Updated 2 years ago
- (AAAI'20) The source code for the paper "Joint Parsing and Generation for Abstractive Summarization". ☆15 Apr 3, 2020 Updated 5 years ago
- [ICML 2025] TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation ☆122 May 19, 2025 Updated 9 months ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers ☆23 Feb 9, 2025 Updated last year
- This repository presents the original implementation of Pretraining Data Detection for Large Language Models: A Divergence-based Calibrat… ☆22 May 21, 2025 Updated 9 months ago
- ☆20 Nov 3, 2024 Updated last year
- Model merging is a highly efficient approach for long-to-short reasoning. ☆100 Oct 15, 2025 Updated 4 months ago
- ☆33 Jun 3, 2025 Updated 8 months ago
- Code for the ACL2022 main conference paper "A Variational Hierarchical Model for Neural Cross-Lingual Summarization" ☆18 Sep 5, 2022 Updated 3 years ago
- [ICLR 2025] Released code for paper "Spurious Forgetting in Continual Learning of Language Models" ☆59 May 9, 2025 Updated 9 months ago
- A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble" ☆200 Dec 21, 2025 Updated 2 months ago
- AutoHallusion Codebase (EMNLP 2024) ☆22 Dec 6, 2024 Updated last year
- ☆31 Sep 12, 2025 Updated 5 months ago
- Getting Starting with NIMBUS-CORE ☆10 Dec 16, 2023 Updated 2 years ago
- A set of tests for evaluating large-scale algorithms for Wasserstein-1 transport computation (NeurIPS'22). ☆24 Sep 9, 2024 Updated last year
- ☆25 Aug 23, 2024 Updated last year
- Implementation of ICLR 2022 paper "Enhancing Cross-lingual Transfer by Manifold Mixup". ☆21 May 25, 2022 Updated 3 years ago