[NeurIPS 2025] MergeBench: A Benchmark for Merging Domain-Specialized LLMs
☆43Feb 11, 2026Updated 2 weeks ago
Alternatives and similar repositories for MergeBench
Users who are interested in MergeBench are comparing it to the repositories listed below.
- Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation"☆22Feb 14, 2024Updated 2 years ago
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆14Jun 26, 2025Updated 8 months ago
- Master thesis: Exploring bias in German NLG (GPT-3 & GerPT-2). Applies regard classification and bias mitigation triggers.☆16Sep 25, 2024Updated last year
- PathPiece tokenizer☆13Nov 10, 2024Updated last year
- Training code for Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 3 months ago
- ☆16May 14, 2024Updated last year
- List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond☆13Aug 15, 2022Updated 3 years ago
- Implementation of the paper 'Sentence Bottleneck Autoencoders from Transformer Language Models'☆17Mar 14, 2022Updated 3 years ago
- Contextualized per-token embeddings☆34May 11, 2025Updated 9 months ago
- ✂️ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models☆36Oct 1, 2025Updated 4 months ago
- BPE modification that removes intermediate tokens during tokenizer training.☆26Nov 25, 2024Updated last year
- Data for the HIPE 2022 shared task.☆21Nov 29, 2023Updated 2 years ago
- ☆44Feb 11, 2026Updated 2 weeks ago
- Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic☆32Feb 18, 2026Updated last week
- Official implementation of "OptMerge: Unifying Multimodal LLM Capabilities and Modalities via Model Merging".☆43Oct 30, 2025Updated 4 months ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆30Jan 25, 2023Updated 3 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆28Oct 3, 2021Updated 4 years ago
- 🚀🤗 A collection of templates for Hugging Face Spaces☆35Oct 9, 2023Updated 2 years ago
- Notebooks and other course materials for Emory QTM 340 (Fall 2022)☆12Dec 13, 2022Updated 3 years ago
- Crosslingual Question Answering for African Languages☆30Sep 27, 2024Updated last year
- ☆81Mar 17, 2022Updated 3 years ago
- Download flickr8k, flickr30k image caption datasets☆42Feb 6, 2024Updated 2 years ago
- State-of-the-art paired encoder and decoder models (17M-1B params)☆58Aug 6, 2025Updated 6 months ago
- Mathematical foundations of data analysis, Winter semester 22-23☆13Jan 31, 2023Updated 3 years ago
- German GPT-2 model☆32Aug 17, 2021Updated 4 years ago
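- Some of my practice on algorithms :) This repository holds some of my solutions to problems from LeetCode and 剑指Offer, with the necessary comments kept in the code. They are not necessarily optimal, but I try to keep the code concise and easy to understand. Other problem sets will be integrated later; if you find any errors, I hope you will tell me or help me…☆11Dec 3, 2024Updated last year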
- ☆15Feb 12, 2026Updated 2 weeks ago
- Implementation of various handwritten text line segmentation☆10Jan 6, 2020Updated 6 years ago
- ☆15Oct 24, 2023Updated 2 years ago
- Official Repository of RefChartQA: Grounding Visual Answer on Chart Images through Instruction Tuning☆14Jul 9, 2025Updated 7 months ago
- Code for the paper "A Boolean Task Algebra For Reinforcement Learning"☆11Dec 8, 2022Updated 3 years ago
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"☆13May 12, 2023Updated 2 years ago
- ☆53Feb 10, 2025Updated last year
- Linear Attention for Efficient Bidirectional Sequence Modeling☆15May 13, 2025Updated 9 months ago
- TensorRT In Docker☆11Dec 7, 2024Updated last year
- ☆10Oct 2, 2024Updated last year
- Scrapes the entire 汽车之家 (Autohome) website☆10Dec 26, 2019Updated 6 years ago
- ☆12Dec 15, 2022Updated 3 years ago
- Utilities to parse type information and JSDoc annotations from TypeScript source files, and render Markdown documentation☆12Jun 24, 2023Updated 2 years ago