☆77 · Apr 29, 2024 · Updated last year
Alternatives and similar repositories for btm
Users who are interested in btm are comparing it to the libraries listed below
- Benchmark API for Multidomain Language Modeling · ☆25 · Aug 26, 2022 · Updated 3 years ago
- Patching open-vocabulary models by interpolating weights · ☆91 · Sep 28, 2023 · Updated 2 years ago
- DEMix Layers for Modular Language Modeling · ☆54 · Updated this week
- ☆26 · May 30, 2023 · Updated 2 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights · ☆19 · Oct 9, 2022 · Updated 3 years ago
- ☆13 · Apr 24, 2022 · Updated 3 years ago
- Code base for the EMNLP 2021 Findings paper: Cartography Active Learning · ☆14 · Jun 3, 2025 · Updated 9 months ago
- Code for "Merging Text Transformers from Different Initializations" · ☆20 · Feb 2, 2025 · Updated last year
- Code repository for the c-BTM paper · ☆108 · Sep 26, 2023 · Updated 2 years ago
- ☆15 · Feb 21, 2024 · Updated 2 years ago
- SILO Language Models code repository · ☆83 · Feb 23, 2024 · Updated 2 years ago
- ☆91 · Aug 18, 2024 · Updated last year
- ☆13 · Jun 17, 2025 · Updated 8 months ago
- ☆48 · Jan 21, 2024 · Updated 2 years ago
- decontamination · ☆26 · Dec 3, 2025 · Updated 3 months ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language · ☆74 · Mar 2, 2024 · Updated 2 years ago
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs" · ☆32 · Sep 13, 2024 · Updated last year
- Tensors and Dynamic neural networks in Python with strong GPU acceleration · ☆11 · Jun 2, 2024 · Updated last year
- Code for the paper "Pretrained Models for Multilingual Federated Learning" at NAACL 2022 · ☆11 · Aug 9, 2022 · Updated 3 years ago
- 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment · ☆11 · Apr 6, 2025 · Updated 10 months ago
- ☆10 · Oct 15, 2019 · Updated 6 years ago
- Data and code: "Answering legal questions from laymen in German civil law system", Büttner & Habernal, EACL'24 · ☆14 · Mar 2, 2024 · Updated 2 years ago
- LLM-Merging: Building LLMs Efficiently through Merging · ☆209 · Sep 24, 2024 · Updated last year
- Few-shot Learning with Auxiliary Data · ☆31 · Dec 8, 2023 · Updated 2 years ago
- [ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang · ☆16 · May 4, 2023 · Updated 2 years ago
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way · ☆18 · Nov 4, 2025 · Updated 4 months ago
- ☆31 · Dec 13, 2023 · Updated 2 years ago
- ☆11 · Jun 23, 2022 · Updated 3 years ago
- [AAAI 2021] "ROSITA: Refined BERT cOmpreSsion with InTegrAted techniques", Yuanxin Liu, Zheng Lin, Fengcheng Yuan · ☆14 · Oct 18, 2022 · Updated 3 years ago
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets. · ☆13 · Nov 21, 2023 · Updated 2 years ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite. · ☆34 · Mar 2, 2024 · Updated 2 years ago
- latest p2p and http addresses for peermaps datasets · ☆12 · Jul 14, 2022 · Updated 3 years ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs · ☆94 · Nov 17, 2024 · Updated last year
- ☆54 · May 8, 2023 · Updated 2 years ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper · ☆59 · Oct 29, 2023 · Updated 2 years ago
- ☆131 · Aug 18, 2022 · Updated 3 years ago
- Data preparation code for Amber 7B LLM · ☆93 · May 10, 2024 · Updated last year
- [CVPRW 2023] "Many-Task Federated Learning: A New Problem Setting and A Simple Baseline" by Ruisi Cai, Xiaohan Chen, Shiwei Liu, Jayanth … · ☆13 · Aug 28, 2023 · Updated 2 years ago
- Simple next-token-prediction for RLHF · ☆229 · Sep 30, 2023 · Updated 2 years ago