hadasah/btm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hadasah/btm)

hadasah / btm

☆79

Alternatives and similar repositories for btm

Users that are interested in btm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kernelmachine / demix-data
View on GitHub
Benchmark API for Multidomain Language Modeling
☆25Aug 26, 2022Updated 3 years ago
joeljang / ELM
View on GitHub
[ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning
☆99Apr 26, 2023Updated 3 years ago
kernelmachine / demix
View on GitHub
DEMix Layers for Modular Language Modeling
☆54Feb 25, 2026Updated 4 months ago
mlfoundations / patching
View on GitHub
Patching open-vocabulary models by interpolating weights
☆91Sep 28, 2023Updated 2 years ago
CHLee0801 / TemporalWikiDatasets
View on GitHub
☆13Apr 24, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
nverma1 / merging-text-transformers
View on GitHub
Code for "Merging Text Transformers from Different Initializations"
☆20Feb 2, 2025Updated last year
jenni-ai / T2FW
View on GitHub
Fine-Tuning Pre-trained Transformers into Decaying Fast Weights
☆20Oct 9, 2022Updated 3 years ago
kernelmachine / cbtm
View on GitHub
Code repository for the c-BTM paper
☆109Sep 26, 2023Updated 2 years ago
RobertCsordas / moeut
View on GitHub
☆93Aug 18, 2024Updated last year
kernelmachine / silo-lm
View on GitHub
SILO Language Models code repository
☆83Feb 23, 2024Updated 2 years ago
VITA-Group / Robust_Weight_Signatures
View on GitHub
[ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang
☆16May 4, 2023Updated 3 years ago
jjzha / cartography-al
View on GitHub
Code base for the EMNLP 2021 Findings paper: Cartography Active Learning
☆14Jun 3, 2025Updated last year
CarperAI / squeakily
View on GitHub
A library for squeakily cleaning and filtering language datasets.
☆50Jul 10, 2023Updated 3 years ago
unbiarirang / Fixed-Input-Parameterization
View on GitHub
This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"
☆32Sep 13, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
llm-merging / LLM-Merging
View on GitHub
LLM-Merging: Building LLMs Efficiently through Merging
☆208Sep 24, 2024Updated last year
malteos / clp-transfer
View on GitHub
Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning
☆30Jan 25, 2023Updated 3 years ago
UKPLab / on-emergence
View on GitHub
Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning
☆33Jan 9, 2025Updated last year
yanaiela / pararel
View on GitHub
☆49Jan 21, 2024Updated 2 years ago
yueyu1030 / Patron
View on GitHub
[ACL 2023] The code for our ACL'23 paper Cold-Start Data Selection for Few-shot Language Model Fine-tuning: A Prompt-Based Uncertainty Pr…
☆24Jun 1, 2024Updated 2 years ago
alon-albalak / FLAD
View on GitHub
Few-shot Learning with Auxiliary Data
☆31Dec 8, 2023Updated 2 years ago
zouharvi / tokenization-scorer
View on GitHub
Simple-to-use scoring function for arbitrarily tokenized texts.
☆50Feb 19, 2025Updated last year
ottowg / gsap-ner
View on GitHub
☆10Oct 2, 2024Updated last year
llyx97 / Rosita
View on GitHub
[AAAI 2021] "ROSITA: Refined BERT cOmpreSsion with InTegrAted techniques", Yuanxin Liu, Zheng Lin, Fengcheng Yuan
☆14Oct 18, 2022Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
cisnlp / MEXA
View on GitHub
[ACL 2025] 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment
☆11Apr 6, 2025Updated last year
jb-01 / LoRA-TLE
View on GitHub
Token-level adaptation of LoRA matrices for downstream task generalization.
☆15Apr 14, 2024Updated 2 years ago
MaLA-LM / GlotEval
View on GitHub
GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way
☆18Nov 4, 2025Updated 8 months ago
nicola-decao / KnowledgeEditor
View on GitHub
Code for Editing Factual Knowledge in Language Models
☆142Jan 28, 2022Updated 4 years ago
ptoulis / implicit-sgd
View on GitHub
Using stochastic gradient descent (SGD) with explicit and implicit updates to fit large-scale statistical models.
☆16Aug 21, 2014Updated 11 years ago
wjbmattingly / biospacy
View on GitHub
☆22Jan 2, 2023Updated 3 years ago
rabeehk / compacter
View on GitHub
☆131Aug 18, 2022Updated 3 years ago
seonghyeonye / Flipped-Learning
View on GitHub
[ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners
☆117Jun 28, 2025Updated last year
yikee / FLIP
View on GitHub
Small Reward Models via Backward Inference
☆21May 25, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
huiwy / reflection-on-trees
View on GitHub
☆14May 9, 2024Updated 2 years ago
emrgnt-cmplxty / SmolTrainer
View on GitHub
☆21Oct 6, 2023Updated 2 years ago
facebookresearch / ModelRatatouille
View on GitHub
Recycling diverse models
☆47Jan 18, 2023Updated 3 years ago
mlpc-ucsd / BERT_Convolutions
View on GitHub
(ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.
☆21Jul 13, 2022Updated 4 years ago
ethancaballero / broken_neural_scaling_laws
View on GitHub
Code Release for "Broken Neural Scaling Laws" (BNSL) paper
☆59Oct 29, 2023Updated 2 years ago
simsal0r / mixture-of-decision-trees
View on GitHub
Mixture of Decision Trees for Interpretable Machine Learning
☆11Sep 2, 2021Updated 4 years ago
RAIVNLab / MatFormer-OLMo
View on GitHub
Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference…
☆31Nov 14, 2023Updated 2 years ago