Building modular LMs with parameter-efficient fine-tuning.
☆115May 7, 2026Updated last month
Alternatives and similar repositories for mttl
Users that are interested in mttl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304☆14Oct 11, 2022Updated 3 years ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆93Feb 27, 2024Updated 2 years ago
- [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"☆61Nov 26, 2023Updated 2 years ago
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning☆40Jul 1, 2023Updated 2 years ago
- ☆131Aug 18, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts…☆95Jul 25, 2024Updated last year
- Codebase for " Reducing Representation Drift in Online Continual Learning"☆14Jun 8, 2021Updated 5 years ago
- ☆26Nov 23, 2023Updated 2 years ago
- ☆11Nov 13, 2024Updated last year
- ☆215Feb 3, 2024Updated 2 years ago
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"☆459Sep 6, 2023Updated 2 years ago
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"☆59Aug 25, 2024Updated last year
- Official implementation for Sparse MetA-Tuning (SMAT)☆17Jul 29, 2025Updated 11 months ago
- Apply methods described in "Git Re-basin"-paper [1] to arbitrary models --- [1] Ainsworth et al. (https://arxiv.org/abs/2209.04836)☆15Jun 22, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf☆25Nov 14, 2023Updated 2 years ago
- Official implementation for "Parameter-Efficient Fine-Tuning Design Spaces"☆27Jan 4, 2023Updated 3 years ago
- The repository contains code for Adaptive Data Optimization☆36Dec 9, 2024Updated last year
- A flexible framework for running experiments with PyTorch models in a simulated Federated Learning (FL) environment.☆15Aug 11, 2023Updated 2 years ago
- AutoPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning (Zhou et al.; TACL 2024)☆51Mar 17, 2024Updated 2 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Jan 31, 2023Updated 3 years ago
- ☆15Jul 15, 2023Updated 2 years ago
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆19Feb 29, 2024Updated 2 years ago
- An unofficial implementation of "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆35Jun 7, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆17Apr 17, 2024Updated 2 years ago
- Model Stock: All we need is just a few fine-tuned models☆129Aug 9, 2025Updated 10 months ago
- Code Repository for the NeurIPS 2021 paper: "Self-Supervised Representation Learning on Neural Network Weights for Model Characteristic P…☆22Jul 10, 2024Updated last year
- ☆18Aug 19, 2024Updated last year
- Residual Prompt Tuning: a method for faster and better prompt tuning.☆56May 10, 2023Updated 3 years ago
- Code for NeurIPS 2024 paper: "Noether's razor: Learning Conserved Quantities" by Tycho F. A. van der Ouderaa, Mark van der Wilk, Pim de H…☆10Oct 12, 2024Updated last year
- [ICML2023] Instant Soup Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti…☆11Nov 28, 2023Updated 2 years ago
- Comparison of method "Pruning at initialization prior to training" (Synflow/SNIP/GraSP) in PyTorch☆18May 12, 2024Updated 2 years ago
- ☆14Apr 1, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Aug 25, 2023Updated 2 years ago
- Repo for "Smart Word Suggestions" (SWS) task and benchmark☆19Dec 4, 2023Updated 2 years ago
- On the Effectiveness of Parameter-Efficient Fine-Tuning☆39Nov 4, 2023Updated 2 years ago
- Code for the paper "Modelling Latent Translations for Cross-Lingual Transfer"☆17Nov 22, 2021Updated 4 years ago
- Federated Learning - PyTorch☆15Jun 27, 2021Updated 5 years ago
- ☆23Mar 31, 2023Updated 3 years ago
- Model Zoos published at the NeurIPS 2022 Dataset & Benchmark track: "Model Zoos: A Dataset of Diverse Populations of Neural Network Model…☆59Oct 2, 2025Updated 8 months ago