r-three / phatgooseView external linksLinks
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
☆91Feb 27, 2024Updated last year
Alternatives and similar repositories for phatgoose
Users that are interested in phatgoose are comparing it to the libraries listed below
Sorting:
- ☆30Sep 28, 2023Updated 2 years ago
- My Gen AI research☆11Jun 3, 2024Updated last year
- A library for squeakily cleaning and filtering language datasets.☆49Jul 10, 2023Updated 2 years ago
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"☆57Aug 25, 2024Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models☆180May 2, 2024Updated last year
- ☆415Nov 2, 2023Updated 2 years ago
- Building modular LMs with parameter-efficient fine-tuning.☆114Jan 18, 2026Updated 3 weeks ago
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20May 31, 2023Updated 2 years ago
- DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling☆36Jul 12, 2024Updated last year
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…☆159Feb 9, 2024Updated 2 years ago
- ☆209Feb 3, 2024Updated 2 years ago
- 5X faster 60% less memory QLoRA finetuning☆21May 28, 2024Updated last year
- ☆37Nov 27, 2025Updated 2 months ago
- QLoRA with Enhanced Multi GPU Support☆37Aug 8, 2023Updated 2 years ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆32Sep 22, 2024Updated last year
- ☆68May 26, 2024Updated last year
- Codebase for decoding compressed trust.☆25May 7, 2024Updated last year
- A repository for research on medium sized language models.☆77May 23, 2024Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Oct 19, 2024Updated last year
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem.☆16Jan 9, 2026Updated last month
- A chat implementation for FastHTML☆11Sep 14, 2025Updated 4 months ago
- ☆13May 9, 2024Updated last year
- ☆273Oct 31, 2023Updated 2 years ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆144Sep 10, 2023Updated 2 years ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆83Sep 10, 2023Updated 2 years ago
- ☆10Oct 24, 2024Updated last year
- Build modern UIs in Jupyter with Python☆12Dec 28, 2022Updated 3 years ago
- ☆10Apr 16, 2021Updated 4 years ago
- ☆12Oct 22, 2024Updated last year
- YourAICHAT☆13Aug 16, 2023Updated 2 years ago
- MPI Code Generation through Domain-Specific Language Models☆14Nov 19, 2024Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks (EMNLP'24)☆147Sep 20, 2024Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Feb 5, 2025Updated last year
- [COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition☆668Jul 22, 2024Updated last year
- Evaluating LLMs with fewer examples☆169Apr 12, 2024Updated last year
- ☆91Aug 18, 2024Updated last year
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31May 22, 2024Updated last year