luyug / magix
Supercharge huggingface transformers with model parallelism.
☆75Updated last month
Related projects ⓘ
Alternatives and complementary repositories for magix
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆44Updated last year
- A fast implementation of T5/UL2 in PyTorch using Flash Attention☆71Updated last month
- Codebase accompanying the Summary of a Haystack paper.☆72Updated 2 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆90Updated 8 months ago
- Code for Zero-Shot Tokenizer Transfer☆115Updated 3 weeks ago
- Utilities for Training Very Large Models☆56Updated last month
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆92Updated last year
- some common Huggingface transformers in maximal update parametrization (µP)☆76Updated 2 years ago
- Embedding Recycling for Language models☆38Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆50Updated this week
- 🚢 Data Toolkit for Sailor Language Models☆82Updated 4 months ago
- ☆68Updated 3 months ago
- Retrieval-Augmented Generation battle!☆44Updated last week
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated last year
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆40Updated 4 months ago
- Language models scale reliably with over-training and on downstream tasks☆94Updated 7 months ago
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆29Updated last month
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆78Updated 8 months ago
- code for training & evaluating Contextual Document Embedding models☆117Updated this week
- ☆48Updated last month
- SILO Language Models code repository☆80Updated 8 months ago
- ☆71Updated 6 months ago
- ☆38Updated 7 months ago
- ☆46Updated this week
- Truly flash T5 realization!☆54Updated 6 months ago
- ☆40Updated 6 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆149Updated 4 months ago
- ☆112Updated last month