MDK8888 / GPTFast
Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.
☆687Updated 4 months ago
Alternatives and similar repositories for GPTFast:
Users that are interested in GPTFast are comparing it to the libraries listed below
- ☆913Updated this week
- Training LLMs with QLoRA + FSDP☆1,436Updated 2 months ago
- ☆446Updated 9 months ago
- Reaching LLaMA2 Performance with 0.1M Dollars☆965Updated 5 months ago
- Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p…☆1,207Updated 3 weeks ago
- ☆664Updated this week
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,358Updated 9 months ago
- Automatically evaluate your LLMs in Google Colab☆575Updated 8 months ago
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆684Updated 9 months ago
- Official implementation of Half-Quadratic Quantization (HQQ)☆732Updated this week
- The official implementation of Self-Play Fine-Tuning (SPIN)☆1,099Updated 8 months ago
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling☆1,580Updated 6 months ago
- DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤☆895Updated this week
- ☆493Updated 4 months ago
- A simple, performant and scalable Jax LLM!☆1,587Updated this week
- Visualize the intermediate output of Mistral 7B☆333Updated 11 months ago
- Minimalistic large language model 3D-parallelism training☆1,386Updated this week
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,382Updated last month
- Train Models Contrastively in Pytorch☆568Updated last week
- A bagel, with everything.☆315Updated 9 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆1,879Updated this week
- Generate textbook-quality synthetic LLM pretraining data☆492Updated last year
- Fine-tune mistral-7B on 3090s, a100s, h100s☆704Updated last year
- Evaluation suite for LLMs☆328Updated last month
- Serving multiple LoRA finetuned LLM as one☆1,012Updated 8 months ago
- ☆413Updated last year
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"☆831Updated last month
- ☆774Updated 4 months ago