MDK8888 / GPTFast
Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.
☆687 Updated 2 months ago
Related projects
Alternatives and complementary repositories for GPTFast
- Training LLMs with QLoRA + FSDP ☆1,418 Updated last week
- ☆892 Updated last month
- ☆448 Updated 7 months ago
- Reaching LLaMA2 Performance with 0.1M Dollars ☆960 Updated 3 months ago
- Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p… ☆1,170 Updated last week
- The official implementation of Self-Play Fine-Tuning (SPIN) ☆1,045 Updated 6 months ago
- ☆641 Updated this week
- ☆718 Updated 2 months ago
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI ☆1,336 Updated 7 months ago
- DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤 ☆845 Updated 3 months ago
- LLM Analytics ☆615 Updated last month
- Automatically evaluate your LLMs in Google Colab ☆559 Updated 6 months ago
- llama3.np is a pure NumPy implementation for Llama 3 model. ☆975 Updated 5 months ago
- ☆470 Updated 2 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆811 Updated this week
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM … ☆493 Updated 3 months ago
- A simple, performant and scalable Jax LLM! ☆1,532 Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆1,634 Updated this week
- Visualize the intermediate output of Mistral 7B ☆313 Updated 9 months ago
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling ☆1,529 Updated 4 months ago
- Official implementation of Half-Quadratic Quantization (HQQ) ☆701 Updated last week
- Llama-3 agents that can browse the web by following instructions and talking to you ☆1,351 Updated 4 months ago
- Easily embed, cluster and semantically label text datasets ☆462 Updated 7 months ago
- ReFT: Representation Finetuning for Language Models ☆1,159 Updated 2 weeks ago
- A library for making RepE control vectors ☆481 Updated last month
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,057 Updated 2 months ago
- Stateful load balancer custom-tailored for llama.cpp ☆563 Updated this week
- Train Models Contrastively in Pytorch ☆546 Updated this week