dvmazur / mixtral-offloadingLinks

Run Mixtral-8x7B models in Colab or consumer desktops

☆2,327

Alternatives and similar repositories for mixtral-offloading

Users that are interested in mixtral-offloading are comparing it to the libraries listed below

Sorting:

myshell-ai / JetMoE
Reaching LLaMA2 Performance with 0.1M Dollars
☆986Updated last year
intel / intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl…
☆2,164Updated last year
AnswerDotAI / fsdp_qlora
Training LLMs with QLoRA + FSDP
☆1,527Updated 11 months ago
meta-pytorch / gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
☆6,135Updated 2 months ago
huggingface / optimum-nvidia
☆1,007Updated 8 months ago
brevdev / launchables
Collection of notebook guides created by the Brev.dev team!
☆1,798Updated 2 weeks ago
mistralai / megablocks-public
☆865Updated last year
mistralai / mistral-finetune
☆3,037Updated last year
MDK8888 / GPTFast
Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.
☆685Updated last year
openai / weak-to-strong
☆2,544Updated last year
neulab / prompt2model
prompt2model - Generate Deployable Models from Natural Language Instructions
☆2,008Updated 10 months ago
ise-uiuc / magicoder
[ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct
☆2,048Updated 11 months ago
McGill-NLP / webllama
Llama-3 agents that can browse the web by following instructions and talking to you
☆1,408Updated 10 months ago
SqueezeAILab / LLMCompiler
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
☆1,771Updated last year
lucidrains / self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
☆1,398Updated last year
Vahe1994 / AQLM
Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p…
☆1,299Updated 2 months ago
mistralai-sf24 / hackathon
☆446Updated last year
arcee-ai / mergekit
Tools for merging pretrained large language models.
☆6,394Updated last month
S-LoRA / S-LoRA
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
☆1,864Updated last year
mit-han-lab / streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
☆7,094Updated last year
microsoft / promptbench
A unified evaluation framework for large language models
☆2,736Updated 2 weeks ago
jiaweizzhao / GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
☆1,612Updated last year
cohere-ai / cohere-toolkit
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
☆3,137Updated last week
abacaj / fine-tune-mistral
Fine-tune mistral-7B on 3090s, a100s, h100s
☆714Updated 2 years ago
CStanKonrad / long_llama
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transform…
☆1,462Updated last year
Alpha-VLLM / LLaMA2-Accessory
An Open-source Toolkit for LLM Development
☆2,790Updated 9 months ago
open-compass / MixtralKit
A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI
☆771Updated last year
huggingface / alignment-handbook
Robust recipes to align language models with human and AI preferences
☆5,406Updated last month
marella / ctransformers
Python bindings for the Transformer models implemented in C/C++ using GGML library.
☆1,876Updated last year
FasterDecoding / Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
☆2,650Updated last year