microsoft / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

☆36,255

Alternatives and similar repositories for DeepSpeed:

Users who are interested in DeepSpeed are comparing it to the libraries listed below.

huggingface / peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
☆16,978 Updated this week
NVIDIA / Megatron-LM
Ongoing research training transformer models at scale
☆11,109 Updated this week
lm-sys / FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
☆37,496 Updated this week
microsoft / DeepSpeedExamples
Example models using DeepSpeed
☆6,220 Updated last week
meta-llama / llama
Inference code for Llama models
☆57,227 Updated 4 months ago
microsoft / LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
☆11,119 Updated last month
tatsu-lab / stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
☆29,738 Updated 6 months ago
hpcaitech / ColossalAI
Making large AI models cheaper, faster and more accessible
☆39,013 Updated last week
huggingface / transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
☆137,641 Updated this week
gradio-app / gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
☆35,268 Updated this week
Dao-AILab / flash-attention
Fast and memory-efficient exact attention
☆15,064 Updated this week
BlinkDL / RWKV-LM
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)…
☆13,005 Updated last week
microsoft / JARVIS
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
☆23,871 Updated 3 months ago
Vision-CAIR / MiniGPT-4
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
☆25,536 Updated 4 months ago
microsoft / unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
☆20,584 Updated last week
OptimalScale / LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
☆8,320 Updated last week
tloen / alpaca-lora
Instruct-tune LLaMA on consumer hardware
☆18,758 Updated 5 months ago
run-llama / llama_index
LlamaIndex is the leading framework for building LLM-powered agents over your data.
☆38,057 Updated this week
huggingface / trl
Train transformer language models with reinforcement learning.
☆10,609 Updated this week
chenfei-wu / TaskMatrix
☆34,540 Updated last year
modelscope / modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
☆7,246 Updated this week
huggingface / accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i…
☆8,178 Updated this week
artidoro / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆10,168 Updated 7 months ago
facebookresearch / xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
☆8,910 Updated this week
vllm-project / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆33,809 Updated this week
langchain-ai / langchain
🦜🔗 Build context-aware reasoning applications
☆98,422 Updated this week
FMInference / FlexLLMGen
Running large language models on a single GPU for throughput-oriented scenarios.
☆9,254 Updated 2 months ago
facebookresearch / ImageBind
ImageBind: One Embedding Space to Bind Them All
☆8,476 Updated 5 months ago
ymcui / Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)
☆18,651 Updated 8 months ago