hpcaitech/ColossalAI

Making large AI models cheaper, faster and more accessible

☆41,413

Alternatives and similar repositories for ColossalAI

Users who are interested in ColossalAI are comparing it to the libraries listed below.

deepspeedai / DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
☆42,718 · Updated this week
lm-sys / FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
☆39,490 · May 1, 2026 · Updated 2 months ago
tatsu-lab / stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
☆30,251 · Jul 17, 2024 · Updated 2 years ago
zai-org / ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
☆41,024 · Jun 27, 2024 · Updated 2 years ago
meta-llama / llama
Inference code for Llama models
☆59,511 · Jan 26, 2025 · Updated last year
ymcui / Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca LLMs, with local CPU/GPU training and deployment
☆18,941 · Apr 19, 2026 · Updated 2 months ago
Vision-CAIR / MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
☆25,662 · Sep 2, 2024 · Updated last year
huggingface / transformers
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model…
☆162,626 · Updated this week
LianjiaTech / BELLE
BELLE: Be Everyone's Large Language model Engine (open-source Chinese dialogue LLM)
☆8,274 · Oct 16, 2024 · Updated last year
langchain-ai / langchain
The agent engineering platform.
☆141,839 · Updated this week
LAION-AI / Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical…
☆37,383 · Aug 17, 2024 · Updated last year
vllm-project / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆86,341 · Updated this week
tloen / alpaca-lora
Instruct-tune LLaMA on consumer hardware
☆18,910 · Jul 29, 2024 · Updated last year
hiyouga / LlamaFactory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
☆73,301 · Updated this week
Significant-Gravitas / AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus o…
☆185,558 · Updated this week
OptimalScale / LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
☆8,484 · May 22, 2026 · Updated last month
NVIDIA / Megatron-LM
Ongoing research training transformer models at scale
☆17,073 · Updated this week
run-llama / llama_index
LlamaIndex is the leading document agent and OCR platform
☆50,868 · Updated this week
OpenMOSS / MOSS
An open-source tool-augmented conversational language model from Fudan University
☆12,156 · May 27, 2026 · Updated last month
huggingface / peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
☆21,399 · Updated this week
nomic-ai / gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
☆77,388 · May 27, 2025 · Updated last year
microsoft / unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
☆22,159 · Jan 23, 2026 · Updated 5 months ago
chenfei-wu / TaskMatrix
☆34,055 · Jan 6, 2024 · Updated 2 years ago
Dao-AILab / flash-attention
Fast and memory-efficient exact attention
☆24,460 · Updated this week
mlc-ai / mlc-llm
Universal LLM Deployment Engine with ML Compilation
☆22,949 · Updated this week
microsoft / JARVIS
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
☆25,047 · Jul 29, 2025 · Updated 11 months ago
zai-org / GLM-130B
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
☆7,655 · Jul 25, 2023 · Updated 2 years ago
BlinkDL / ChatRWKV
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
☆9,494 · May 29, 2026 · Updated last month
hpcaitech / Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
☆29,186 · Apr 9, 2026 · Updated 3 months ago
togethercomputer / OpenChatKit
☆8,982 · Apr 9, 2024 · Updated 2 years ago
chatchat-space / Langchain-Chatchat
Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications based on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain…
☆38,426 · Nov 10, 2025 · Updated 8 months ago
huggingface / trl
Train transformer language models with reinforcement learning.
☆18,850 · Updated this week
FMInference / FlexLLMGen
Running large language models on a single GPU for throughput-oriented scenarios.
☆9,359 · Oct 28, 2024 · Updated last year
ggml-org / llama.cpp
LLM inference in C/C++
☆120,500 · Updated this week
AUTOMATIC1111 / stable-diffusion-webui
Stable Diffusion web UI
☆164,252 · Mar 2, 2026 · Updated 4 months ago
deepspeedai / DeepSpeedExamples
Example models using DeepSpeed
☆6,830 · Updated this week
lllyasviel / ControlNet
Let us control diffusion models!
☆34,000 · Feb 25, 2024 · Updated 2 years ago
databrickslabs / dolly
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
☆10,804 · Jun 30, 2023 · Updated 3 years ago
haotian-liu / LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
☆24,923 · Aug 12, 2024 · Updated last year