microsoft / DeepSpeedExamples

Example models using DeepSpeed

☆6,220

Alternatives and similar repositories for DeepSpeedExamples:

Users who are interested in DeepSpeedExamples compare it to the libraries listed below.

OptimalScale / LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
☆8,320 · Updated last week
Instruction-Tuning-with-GPT-4 / GPT-4-LLM
Instruction Tuning with GPT-4
☆4,254 · Updated last year
microsoft / DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
☆36,255 · Updated this week
huggingface / trl
Train transformer language models with reinforcement learning.
☆10,609 · Updated this week
THUDM / GLM-130B
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
☆7,679 · Updated last year
huggingface / peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
☆16,978 · Updated this week
baichuan-inc / Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
☆5,681 · Updated 5 months ago
THUDM / GLM
GLM (General Language Model)
☆3,220 · Updated last year
yizhongw / self-instruct
Aligning pretrained language models with instruction data generated by themselves.
☆4,239 · Updated last year
artidoro / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆10,168 · Updated 7 months ago
CarperAI / trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
☆4,567 · Updated last year
microsoft / LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
☆11,119 · Updated last month
OpenGVLab / LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
☆5,799 · Updated 10 months ago
AutoGPTQ / AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
☆4,620 · Updated this week
Facico / Chinese-Vicuna
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model (a low-resource Chinese LLaMA + LoRA approach, with a structure modeled on Alpaca)
☆4,151 · Updated 2 months ago
NVIDIA / Megatron-LM
Ongoing research training transformer models at scale
☆11,109 · Updated this week
LianjiaTech / BELLE
BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)
☆8,017 · Updated 3 months ago
FreedomIntelligence / LLMZoo
⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡
☆2,923 · Updated last year
modelscope / modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
☆7,246 · Updated this week
baichuan-inc / Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
☆4,117 · Updated 2 months ago
bitsandbytes-foundation / bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
☆6,522 · Updated this week
project-baize / baize-chatbot
Let ChatGPT teach your own chatbot in hours with a single GPU!
☆3,169 · Updated 10 months ago
open-compass / opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, …
☆4,496 · Updated last week
AetherCortex / Llama-X
Open Academic Research on Improving LLaMA to SOTA LLM
☆1,615 · Updated last year
microsoft / Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆1,957 · Updated 3 weeks ago
ymcui / Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
☆18,651 · Updated 8 months ago
PhoebusSi / Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tunin…
☆2,659 · Updated last year
THUDM / VisualGLM-6B
Chinese and English multimodal conversational language model
☆4,125 · Updated 4 months ago
InternLM / InternLM
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
☆6,675 · Updated this week