Rayrtfr / fastertransformer_backendLinks

☆9

Alternatives and similar repositories for fastertransformer_backend

Users that are interested in fastertransformer_backend are comparing it to the libraries listed below

Sorting:

void-main / fastertransformer_backend
☆21Updated 2 years ago
THUDM / FasterTransformer
Transformer related optimization, including BERT, GPT
☆39Updated 2 years ago
Rayrtfr / FasterTransformer
Transformer related optimization, including BERT, GPT
☆17Updated 2 years ago
OpenBMB / BMTrain
Efficient Training (including pre-training and fine-tuning) for Big Models
☆604Updated 2 months ago
mindspore-lab / mindformers
☆172Updated this week
OpenMOSS / CoLLiE
Collaborative Training of Large Language Models in an Efficient Way
☆418Updated 11 months ago
cauyxy / bilivideos
☆52Updated 2 years ago
genggui001 / Megatron-DeepSpeed-Llama
☆84Updated last year
HuangLK / transpeeder
train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism
☆224Updated last year
alibaba / Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
☆659Updated last year
git-cloner / llama2-lora-fine-tuning
llama2 finetuning with deepspeed and lora
☆176Updated 2 years ago
ProjectD-AI / llama_inference
llama inference for tencentpretrain
☆99Updated 2 years ago
ProjectD-AI / LLaMA-Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆69Updated 2 years ago
sunzeyeah / RLHF
Implementation of Chinese ChatGPT
☆287Updated last year
CoinCheung / gdGPT
Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.
☆97Updated last year
Ascend / AscendSpeed
☆79Updated last year
void-main / FasterTransformer
Transformer related optimization, including BERT, GPT
☆59Updated last year
CSHaitao / ChatGLM_mutli_gpu_tuning
deepspeed+trainer简单高效实现多卡微调大模型
☆127Updated 2 years ago
Oneflow-Inc / libai
LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training
☆408Updated last week
Tlntin / ChatGLM2-6B-TensorRT
☆90Updated 2 years ago
QwenLM / vllm-gptq
A high-throughput and memory-efficient inference and serving engine for LLMs
☆137Updated 8 months ago
alipay / PainlessInferenceAcceleration
Accelerate inference without tears
☆322Updated 4 months ago
yanqiangmiffy / how-to-train-tokenizer
怎么训练一个LLM分词器
☆152Updated 2 years ago
bojone / NBCE
Naive Bayes-based Context Extension
☆325Updated 8 months ago
yangjianxin1 / LLMPruner
☆310Updated 2 years ago
OpenBMB / BMCook
Model Compression for Big Models
☆164Updated 2 years ago
zejunwang1 / easytokenizer
高性能文本 Tokenizer 库
☆30Updated last year
taishan1994 / sentencepiece_chinese_bpe
使用sentencepiece中BPE训练中文词表，并在transformers中进行使用。
☆119Updated 2 years ago
volcengine / veGiantModel
☆220Updated last year
carbonz0 / alpaca-chinese-dataset
alpaca中文指令微调数据集
☆394Updated 2 years ago