THUDM/FasterTransformer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/THUDM/FasterTransformer)

THUDM / FasterTransformer

Transformer related optimization, including BERT, GPT

☆39

Alternatives and similar repositories for FasterTransformer

Users that are interested in FasterTransformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Rayrtfr / FasterTransformer
View on GitHub
Transformer related optimization, including BERT, GPT
☆17Jul 29, 2023Updated 2 years ago
THUDM / WinGNN
View on GitHub
☆10May 18, 2023Updated 3 years ago
Oneflow-Inc / one-glm
View on GitHub
A more efficient GLM implementation!
☆54Feb 18, 2023Updated 3 years ago
triton-inference-server / hugectr_backend
View on GitHub
☆57Oct 17, 2023Updated 2 years ago
THUDM / MRT
View on GitHub
MRT: Tracing the Evolution of Scientific Publications (TKDE 2021)
☆18Mar 23, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
starmee / AI-Notes
View on GitHub
My learning notes about AI, including Machine Learning and Deep Learning.
☆18Jun 30, 2019Updated 7 years ago
void-main / FasterTransformer
View on GitHub
Transformer related optimization, including BERT, GPT
☆59Sep 20, 2023Updated 2 years ago
MatanHamilis / one_stencil
View on GitHub
Multiple 1-stencil implementations using nvidia cuda.
☆12Dec 2, 2017Updated 8 years ago
bytedance / ByteTransformer
View on GitHub
optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052
☆479Mar 15, 2024Updated 2 years ago
triton-inference-server / fastertransformer_backend
View on GitHub
☆413Nov 11, 2023Updated 2 years ago
zejunwang1 / easytokenizer
View on GitHub
高性能文本 Tokenizer 库
☆31Feb 2, 2024Updated 2 years ago
simon-mo / vLLM-Benchmark
View on GitHub
☆33Apr 19, 2025Updated last year
void-main / fastertransformer_backend
View on GitHub
☆22Jul 11, 2023Updated 3 years ago
leimao / OpenAI_Gym_AI
View on GitHub
These are my learning algorithm solutions to OpenAI Gym environments.
☆11May 9, 2017Updated 9 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
THUDM / Multilingual-GLM
View on GitHub
The multilingual variant of GLM, a general language model trained with autoregressive blank infilling objective
☆63Nov 19, 2022Updated 3 years ago
THUDM / Tsinghua-ML-Course
View on GitHub
Course Materials for ML Course at Tsinghua
☆29Dec 17, 2019Updated 6 years ago
NVIDIA / FasterTransformer
View on GitHub
Transformer related optimization, including BERT, GPT
☆6,442Mar 27, 2024Updated 2 years ago
tachitachi / GradientReversal
View on GitHub
Tensorflow implementation of the Gradient Reversal layer from https://arxiv.org/abs/1505.07818
☆13Jun 19, 2018Updated 8 years ago
THUDM / FastLDM
View on GitHub
Inference speed-up for stable-diffusion (ldm) with TensorRT.
☆35Jun 19, 2023Updated 3 years ago
ssbuild / llm_rlhf
View on GitHub
realize the reinforcement learning training for gpt2 llama bloom and so on llm model
☆27Sep 19, 2023Updated 2 years ago
wangguojim / LargeScale
View on GitHub
☆19May 11, 2024Updated 2 years ago
jpqiang / Chinese-Idiom-Paraphrasing
View on GitHub
☆15Dec 8, 2022Updated 3 years ago
Leonardo-Ding / gpu_sgemm
View on GitHub
☆17Jul 1, 2020Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
liutianlin0121 / decoding-time-realignment
View on GitHub
Implementation of "Decoding-time Realignment of Language Models", ICML 2024.
☆21Jun 17, 2024Updated 2 years ago
OKIC-CA / RUL
View on GitHub
Survival Analysis with Machine Learning for Predicting Li-ion Battery Remaining Useful Life
☆15May 4, 2026Updated 2 months ago
neural-dialogue-metrics / EmbeddingBased
View on GitHub
Embedding-based evaluation metrics for dialogue generation.
☆15Jan 8, 2023Updated 3 years ago
blackboxnlp / 2020
View on GitHub
☆11Nov 20, 2020Updated 5 years ago
mlcommons / inference_results_v0.7
View on GitHub
This repository contains the results and code for the MLPerf™ Inference v0.7 benchmark.
☆17Jul 24, 2025Updated 11 months ago
THUDM / icetk
View on GitHub
A unified tokenization tool for Images, Chinese and English.
☆153Mar 23, 2023Updated 3 years ago
THUDM / SwissArmyTransformer
View on GitHub
SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.
☆1,119Dec 26, 2024Updated last year
shawnlimn / ScaleGrad
View on GitHub
Source code for ScaleGrad
☆19Dec 28, 2021Updated 4 years ago
yhwang-hub / yolov7_QAT
View on GitHub
Quantize yolov7 using pytorch_quantization.🚀🚀🚀
☆12Oct 20, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Silin159 / PersonaChat-BART-PeaCoK
View on GitHub
☆12Nov 10, 2023Updated 2 years ago
cameronfr / FasterTransformer
View on GitHub
Transformer related optimization, including BERT, GPT
☆14Jun 27, 2023Updated 3 years ago
OpenBMB / cpm_kernels
View on GitHub
☆25Oct 2, 2023Updated 2 years ago
NVIDIA / HMM_sample_code
View on GitHub
CUDA 12.2 HMM demos
☆21Jul 26, 2024Updated last year
BaofengZan / hard_decode_trt-windows
View on GitHub
https://github.com/shouxieai/hard_decode_trt windows编译版本
☆13Sep 8, 2022Updated 3 years ago
loretoparisi / htk
View on GitHub
HTK Toolkit with Linux 64 bit and Docker support
☆20Oct 4, 2021Updated 4 years ago
alanshi / charset_mnbvc
View on GitHub
本项目旨在对大量文本文件进行快速编码检测和转换以辅助mnbvc语料集项目的数据清洗工作
☆70Oct 17, 2025Updated 9 months ago