Tencent / TurboTransformers

a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.
1,503Updated last year

Alternatives and similar repositories for TurboTransformers:

Users that are interested in TurboTransformers are comparing it to the libraries listed below