siddharth-sharma7 / fast-Bart
View external linksLinks

Convert BART models to ONNX with quantization. 3X reduction in size, and upto 3X boost in inference speed
33Dec 11, 2024Updated last year

Alternatives and similar repositories for fast-Bart

Users that are interested in fast-Bart are comparing it to the libraries listed below

Sorting:

Are these results useful?