alexriggio / BERT-LoRA-TensorRT
This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low-Rank Adaptation (LoRA). The models are optimized for high performance using NVIDIA's TensorRT.
78 · Nov 14, 2023 · Updated 2 years ago

Alternatives and similar repositories for BERT-LoRA-TensorRT

Users interested in BERT-LoRA-TensorRT are comparing it to the libraries listed below.
