alexriggio / BERT-LoRA-TensorRT

This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low-Rank Adaptation (LoRA). The models are optimized for high performance using NVIDIA's TensorRT.
67 · Updated last year

Alternatives and similar repositories for BERT-LoRA-TensorRT:

Users who are interested in BERT-LoRA-TensorRT compare it to the libraries listed below.