alexriggio / BERT-LoRA-TensorRTLinks
This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Rank Approximation (LoRA). The models are optimized for high performance using NVIDIA's TensorRT.
β78Updated 2 years ago
Alternatives and similar repositories for BERT-LoRA-TensorRT
Users that are interested in BERT-LoRA-TensorRT are comparing it to the libraries listed below
Sorting:
- π Text Classification with LoRA (Low-Rank Adaptation) of Language Models - Efficiently fine-tune large language models for text classifiβ¦β55Updated 2 years ago
- Code for our paper "Graph Language Models"β77Updated last year
- a curated list of the role of small models in the LLM eraβ111Updated last year
- Text classification with Foundation Language Model LLaMAβ113Updated 2 years ago
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β78Updated last year
- β35Updated last year
- We want to try and evaluate LLMs using Knowledge Graphsβ111Updated 2 years ago
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"β137Updated 2 years ago
- β34Updated 2 years ago
- Finetune mistral-7b-instruct for sentence embeddingsβ88Updated last year
- Official implementation of A* Networksβ152Updated 2 years ago
- Dense X Retrieval: What Retrieval Granularity Should We Use?β167Updated last year
- Improving Text Embedding of Language Models Using Contrastive Fine-tuningβ66Updated last year
- LLM guided text clusteringβ111Updated 2 years ago
- Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphsβ34Updated last year
- The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data"β103Updated last year
- Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed)β86Updated last year
- Collection of links, tutorials and best practices of how to collect the data and build end-to-end RLHF system to finetune Generative AI mβ¦β224Updated 2 years ago
- β12Updated 2 years ago
- Knowledge Graph Retrieval Augmented Generation (KG-RAG) Eval Datasetsβ192Updated last year
- Official Implementation of PatentLMM (our AAAI 2025 Paper)β13Updated 10 months ago
- Code for Search-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive Tasks (WWW 2024))β58Updated last month
- Open Implementations of LLM Analysesβ108Updated last year
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embeddβ¦β63Updated last year
- A extension of Transformers library to include T5ForSequenceClassification class.β40Updated 2 years ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]β88Updated 11 months ago
- Scripts for fine-tuning Llama2 via SFT and DPO.β206Updated 2 years ago
- [Neurips2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memoryβ62Updated 2 years ago
- "Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases" bβ¦β45Updated last year
- [ACL 2024 Findings] Hierarchy-aware Biased Bound Margin Loss Function for Hierarchical Text Classificationβ15Updated last year