alexriggio / BERT-LoRA-TensorRT
This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low-Rank Adaptation (LoRA). The models are optimized for high performance using NVIDIA's TensorRT.
☆78 · Updated 2 years ago
Alternatives and similar repositories for BERT-LoRA-TensorRT
Users who are interested in BERT-LoRA-TensorRT are comparing it to the repositories listed below.
- Text Classification with LoRA (Low-Rank Adaptation) of Language Models - Efficiently fine-tune large language models for text classifi… ☆55 · Updated 2 years ago
- A curated list of the role of small models in the LLM era ☆111 · Updated last year
- ☆34 · Updated last year
- Text classification with Foundation Language Model LLaMA ☆113 · Updated 2 years ago
- Finetune mistral-7b-instruct for sentence embeddings ☆88 · Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆78 · Updated last year
- [NeurIPS 2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory ☆62 · Updated 2 years ago
- Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphs ☆34 · Updated last year
- ☆56 · Updated 11 months ago
- ☆12 · Updated 2 years ago
- Code for Search-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive Tasks (WWW 2024) ☆58 · Updated last month
- Code for our paper "Graph Language Models" ☆77 · Updated last year
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024] ☆88 · Updated 11 months ago
- The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data" ☆103 · Updated last year
- We want to try and evaluate LLMs using Knowledge Graphs ☆111 · Updated 2 years ago
- Knowledge Graph Retrieval Augmented Generation (KG-RAG) Eval Datasets ☆194 · Updated last year
- Open replication of DeepSeek R1 for text-to-graph extraction. ☆99 · Updated 11 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use? ☆167 · Updated last year
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F… ☆69 · Updated last year
- ☆99 · Updated 4 years ago
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models ☆17 · Updated 2 years ago
- ☆35 · Updated 2 years ago
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd… ☆63 · Updated last year
- Implementation of ECIR 2022 Paper: How Can Graph Neural Networks Help Document Retrieval: A Case Study on CORD19 with Concept Map Generat… ☆15 · Updated 3 years ago
- Code/data for MARG (multi-agent review generation) ☆59 · Updated 3 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search ☆102 · Updated last year
- Comparing the Performance of LLMs: A Deep Dive into RoBERTa, Llama, and Mistral for Disaster Tweets Analysis with LoRA ☆51 · Updated 2 years ago
- SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024) ☆85 · Updated last year
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`. ☆156 · Updated 2 years ago
- An extension of the Transformers library to include a T5ForSequenceClassification class. ☆40 · Updated 2 years ago