alexriggio / BERT-LoRA-TensorRT
This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Rank Approximation (LoRA). The models are optimized for high performance using NVIDIA's TensorRT.
☆73Updated last year
Alternatives and similar repositories for BERT-LoRA-TensorRT
Users that are interested in BERT-LoRA-TensorRT are comparing it to the libraries listed below
Sorting:
- ☆30Updated 9 months ago
- Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphs☆30Updated 8 months ago
- ☆43Updated 3 months ago
- [Neurips2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory☆59Updated last year
- ☆12Updated last year
- Open-source Large Language Models are Strong Zero-shot Query Likelihood Models for Document Ranking☆17Updated last year
- A extension of Transformers library to include T5ForSequenceClassification class.☆38Updated 2 years ago
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd…☆58Updated 5 months ago
- Code for Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks☆55Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆84Updated 5 months ago
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models☆17Updated last year
- This is the code of MMOA-RAG.☆51Updated last week
- Code and data for "The Power of Noise: Redefining Retrieval for RAG Systems"☆53Updated 6 months ago
- ☆29Updated 6 months ago
- This PyTorch package implements MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation (NAACL 2022).☆105Updated 3 years ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆93Updated 3 months ago
- a curated list of the role of small models in the LLM era☆100Updated 7 months ago
- We want to try and evaluate LLMs using Knowledge Graphs☆105Updated 2 years ago
- Codes and packages for the paper titled Evaluating Retrieval Quality in Retrieval-Augmented Generation.☆23Updated last year
- Code for our paper "Graph Language Models"☆70Updated 8 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆62Updated 11 months ago
- Graph of Records: Boosting Retrieval Augmented Generation for Long-context Summarization with Graphs☆20Updated 7 months ago
- Code/data for MARG (multi-agent review generation)☆43Updated 6 months ago
- PGRAG☆48Updated 10 months ago
- ☆69Updated last year
- Fine-tuning LLM with LoRA (Low-Rank Adaptation) from scratch (Oct 2023)☆19Updated last year
- Benchmark baseline for retrieval qa applications☆109Updated last year
- Code for Language-Interfaced FineTuning for Non-Language Machine Learning Tasks.☆126Updated 6 months ago
- "Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases" b…☆43Updated last year
- https://acl2023-retrieval-lm.github.io/☆153Updated last year