alexriggio / BERT-LoRA-TensorRTLinks
This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Rank Approximation (LoRA). The models are optimized for high performance using NVIDIA's TensorRT.
☆78Updated 2 years ago
Alternatives and similar repositories for BERT-LoRA-TensorRT
Users that are interested in BERT-LoRA-TensorRT are comparing it to the libraries listed below
Sorting:
- a curated list of the role of small models in the LLM era☆111Updated last year
- Text classification with Foundation Language Model LLaMA☆113Updated 2 years ago
- ☆57Updated last year
- Code for our paper "Graph Language Models"☆77Updated last year
- 📚 Text Classification with LoRA (Low-Rank Adaptation) of Language Models - Efficiently fine-tune large language models for text classifi…☆54Updated 2 years ago
- Scripts for fine-tuning Llama2 via SFT and DPO.☆207Updated 2 years ago
- LLM guided text clustering☆114Updated 2 years ago
- ☆34Updated last year
- We want to try and evaluate LLMs using Knowledge Graphs☆111Updated 2 years ago
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆168Updated 2 years ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆77Updated last year
- Benchmark baseline for retrieval qa applications☆119Updated last year
- Finetune mistral-7b-instruct for sentence embeddings☆88Updated last year
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆88Updated last year
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆68Updated last year
- [NAACL 2024] End-to-End Beam Retrieval for Multi-Hop Question Answering☆123Updated last year
- Code for Search-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive Tasks (WWW 2024))☆58Updated 2 months ago
- Official implementation of A* Networks☆152Updated 2 years ago
- Fine-tuning LLM with LoRA (Low-Rank Adaptation) from scratch (Oct 2023)☆32Updated 6 months ago
- Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphs☆34Updated last year
- "Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases" b…☆45Updated last year
- This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation (EMNLP2024)☆344Updated last year
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆156Updated 2 years ago
- Evaluation tools for Retrieval-augmented Generation (RAG) methods.☆169Updated last year
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆143Updated 2 years ago
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆193Updated last year
- [ACL'24] Official Implementation of the paper "Direct Evaluation of Chain-of-Thought in Multi-hop Reasoning with Knowledge Graphs"(https:…☆47Updated 9 months ago
- A extension of Transformers library to include T5ForSequenceClassification class.☆40Updated 2 years ago
- Knowledge Graph Retrieval Augmented Generation (KG-RAG) Eval Datasets☆195Updated last year
- ☆43Updated 2 years ago