alexriggio / BERT-LoRA-TensorRTLinks
This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Rank Approximation (LoRA). The models are optimized for high performance using NVIDIA's TensorRT.
β77Updated last year
Alternatives and similar repositories for BERT-LoRA-TensorRT
Users that are interested in BERT-LoRA-TensorRT are comparing it to the libraries listed below
Sorting:
- β30Updated 11 months ago
- π Text Classification with LoRA (Low-Rank Adaptation) of Language Models - Efficiently fine-tune large language models for text classifiβ¦β50Updated last year
- β48Updated 5 months ago
- a curated list of the role of small models in the LLM eraβ101Updated 9 months ago
- Finetune mistral-7b-instruct for sentence embeddingsβ84Updated last year
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]β85Updated 5 months ago
- Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval anβ¦β30Updated 9 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [Fβ¦β66Updated last year
- Dense X Retrieval: What Retrieval Granularity Should We Use?β159Updated last year
- Fine-tuning LLM with LoRA (Low-Rank Adaptation) from scratch (Oct 2023)β23Updated this week
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Searchβ90Updated 7 months ago
- β84Updated last year
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β77Updated 8 months ago
- "Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases" bβ¦β45Updated last year
- Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphsβ30Updated 10 months ago
- Code for our paper "Graph Language Models"β71Updated 10 months ago
- SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024)β79Updated last year
- Knowledge Graph Retrieval Augmented Generation (KG-RAG) Eval Datasetsβ163Updated last year
- Fine-tuning of Flan-5T LLM for text classification π€ focuses on adapting a state-of-the-art language model to enhance its ability to claβ¦β39Updated 8 months ago
- Scripts for fine-tuning Llama2 via SFT and DPO.β200Updated last year
- A extension of Transformers library to include T5ForSequenceClassification class.β38Updated 2 years ago
- Code for Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasksβ57Updated last year
- A pipeline using LLMs for Knowledge Engineering, combining knowledge probing and Wikidata entity mapping.β37Updated 6 months ago
- β17Updated last year
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embeddβ¦β60Updated 7 months ago
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"β135Updated last year
- Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"β74Updated 8 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"β55Updated 9 months ago
- Open replication of DeepSeek R1 for text-to-graph extraction.β95Updated 5 months ago
- Code/data for MARG (multi-agent review generation)β44Updated 7 months ago