alexriggio / BERT-LoRA-TensorRTLinks
This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Rank Approximation (LoRA). The models are optimized for high performance using NVIDIA's TensorRT.
β77Updated last year
Alternatives and similar repositories for BERT-LoRA-TensorRT
Users that are interested in BERT-LoRA-TensorRT are comparing it to the libraries listed below
Sorting:
- π Text Classification with LoRA (Low-Rank Adaptation) of Language Models - Efficiently fine-tune large language models for text classifiβ¦β52Updated last year
- β50Updated 6 months ago
- β30Updated last year
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]β86Updated 6 months ago
- a curated list of the role of small models in the LLM eraβ103Updated 10 months ago
- Code for our paper "Graph Language Models"β73Updated 11 months ago
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"β135Updated last year
- Fine-tuning LLM with LoRA (Low-Rank Adaptation) from scratch (Oct 2023)β24Updated last week
- Open replication of DeepSeek R1 for text-to-graph extraction.β98Updated 6 months ago
- We want to try and evaluate LLMs using Knowledge Graphsβ106Updated 2 years ago
- Evaluation tools for Retrieval-augmented Generation (RAG) methods.β162Updated 8 months ago
- Benchmark baseline for retrieval qa applicationsβ115Updated last year
- Finetune mistral-7b-instruct for sentence embeddingsβ85Updated last year
- The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data"β104Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Searchβ93Updated 8 months ago
- Text classification with Foundation Language Model LLaMAβ114Updated 2 years ago
- β31Updated last year
- Dense X Retrieval: What Retrieval Granularity Should We Use?β159Updated last year
- LLM guided text clusteringβ102Updated last year
- Code for Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasksβ57Updated last year
- This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation (EMNLP2024)β329Updated 7 months ago
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embeddβ¦β60Updated 7 months ago
- [NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".β130Updated last year
- A pipeline using LLMs for Knowledge Engineering, combining knowledge probing and Wikidata entity mapping.β37Updated 7 months ago
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β77Updated 9 months ago
- β12Updated last year
- [Paper][ACL 2024 Findings] Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answeringβ194Updated last year
- Comparing the Performance of LLMs: A Deep Dive into Roberta, Llama, and Mistral for Disaster Tweets Analysis with Loraβ51Updated last year
- β33Updated last year
- Improving Text Embedding of Language Models Using Contrastive Fine-tuningβ64Updated last year