alexriggio / BERT-LoRA-TensorRT
This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low-Rank Adaptation (LoRA). The models are optimized for high performance using NVIDIA's TensorRT.
☆70Updated last year
Alternatives and similar repositories for BERT-LoRA-TensorRT:
Users who are interested in BERT-LoRA-TensorRT are comparing it to the repositories listed below:
- ☆30Updated 9 months ago
- ☆42Updated 2 months ago
- This is the code for MMOA-RAG.☆50Updated last month
- Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphs☆29Updated 7 months ago
- Code for Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks☆55Updated last year
- Open-source Large Language Models are Strong Zero-shot Query Likelihood Models for Document Ranking☆17Updated last year
- A curated list on the role of small models in the LLM era☆100Updated 7 months ago
- Leveraging passage embeddings for efficient listwise reranking with large language models.☆40Updated 4 months ago
- Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval an…☆28Updated 7 months ago
- Mining Logical Rules with Large Language Models for Knowledge Graph Reasoning with 1 dollar.☆58Updated last year
- "Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases" b…☆43Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆83Updated 4 months ago
- [NeurIPS 2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory☆59Updated last year
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd…☆56Updated 4 months ago
- ☆12Updated last year
- ☆20Updated 3 years ago
- LLM guided text clustering☆92Updated last year
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval☆45Updated 6 months ago
- The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models.☆19Updated 5 months ago
- This repository combines the CPO and SimPO methods for better reference-free preference learning.☆53Updated 8 months ago
- An extension of the Transformers library that adds a T5ForSequenceClassification class.☆38Updated 2 years ago
- ☆69Updated last year
- Open replication of DeepSeek R1 for text-to-graph extraction.☆93Updated 2 months ago
- ☆28Updated last year
- SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024)☆80Updated last year
- ☆15Updated last year
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆88Updated 2 months ago
- ☆55Updated 6 months ago
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models☆17Updated last year
- Fine-tune mistral-7b-instruct for sentence embeddings☆81Updated 11 months ago