alexriggio / BERT-LoRA-TensorRT
This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low-Rank Adaptation (LoRA). The models are optimized for high performance using NVIDIA's TensorRT.
☆77 Updated last year
Alternatives and similar repositories for BERT-LoRA-TensorRT
Users who are interested in BERT-LoRA-TensorRT are comparing it to the libraries listed below
- ☆30 Updated 11 months ago
- Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphs ☆30 Updated 9 months ago
- ☆46 Updated 5 months ago
- Open-source Large Language Models are Strong Zero-shot Query Likelihood Models for Document Ranking ☆17 Updated last year
- Code for Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks ☆57 Updated last year
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models ☆17 Updated last year
- This repository contains the joint use of CPO and SimPO method for better reference-free preference learning methods. ☆53 Updated 10 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search ☆89 Updated 6 months ago
- a curated list of the role of small models in the LLM era ☆101 Updated 9 months ago
- "Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases" b… ☆45 Updated last year
- This is the code of MMOA-RAG. ☆53 Updated last month
- Finetune mistral-7b-instruct for sentence embeddings ☆84 Updated last year
- ☆12 Updated last year
- 📚 Text Classification with LoRA (Low-Rank Adaptation) of Language Models - Efficiently fine-tune large language models for text classifi… ☆50 Updated last year
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval ☆50 Updated last week
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales ☆98 Updated 4 months ago
- Test-time compute in information retrieval ☆32 Updated 2 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F… ☆66 Updated last year
- ☆72 Updated last year
- Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context" ☆69 Updated 10 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024] ☆84 Updated 5 months ago
- Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free" ☆74 Updated 8 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning" ☆54 Updated 8 months ago
- Open Implementations of LLM Analyses ☆104 Updated 8 months ago
- ☆36 Updated 5 months ago
- Code for KaLM-Embedding models ☆78 Updated 3 months ago
- ☆20 Updated 3 years ago
- [Neurips2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory ☆59 Updated 2 years ago
- minimal GRPO implementation from scratch ☆90 Updated 3 months ago
- Reward Model framework for LLM RLHF ☆61 Updated 2 years ago