alexriggio / BERT-LoRA-TensorRT
This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Rank Approximation (LoRA). The models are optimized for high performance using NVIDIA's TensorRT.
☆70Updated last year
Alternatives and similar repositories for BERT-LoRA-TensorRT:
Users that are interested in BERT-LoRA-TensorRT are comparing it to the libraries listed below
- ☆30Updated 8 months ago
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd…☆57Updated 3 months ago
- Fine-tuning LLM with LoRA (Low-Rank Adaptation) from scratch (Oct 2023)☆19Updated last year
- "Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases" b…☆42Updated last year
- Evaluating the Factuality of Large Language Models using Large-Scale Knowledge Graphs☆28Updated 7 months ago
- ☆41Updated 2 months ago
- Mining Logical Rules with Large Language Models for Knowledge Graph Reasoning with 1 dollar.☆57Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆77Updated 4 months ago
- SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning (NeurIPS D&B Track 2024)☆78Updated last year
- ☆30Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆73Updated 5 months ago
- We want to try and evaluate LLMs using Knowledge Graphs☆104Updated last year
- Code implementation of synthetic continued pretraining☆97Updated 2 months ago
- This repository contains the joint use of CPO and SimPO method for better reference-free preference learning methods.☆52Updated 7 months ago
- Code for Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks☆55Updated last year
- a curated list of the role of small models in the LLM era☆97Updated 6 months ago
- Scripts for fine-tuning Llama2 via SFT and DPO.☆196Updated last year
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆82Updated 2 months ago
- Open-source Large Language Models are Strong Zero-shot Query Likelihood Models for Document Ranking☆17Updated last year
- [NeurIPS 2024] Plan-on-Graph: Self-Correcting Adaptive Planning of Large Language Model on Knowledge Graphs☆51Updated 2 months ago
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"☆62Updated 9 months ago
- LLM guided text clustering☆87Updated last year
- Finetune mistral-7b-instruct for sentence embeddings☆81Updated 11 months ago
- ☆68Updated last year
- Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"☆58Updated 5 months ago
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models☆17Updated last year
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆53Updated 6 months ago
- LLM Roleplay: Simulating Human-Chatbot Interaction☆26Updated 2 weeks ago
- [Neurips2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory☆59Updated last year
- ☆16Updated 8 months ago