fshnkarimi / Fine-tuning-an-LLM-using-LoRA
Text Classification with LoRA (Low-Rank Adaptation) of Language Models - Efficiently fine-tune large language models for text classification tasks using the Stanford Sentiment Treebank (SST-2) dataset and the LoRA technique.
★54 · Updated 2 years ago
Alternatives and similar repositories for Fine-tuning-an-LLM-using-LoRA
Users who are interested in Fine-tuning-an-LLM-using-LoRA are comparing it to the repositories listed below.
- a curated list of the role of small models in the LLM era ★111 · Updated last year
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024] ★88 · Updated last year
- Implementation and evaluation of multimodal RAG with text and image inputs for industrial applications ★67 · Updated last year
- This is the code repo for our paper "Enhancing Knowledge Integration and Utilization of Large Language Models via Constructivist Cognitio… ★111 · Updated 3 months ago
- This is the repo of developing reasoning models in the specific domain of financial, aim to enhance models capabilities in handling finan… ★68 · Updated 7 months ago
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement ★193 · Updated last year
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`. ★156 · Updated 2 years ago
- FuseAI Project ★87 · Updated last year
- This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Ra… ★78 · Updated 2 years ago
- ★31 · Updated last year
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F… ★68 · Updated last year
- minimal scripts for 24GB VRAM GPUs. training, inference, whatever ★50 · Updated last month
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper ★32 · Updated last year
- Official repo of Respond-and-Respond: data, code, and evaluation ★103 · Updated last year
- ★78 · Updated 2 years ago
- A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human… ★60 · Updated 2 years ago
- ★161 · Updated last year
- [ACL'24] Official Implementation of the paper "Direct Evaluation of Chain-of-Thought in Multi-hop Reasoning with Knowledge Graphs"(https:… ★47 · Updated 9 months ago
- Two approaches for robust TableQA: 1) ITR is a general-purpose retrieval-based approach for handling long tables in TableQA transformer m… ★41 · Updated 2 years ago
- LLM guided text clustering ★114 · Updated 2 years ago
- Scripts for fine-tuning Llama2 via SFT and DPO. ★207 · Updated 2 years ago
- ★82 · Updated 2 months ago
- Automatic prompt optimization framework for multi-step agent tasks. ★36 · Updated last year
- "Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases" b… ★45 · Updated last year
- [ACL Oral 2025] The official GitHub repository for TC-RAG (Turing-Complete RAG) ★74 · Updated 11 months ago
- Large Language Models are zero-shot text classifiers; Smart Expert System: Large Language Models as Text Classifiers ★35 · Updated last year
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering ★117 · Updated last year
- This is a repository of RALM surveys containing a summary of state-of-the-art RAG and other technologies ★201 · Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ★77 · Updated last year
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators" ★137 · Updated 2 years ago