fshnkarimi / Fine-tuning-an-LLM-using-LoRALinks
π Text Classification with LoRA (Low-Rank Adaptation) of Language Models - Efficiently fine-tune large language models for text classification tasks using the Stanford Sentiment Treebank (SST-2) dataset and the LoRA technique.
β50Updated last year
Alternatives and similar repositories for Fine-tuning-an-LLM-using-LoRA
Users that are interested in Fine-tuning-an-LLM-using-LoRA are comparing it to the libraries listed below
Sorting:
- a curated list of the role of small models in the LLM eraβ101Updated 9 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]β85Updated 5 months ago
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.β150Updated last year
- β31Updated 8 months ago
- β45Updated last month
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"β107Updated 9 months ago
- This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Raβ¦β77Updated last year
- β94Updated 7 months ago
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β77Updated 8 months ago
- Implementation and evaluation of multimodal RAG with text and image inputs for industrial applicationsβ54Updated 8 months ago
- FuseAI Projectβ87Updated 5 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [Fβ¦β66Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Modelsβ97Updated last year
- β150Updated last year
- β47Updated 9 months ago
- Large Language Models are zero-shot text classifiers; Smart Expert System: Large Language Models as Text Classifiersβ33Updated last year
- A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Humanβ¦β58Updated 2 years ago
- Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrieversβ68Updated last month
- β30Updated last year
- This is the repo of developing reasoning models in the specific domain of financial, aim to enhance models capabilities in handling finanβ¦β59Updated 2 weeks ago
- MathPrompter Implementation: This repository hosts an implementation based on the 'MathPrompter: Mathematical Reasoning Using Large Languβ¦β13Updated 3 months ago
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answeringβ106Updated 5 months ago
- Official repo of Respond-and-Respond: data, code, and evaluationβ103Updated 11 months ago
- This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".β107Updated 8 months ago
- β76Updated 5 months ago
- β56Updated 7 months ago
- Self-Reflection in LLM Agents: Effects on Problem-Solving Performanceβ77Updated 7 months ago
- Code and data for CoachLM, an automatic instruction revision approach LLM instruction tuning.β59Updated last year
- β20Updated 3 years ago
- β91Updated 5 months ago