fshnkarimi / Fine-tuning-an-LLM-using-LoRA
π Text Classification with LoRA (Low-Rank Adaptation) of Language Models - Efficiently fine-tune large language models for text classification tasks using the Stanford Sentiment Treebank (SST-2) dataset and the LoRA technique.
β47Updated last year
Alternatives and similar repositories for Fine-tuning-an-LLM-using-LoRA:
Users that are interested in Fine-tuning-an-LLM-using-LoRA are comparing it to the libraries listed below
- Implementation and evaluation of multimodal RAG with text and image inputs for industrial applicationsβ43Updated 4 months ago
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refβ¦β46Updated 3 weeks ago
- This is the code of MMOA-RAG.β44Updated last week
- β24Updated 6 months ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationalesβ78Updated last month
- This repository contains a custom implementation of the BERT model, fine-tuned for specific tasks, along with an implementation of Low Raβ¦β69Updated last year
- MathPrompter Implementation: This repository hosts an implementation based on the 'MathPrompter: Mathematical Reasoning Using Large Languβ¦β13Updated 8 months ago
- Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuningβ46Updated last year
- Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrieversβ65Updated 10 months ago
- FuseAI Projectβ84Updated 2 months ago
- β16Updated 5 months ago
- Create a knowledge graph out of unstructed legal text - use said knowledge graph in a graph augmented retrieval augmented generation pipeβ¦β39Updated 6 months ago
- PGRAGβ47Updated 8 months ago
- β56Updated 6 months ago
- β35Updated 2 months ago
- a curated list of the role of small models in the LLM eraβ95Updated 6 months ago
- β142Updated 11 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [Fβ¦β62Updated 10 months ago
- β74Updated last year
- LangChain, Llama2-Chat, and zero- and few-shot prompting are used to generate synthetic datasets for IR and RAG system evaluationβ38Updated last year
- β20Updated 3 years ago
- β20Updated 8 months ago
- Self-Reflection in LLM Agents: Effects on Problem-Solving Performanceβ63Updated 4 months ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systemsβ75Updated 3 weeks ago
- Codebase accompanying the Summary of a Haystack paper.β75Updated 6 months ago
- Fine-Tuning LLM and embedding modelsβ27Updated last year
- β88Updated last month
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 8 months ago
- CoNLI: a plug-and-play framework for ungrounded hallucination detection and reductionβ29Updated last year
- β73Updated 2 months ago