lamm-mit / LLM-finetuningLinks
☆30Updated last year
Alternatives and similar repositories for LLM-finetuning
Users that are interested in LLM-finetuning are comparing it to the libraries listed below
Sorting:
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆239Updated 4 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆128Updated last year
- ☆27Updated last year
- Master the essential steps of pretraining large language models (LLMs). Learn to create high-quality datasets, configure model architectu…☆24Updated last year
- LoRA supervised fine-tuning, RLHF (PPO) and RAG with llama-3-8B on the TLDR summarization dataset☆14Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆125Updated 8 months ago
- minimal GRPO implementation from scratch☆102Updated 10 months ago
- ☆80Updated 4 months ago
- ☆51Updated 10 months ago
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)☆132Updated last year
- This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation (EMNLP2024)☆344Updated last year
- This is the official repository for Auto-RAG.☆232Updated 6 months ago
- ☆229Updated 11 months ago
- Repository for Zochi's Research☆300Updated 2 months ago
- Comprehensive toolkit for Reinforcement Learning from Human Feedback (RLHF) training, featuring instruction fine-tuning, reward model tra…☆182Updated last year
- ☆120Updated last year
- Automated Hypothesis Testing with Agentic Sequential Falsifications☆245Updated 8 months ago
- ☆582Updated 9 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆261Updated this week
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering☆120Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- Official code repository for Sketch-of-Thought (SoT)☆135Updated 9 months ago
- Framework enabling modular interchange of language agents, environments, and optimizers☆121Updated this week
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆242Updated last year
- ☆107Updated 10 months ago
- Official code of the ACL 2025 paper "SimGRAG: Leveraging Similar Subgraphs for Knowledge Graphs Driven Retrieval-Augmented Generation"☆134Updated 6 months ago
- ☆159Updated last year
- Banishing LLM Hallucinations Requires Rethinking Generalization☆277Updated last year
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆200Updated last year
- ☆282Updated last year