lamm-mit / LLM-finetuningLinks
☆29Updated last year
Alternatives and similar repositories for LLM-finetuning
Users that are interested in LLM-finetuning are comparing it to the libraries listed below
Sorting:
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆235Updated 2 months ago
- ☆90Updated 7 months ago
- minimal GRPO implementation from scratch☆100Updated 9 months ago
- ☆26Updated last year
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆125Updated 10 months ago
- An open source implementation of LFMs from Liquid AI: Liquid Foundation Models☆114Updated last year
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆195Updated last year
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)☆132Updated last year
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆242Updated last year
- ☆278Updated last year
- ☆573Updated 7 months ago
- ☆79Updated 2 months ago
- ☆48Updated 8 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation (EMNLP2024)☆344Updated last year
- nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO)☆138Updated 7 months ago
- ☆120Updated last year
- Graph-R1: Towards Agentic GraphRAG Framework via End-to-end Reinforcement Learning☆457Updated 3 months ago
- ☆37Updated 7 months ago
- ☆147Updated last year
- Notebooks and code with some RAG techniques using llamaindex☆30Updated last year
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM.☆68Updated 6 months ago
- An extension of the nanoGPT repository for training small MOE models.☆219Updated 9 months ago
- First-principle implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting…☆180Updated 5 months ago
- Official code of the ACL 2025 paper "SimGRAG: Leveraging Similar Subgraphs for Knowledge Graphs Driven Retrieval-Augmented Generation"☆131Updated 5 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆61Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆115Updated 6 months ago
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering☆116Updated 10 months ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆165Updated 2 months ago
- A compact LLM pretrained in 9 days by using high quality data☆337Updated 8 months ago