lamm-mit / LLM-finetuning
☆28Updated last year
Alternatives and similar repositories for LLM-finetuning
Users who are interested in LLM-finetuning are comparing it to the libraries listed below.
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆122Updated 9 months ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆232Updated last month
- ☆141Updated last year
- ☆120Updated last year
- An open source implementation of LFMs from Liquid AI: Liquid Foundation Models☆113Updated last year
- ☆25Updated last year
- minimal GRPO implementation from scratch☆99Updated 8 months ago
- ☆18Updated 4 months ago
- Official code repository for Sketch-of-Thought (SoT)☆129Updated 6 months ago
- SciAgent: A Unified Multi-Agent System for Generalistic Scientific Reasoning☆82Updated last week
- The official implementation of the paper "Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models".☆84Updated 7 months ago
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24☆150Updated last year
- ☆79Updated last month
- X-LoRA: Mixture of LoRA Experts☆251Updated last year
- [ACL 2025] Agentic Knowledgeable Self-awareness☆89Updated 5 months ago
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)☆130Updated last year
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆165Updated 2 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆112Updated 5 months ago
- ☆35Updated 6 months ago
- [ACL 2025] RetroLLM: Empowering LLMs to Retrieve Fine-grained Evidence within Generation☆119Updated 10 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆101Updated this week
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆130Updated 3 months ago
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM.☆69Updated 4 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆105Updated 7 months ago
- ☆222Updated 8 months ago
- Repository for Zochi's Research☆284Updated 3 months ago
- The official repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆233Updated last week
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering☆114Updated 9 months ago
- Automated Hypothesis Testing with Agentic Sequential Falsifications☆231Updated 6 months ago
- This is the official repository for Auto-RAG.☆228Updated 4 months ago