lamm-mit / LLM-finetuningLinks

☆27

Alternatives and similar repositories for LLM-finetuning

Users that are interested in LLM-finetuning are comparing it to the libraries listed below

Sorting:

lamm-mit / AtomAgents
☆75Updated 2 months ago
AgnostiqHQ / multi-agent-llm
Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)
☆117Updated 5 months ago
alperiox / Compact-Language-Models-via-Pruning-and-Knowledge-Distillation
Unofficial implementation of https://arxiv.org/pdf/2407.14679
☆46Updated 10 months ago
spcl / MRAG
Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"
☆219Updated last month
deep-diver / llamaduo
[ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs
☆312Updated last week
AIRI-Institute / AriGraph
☆129Updated 10 months ago
vsubramaniam851 / multiagent-ft
☆210Updated 5 months ago
fangyuan-ksgk / Tiny-GRPO
minimal GRPO implementation from scratch
☆92Updated 4 months ago
writer / writing-in-the-margins
☆118Updated 10 months ago
LLMSELECTOR / LLMSELECTOR
☆71Updated 5 months ago
shangshang-wang / Tina
Tina: Tiny Reasoning Models via LoRA
☆269Updated last month
apple / ml-superposition-prompting
☆145Updated last year
THU-KEG / Agentic-Reward-Modeling
[ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
☆97Updated last month
fairyshine / Chain-of-Tools
The official implementation of the paper "Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models".
☆80Updated 4 months ago
myeon9h / PlanRAG
Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24
☆142Updated last year
Pints-AI / 1.5-Pints
A compact LLM pretrained in 9 days by using high quality data
☆319Updated 3 months ago
facebookresearch / matrix
Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…
☆75Updated this week
brendanhogan / DeepSeekRL-Extended
Exploring Applications of GRPO
☆243Updated 2 weeks ago
ByteDance-Seed / Agent-R
Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"
☆150Updated last month
kagnlp / CodeGenerator
This repository contains popular code generation frameworks such as MapCoder, CodeSIM.
☆54Updated last month
EricLBuehler / xlora
X-LoRA: Mixture of LoRA Experts
☆231Updated 11 months ago
IntologyAI / Zochi
Repository for Zochi's Research
☆245Updated 2 weeks ago
sunnynexus / RetroLLM
RetroLLM: Empowering LLMs to Retrieve Fine-grained Evidence within Generation [ACL 2025]
☆115Updated 6 months ago
HishamAlyahya / semantic_backprop
Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖
☆72Updated 7 months ago
Decentralised-AI / LFM-Liquid-AI-Liquid-Foundation-Models
An open source implementation of LFMs from Liquid AI: Liquid Foundation Models
☆101Updated 9 months ago
CLAIRE-Labo / quantile-reward-policy-optimization
Official codebase for "Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions" (Matrenok …
☆20Updated last week
Future-House / ldp
Framework enabling modular interchange of language agents, environments, and optimizers
☆98Updated this week
hkproj / rlhf-ppo
Notes and commented code for RLHF (PPO)
☆99Updated last year
Jaykef / ai-algorithms
First-principle implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting…
☆177Updated this week
eqimp / hogwild_llm
Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache
☆112Updated last week