lamm-mit / LLM-finetuningLinks
☆27Updated 10 months ago
Alternatives and similar repositories for LLM-finetuning
Users that are interested in LLM-finetuning are comparing it to the libraries listed below
Sorting:
- ☆75Updated 2 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆117Updated 5 months ago
- Unofficial implementation of https://arxiv.org/pdf/2407.14679☆46Updated 10 months ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆219Updated last month
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs☆312Updated last week
- ☆129Updated 10 months ago
- ☆210Updated 5 months ago
- minimal GRPO implementation from scratch☆92Updated 4 months ago
- ☆118Updated 10 months ago
- ☆71Updated 5 months ago
- Tina: Tiny Reasoning Models via LoRA☆269Updated last month
- ☆145Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆97Updated last month
- The official implementation of the paper "Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models".☆80Updated 4 months ago
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24☆142Updated last year
- A compact LLM pretrained in 9 days by using high quality data☆319Updated 3 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆75Updated this week
- Exploring Applications of GRPO☆243Updated 2 weeks ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆150Updated last month
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM.☆54Updated last month
- X-LoRA: Mixture of LoRA Experts☆231Updated 11 months ago
- Repository for Zochi's Research☆245Updated 2 weeks ago
- RetroLLM: Empowering LLMs to Retrieve Fine-grained Evidence within Generation [ACL 2025]☆115Updated 6 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆72Updated 7 months ago
- An open source implementation of LFMs from Liquid AI: Liquid Foundation Models☆101Updated 9 months ago
- Official codebase for "Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions" (Matrenok …☆20Updated last week
- Framework enabling modular interchange of language agents, environments, and optimizers☆98Updated this week
- Notes and commented code for RLHF (PPO)☆99Updated last year
- First-principle implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting…☆177Updated this week
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆112Updated last week