lamm-mit / LLM-finetuning
☆26 Updated 9 months ago
Alternatives and similar repositories for LLM-finetuning
Users who are interested in LLM-finetuning are comparing it to the libraries listed below
- minimal GRPO implementation from scratch ☆90 Updated 3 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT) ☆115 Updated 4 months ago
- ☆74 Updated last month
- Set of scripts to finetune LLMs ☆37 Updated last year
- ☆22 Updated 10 months ago
- Unofficial implementation of https://arxiv.org/pdf/2407.14679 ☆45 Updated 9 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀 ☆47 Updated last year
- Simple examples using Argilla tools to build AI ☆53 Updated 7 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM ☆55 Updated last year
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs" ☆212 Updated last week
- Repo hosting codes and materials related to speeding LLMs' inference using token merging. ☆36 Updated last year
- Train your own SOTA deductive reasoning model ☆94 Updated 3 months ago
- ☆118 Updated 10 months ago
- The official implementation of the paper "Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models". ☆78 Updated 3 months ago
- ☆93 Updated last month
- Open AI data scientist agent that automates complex data analysis tasks using the ReAct framework. Execute Python code locally or in the … ☆62 Updated this week
- Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeed ☆18 Updated last year
- A Software Framework Enabling Modular Interchange of Language Agents, Environments, and Optimizers ☆92 Updated this week
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆71 Updated this week
- code for training & evaluating Contextual Document Embedding models ☆195 Updated last month
- ☆66 Updated last year
- Working implementation of DeepSeek MLA ☆42 Updated 5 months ago
- ☆69 Updated 4 months ago
- Official implementation of Language Models as Compilers: Simulating the Execution Of Pseudocode Improves Algorithmic Reasoning in Languag… ☆22 Updated last year
- Prune transformer layers ☆69 Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆91 Updated 5 months ago
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond) ☆123 Updated 9 months ago
- Adaptive Parallel PDF Parsing and Resource Scaling Engine ☆43 Updated last month
- Official code of the paper "SimGRAG: Leveraging Similar Subgraphs for Knowledge Graphs Driven Retrieval-Augmented Generation" ☆114 Updated 6 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning? ☆68 Updated 3 months ago