lamm-mit / LLM-finetuning
☆25 Updated 8 months ago
Alternatives and similar repositories for LLM-finetuning
Users that are interested in LLM-finetuning are comparing it to the libraries listed below
- ☆74 Updated 3 weeks ago
- A Software Framework Enabling Modular Interchange of Language Agents, Environments, and Optimizers ☆87 Updated this week
- minimal GRPO implementation from scratch ☆90 Updated 2 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖 ☆68 Updated 6 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT) ☆113 Updated 3 months ago
- ☆249 Updated 10 months ago
- Graph-Aware Attention for Adaptive Dynamics in Transformers ☆59 Updated 5 months ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs" ☆209 Updated 2 weeks ago
- ☆68 Updated 3 months ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆91 Updated 3 months ago
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM. ☆51 Updated last month
- ☆22 Updated 10 months ago
- ☆123 Updated 8 months ago
- Repository for Zochi's Research ☆192 Updated last week
- nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO) ☆105 Updated 3 weeks ago
- Automated Hypothesis Testing with Agentic Sequential Falsifications ☆190 Updated 3 weeks ago
- Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluati… ☆43 Updated last month
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆60 Updated last week
- Data preparation code for Amber 7B LLM ☆91 Updated last year
- ☆204 Updated 3 months ago
- ☆93 Updated last week
- Official code repository for Sketch-of-Thought (SoT) ☆119 Updated 3 weeks ago
- SciQAG is a novel framework for automatically generating high-quality science question-answer pairs from a large corpus of scientific lit… ☆24 Updated 2 months ago
- The official implementation of the paper "Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models". ☆75 Updated 2 months ago
- The Open Source Code for LLM4SD (Large Language Models for Scientific Synthesis, Inference and Explanation) ☆108 Updated 5 months ago
- Automatically Update LLM-Agent Papers Daily using Github Actions (Update Every 12th hours) ☆15 Updated this week
- ☆92 Updated 2 months ago
- A Language Agent Gym with Challenging Scientific Tasks ☆183 Updated last week
- ☆118 Updated 9 months ago
- An open source implementation of LFMs from Liquid AI: Liquid Foundation Models ☆97 Updated 8 months ago