lamm-mit / LLM-finetuningLinks
☆27Updated 11 months ago
Alternatives and similar repositories for LLM-finetuning
Users that are interested in LLM-finetuning are comparing it to the libraries listed below
Sorting:
- ☆78Updated 3 months ago
- An open source implementation of LFMs from Liquid AI: Liquid Foundation Models☆111Updated 11 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆120Updated 6 months ago
- ☆75Updated 6 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆48Updated last year
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)☆128Updated 11 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆88Updated this week
- Graph-R1: Towards Agentic GraphRAG Framework via End-to-end Reinforcement Learning☆367Updated 2 weeks ago
- Framework enabling modular interchange of language agents, environments, and optimizers☆104Updated last week
- Automated Hypothesis Testing with Agentic Sequential Falsifications☆219Updated 3 months ago
- Graph-Aware Attention for Adaptive Dynamics in Transformers☆63Updated 7 months ago
- ☆102Updated last month
- ☆22Updated last year
- ☆118Updated last year
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆226Updated 2 months ago
- A collection of the the best ML and AI news every week (research, news, resources)☆168Updated last month
- ☆145Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR.☆103Updated 4 months ago
- The official implementation of the paper "Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models".☆85Updated 5 months ago
- open source alpha evolve☆67Updated 3 months ago
- Train your own SOTA deductive reasoning model☆104Updated 5 months ago
- ☆134Updated 11 months ago
- Notebooks and code with some RAG techniques using llamaindex☆30Updated last year
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs☆314Updated last month
- Banishing LLM Hallucinations Requires Rethinking Generalization☆276Updated last year
- MIRIAD is a million scale Medical Instruction and RetrIeval Datatset☆120Updated last week
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆56Updated last year
- Official codebase for "Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions" (Matrenok …☆24Updated last month
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆75Updated 8 months ago
- ☆34Updated last month