mmarius / awesome-finetuning
A curated list of resources on fine-tuning language models.
☆23Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for awesome-finetuning
- Repository containing awesome resources regarding Hugging Face tooling.☆43Updated 10 months ago
- Documentation for dynamic machine learning systems.☆27Updated 2 months ago
- Basic guidance on how to contribute to Papers with Code☆20Updated 2 years ago
- Prompt Engineering for Large Language Models - Notebooks, Demos, Exercises, and Projects☆19Updated last year
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!☆44Updated this week
- Inference examples☆18Updated 2 months ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆34Updated last year
- ☆21Updated 2 weeks ago
- Building LLM-Enabled Multi Agent Applications with AutoGen☆25Updated last month
- ML/DL Math and Method notes☆57Updated 11 months ago
- Projects completed under LinuxWorld Informatics Ltd. - MLOps Training.☆12Updated 4 years ago
- Based on the tree of thoughts paper☆45Updated last year
- Reward Model framework for LLM RLHF☆58Updated last year
- Solve Geometric & Graph Problems with Large Language Models☆28Updated last year
- 🐜🔧 A minimalistic tool to fine-tune your LLMs☆17Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆37Updated 5 months ago
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod…☆13Updated last year
- Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module Theory of Mind …☆17Updated 4 months ago
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆40Updated last year
- Clean RL implementation using MLX☆26Updated 8 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆13Updated 8 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆30Updated 9 months ago
- ☆10Updated 11 months ago
- Repo to reproduce the First-Explore paper results☆36Updated last week
- Repository for the paper Stream of Search: Learning to Search in Language☆84Updated 3 months ago
- The Next Generation Multi-Modality Superintelligence☆70Updated 2 months ago
- Solutions for the book "Speech and Language Processing" (3rd ed. draft) by Dan Jurafsky and James H. Martin☆15Updated 2 years ago
- A Mental Health conversational LLM☆34Updated last year
- Web interface to search ArXiv papers using NLP Sentence-Transformers, Faiss and Streamlit☆19Updated last year
- AI Multi-agent system for real-time, adaptive supply chain coordination and optimization leveraging responsive AI clusters.☆14Updated 7 months ago