5X faster 60% less memory QLoRA finetuning
☆21May 28, 2024Updated last year
Alternatives and similar repositories for unsloth
Users that are interested in unsloth are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31May 22, 2024Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks (EMNLP'24)☆146Sep 20, 2024Updated last year
- Using modal.com to process FineWeb-edu data☆20Apr 11, 2026Updated 3 weeks ago
- Curriculum training of instruction-following LLMs with Unsloth☆14Dec 15, 2025Updated 4 months ago
- Collection of autoregressive model implementation☆85Feb 23, 2026Updated 2 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆69May 26, 2024Updated last year
- My Gen AI research☆11Jun 3, 2024Updated last year
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT☆27Feb 18, 2024Updated 2 years ago
- ☆29Apr 29, 2024Updated 2 years ago
- ☆10Oct 18, 2023Updated 2 years ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆34Mar 7, 2025Updated last year
- build n8n workflows with AI☆26Nov 27, 2025Updated 5 months ago
- Kanban board made with TailwindCSS☆11Jun 10, 2021Updated 4 years ago
- Group-relative Trajectory-based Policy Optimization: Increasing Quality and Training Stability☆40Feb 23, 2026Updated 2 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A collection of autogen skills for use with locally run models☆14Feb 29, 2024Updated 2 years ago
- ☆15Apr 26, 2025Updated last year
- Porting of espressif/arduino-esp32 example to M5Stack CoreS3 (GC0308)☆11Nov 30, 2023Updated 2 years ago
- ☆11Aug 3, 2023Updated 2 years ago
- Build your own custom knowledge base from various sources such as youtube videos transcripts, tweets, articles, videos and audios. Uses G…☆13Dec 15, 2023Updated 2 years ago
- ☆26Dec 13, 2024Updated last year
- CI scripts designed to build a Pascal-compatible version of vLLM.☆12Aug 10, 2024Updated last year
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆92Feb 27, 2024Updated 2 years ago
- ☆14Feb 7, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- MicroPython viper documentation and examples☆16Apr 19, 2024Updated 2 years ago
- Diapositivas, notebooks y material de charlas, talleres y el grupo de estudio☆11Apr 24, 2024Updated 2 years ago
- Simple LLM inference server☆20Jun 13, 2024Updated last year
- ☆16Mar 14, 2024Updated 2 years ago
- A pipeline parallel training script for LLMs.☆168Apr 30, 2025Updated last year
- the small distributed language model toolkit; fine-tune state-of-the-art LLMs anywhere, rapidly☆32Oct 19, 2024Updated last year
- Tiny evaluation of leading LLMs on competitive programming problems☆14Apr 10, 2026Updated 3 weeks ago
- Python bindings for llama.cpp☆68Feb 29, 2024Updated 2 years ago
- ☆13Jun 29, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A copy of the DirectX Headers from MinGW-64.☆14Sep 7, 2023Updated 2 years ago
- A QT GUI for large language models☆40Dec 27, 2023Updated 2 years ago
- Vite + Mantine + Vanilla extract template☆12Apr 27, 2026Updated last week
- Replace OpenAI with Llama.cpp Automagically.☆329Jun 9, 2024Updated last year
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- Don't just regulate gradients like in Muon, regulate the weights too☆32Jul 30, 2025Updated 9 months ago
- The most powerful local music generation model that outperforms most commercial alternatives☆89Apr 20, 2026Updated 2 weeks ago