unslothai / unsloth
Finetune Qwen3, Llama 4, TTS, DeepSeek-R1 & Gemma 3 LLMs 2x faster with 70% less memory! 🦥
☆37,861Updated this week
Alternatives and similar repositories for unsloth:
Users that are interested in unsloth are comparing it to the libraries listed below
- A high-throughput and memory-efficient inference and serving engine for LLMs☆46,456Updated this week
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆48,206Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆13,829Updated this week
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.☆19,634Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆21,842Updated this week
- Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.☆139,442Updated this week
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.☆43,502Updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆17,222Updated this week
- LLM inference in C/C++☆79,077Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆92,548Updated this week
- MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone☆19,346Updated 2 months ago
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆37,357Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆24,672Updated this week
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚☆27,929Updated last month
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆24,899Updated this week
- A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations☆13,844Updated this week
- Agno is a lightweight library for building Agents with memory, knowledge, tools and reasoning.☆25,993Updated this week
- DSPy: The framework for programming—not prompting—language models☆23,963Updated this week
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.☆18,274Updated this week
- Go ahead and axolotl questions☆9,258Updated this week
- A simple screen parsing tool towards pure vision based GUI agent☆21,888Updated last month
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.☆19,718Updated last month
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆24,184Updated 3 months ago
- Memory for AI Agents; SOTA in AI Agent Memory, beating OpenAI Memory in accuracy by 26% - https://mem0.ai/research☆28,444Updated this week
- An open-source RAG-based tool for chatting with your documents.☆22,126Updated 3 weeks ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,064Updated last week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI☆21,616Updated last week
- RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.☆51,166Updated this week
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆41,371Updated this week
- Toolkit for linearizing PDFs for LLM datasets/training☆12,238Updated this week