unslothai / unsloth
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
☆28,154Updated this week
Alternatives and similar repositories for unsloth:
Users that are interested in unsloth are comparing it to the libraries listed below
- A high-throughput and memory-efficient inference and serving engine for LLMs☆37,544Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆22,315Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆17,402Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆9,320Updated this week
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆40,309Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆11,519Updated last week
- The official Meta Llama 3 GitHub site☆28,268Updated 2 weeks ago
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆16,189Updated this week
- Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.☆15,158Updated this week
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t…☆26,435Updated this week
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.☆35,901Updated this week
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.☆16,773Updated last week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆25,340Updated this week
- Agno is a lightweight framework for building multi-modal Agents☆18,785Updated this week
- Go ahead and axolotl questions☆8,566Updated this week
- Universal LLM Deployment Engine with ML Compilation☆19,930Updated this week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI☆19,747Updated last week
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆38,847Updated this week
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚☆22,517Updated this week
- DSPy: The framework for programming—not prompting—language models☆21,807Updated this week
- Drag & drop UI to build your customized LLM flow☆35,001Updated this week
- LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.☆17,949Updated this week
- A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/aut…☆39,360Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆20,555Updated this week
- LLM inference in C/C++☆73,971Updated this week
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.☆18,642Updated 3 months ago
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆72,741Updated this week
- MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone☆18,359Updated this week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆21,760Updated 3 weeks ago
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p…☆28,698Updated this week