Library for model distillation
☆169Sep 6, 2025Updated 9 months ago
Alternatives and similar repositories for DistillFlow
Users that are interested in DistillFlow are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Feb 20, 2025Updated last year
- ☆101Apr 14, 2025Updated last year
- The Fastest Way to Fine-Tune LLMs Locally☆342Dec 18, 2025Updated 6 months ago
- Scripts for training Qwen 2.5 VL with ms-swift and GRPO☆12Feb 27, 2025Updated last year
- Crow is a Desktop AI Assistant☆33Aug 9, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A pipeline for LLM knowledge distillation☆116May 7, 2026Updated last month
- ☆94Jul 7, 2025Updated 11 months ago
- ☆47Apr 6, 2025Updated last year
- Play with OpenAI API's using your own API Key. Your API Key is stored and used only from your browser.☆14Dec 20, 2025Updated 6 months ago
- Agent framework for generating a synthetic dataset. This will be raw CoT and Reflection output to be cleaned up by a later step.☆17Apr 11, 2025Updated last year
- Distributed Reinforcement Learning for LLM Fine-Tuning with multi-GPU utilization☆22Mar 12, 2025Updated last year
- Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.☆39Jul 2, 2025Updated 11 months ago
- A sleek web interface for Ollama, making local LLM management and usage simple. WebOllama provides an intuitive UI to manage Ollama model…☆79Oct 8, 2025Updated 8 months ago
- CompChomper is a framework for measuring how LLMs perform at code completion.☆21Apr 29, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Static suckless single batch CUDA-only qwen3-0.6B mini inference engine☆557Sep 8, 2025Updated 9 months ago
- ☆99Nov 6, 2024Updated last year
- Mentis: A powerful multi-agent orchestration framework built on LangGraph.☆296May 16, 2025Updated last year
- 日期时间实体识别☆11Sep 10, 2020Updated 5 years ago
- An fully autonomous agent that accesses the browser and performs tasks.☆18Apr 25, 2025Updated last year
- ☆19Aug 19, 2025Updated 10 months ago
- Command-line personal assistant using your favorite proprietary or local models with access to over 30+ tools☆109Jun 27, 2025Updated last year
- Service for testing out the new Qwen2.5 omni model☆62Apr 30, 2025Updated last year
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search …☆52Feb 10, 2026Updated 4 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- an auto-sleeping and -waking framework around llama.cpp☆13Feb 8, 2025Updated last year
- A tool for generating function arguments and choosing what function to call with local LLMs☆437Mar 12, 2024Updated 2 years ago
- ☆20Jul 4, 2025Updated 11 months ago
- ☆16Feb 5, 2025Updated last year
- [ICLR'25] ApolloMoE: Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts☆53Nov 20, 2024Updated last year
- Run Orpheus 3B Locally with Gradio UI, Standalone App☆24Apr 1, 2025Updated last year
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆690Mar 22, 2025Updated last year
- A CI/CD tool that automatically captures code changes, generates mobile-optimized HTML diffs, uploads them to cloud storage, and sends no…☆28Sep 9, 2025Updated 9 months ago
- A proxy that hosts multiple single-model runners such as LLama.cpp and vLLM☆12May 30, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Clipboard Regex Replace is a lightweight GoLang application that allows you to automatically apply regex-based replacements to your clipb…☆10Jan 20, 2026Updated 5 months ago
- Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.☆20Jan 10, 2025Updated last year
- Build, Evaluate, and Optimize AI Systems. Includes evals, RAG, agents, fine-tuning, synthetic data generation, dataset management, MCP, a…☆4,933Updated this week
- This is a pre-built wheel of Triton 3.3.0 for Windows with Nvidia only + Proton☆44May 18, 2025Updated last year
- Generate Your Own Private Morning Radio for Commute☆33Feb 5, 2025Updated last year
- Proxy based on QUIC.☆11Feb 3, 2022Updated 4 years ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆69Aug 21, 2024Updated last year