A miniaturized version of the Kimi-K2 model optimized for deployment on single H100 GPUs.
☆35Jul 16, 2025Updated 9 months ago
Alternatives and similar repositories for Kimi-K2-Mini
Users that are interested in Kimi-K2-Mini are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An educational Rust project for exporting and running inference on Qwen3 LLM family☆43Aug 3, 2025Updated 9 months ago
- Multi-agent orchestration framework for AI applications - build, deploy, and manage AI agents across the full lifecycle with Forge, Conve…☆30Mar 28, 2026Updated last month
- A MCP stdio toolpack for local LLMs☆31Apr 6, 2026Updated last month
- Protocol for Augmented Memory of Project Artifacts (MCP compatible) - extended☆24Jan 24, 2026Updated 3 months ago
- A curated collection of persona-based mcp server & tool groupings.☆36Sep 11, 2025Updated 7 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆16Feb 1, 2025Updated last year
- LLamaHTML is a simple html file to communicate with a running llamacpp llama-server☆23Aug 5, 2025Updated 9 months ago
- ☆51Feb 19, 2026Updated 2 months ago
- Desktop application for instant AI-powered text transformation. Translate, correct, summarize, and change the tone of any text, anywhere,…☆32Dec 29, 2025Updated 4 months ago
- A sleek web interface for Ollama, making local LLM management and usage simple. WebOllama provides an intuitive UI to manage Ollama model…☆75Oct 8, 2025Updated 7 months ago
- A tool that can be used to measure the sequential performance of any OpenAI-compatible LLM API☆24Aug 1, 2024Updated last year
- A bytebot variant that uses Holo 1.5 7b to control the desktop☆25Nov 4, 2025Updated 6 months ago
- A high-performance FastAPI-based server that provides OpenAI-compatible Text-to-Speech (TTS) endpoints using the Orpheus TTS https://gith…☆30Nov 15, 2025Updated 5 months ago
- ☆46Mar 22, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Agentic BYOK Browser-Based Website Builder☆44Updated this week
- ☆38Mar 6, 2026Updated 2 months ago
- A prototype server to swarm multiple DATs for Webrecorder☆14Apr 27, 2019Updated 7 years ago
- Local banking voice assistant focused on banking☆56Apr 10, 2026Updated 3 weeks ago
- A simple but well-featured code editor☆11Apr 30, 2018Updated 8 years ago
- A complete Earthstar toolbox in the console.☆13Jan 2, 2023Updated 3 years ago
- npm package template with typescript and tsup☆10Nov 27, 2025Updated 5 months ago
- ☆100Oct 3, 2025Updated 7 months ago
- For the better CI as well as CD using gogs and drone base on kubernetes☆10Jul 31, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- QuickJs based wrapper generator for WASM components in written in JavaScript☆18Apr 30, 2026Updated last week
- Tiny, composable Atomic CSS engine☆13Apr 1, 2022Updated 4 years ago
- ☆11Feb 28, 2022Updated 4 years ago
- Vite utility for vue3 server side rendering☆10Jan 23, 2026Updated 3 months ago
- Code for "Can We Characterize Tasks Without Labels or Features?" (CVPR 2021)☆11Aug 31, 2021Updated 4 years ago
- css3d level editor☆15Jul 20, 2017Updated 8 years ago
- Collection of multiplayer WebGL games.☆11Aug 28, 2015Updated 10 years ago
- A holistic framework for advancing LLMs as data science agents☆40Feb 3, 2026Updated 3 months ago
- Play Balatro with LLMs 🎯☆83Apr 19, 2026Updated 2 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A rudimentary Electron application that utilises WebTorrent to stream and download a torrent simultaneously☆15Apr 8, 2018Updated 8 years ago
- ☆10Mar 28, 2017Updated 9 years ago
- data availability service for DAT☆17Jun 10, 2024Updated last year
- A PoC in-game level editor for Unity game engine☆13Nov 13, 2018Updated 7 years ago
- Open WebUI tool — Give your LLM a persistent workspace with file storage, SQLite, archives, and collaboration.☆113Feb 2, 2026Updated 3 months ago
- Implement rest api service for manipulating blog contents using FastAPI in Python☆12Feb 14, 2023Updated 3 years ago
- Like system requirements lab but for LLMs☆31Jun 10, 2023Updated 2 years ago