A miniaturized version of the Kimi-K2 model optimized for deployment on single H100 GPUs.
☆35 · Jul 16, 2025 · Updated 10 months ago
Alternatives and similar repositories for Kimi-K2-Mini
Users who are interested in Kimi-K2-Mini are comparing it to the libraries listed below.
- An educational Rust project for exporting and running inference on the Qwen3 LLM family ☆43 · Aug 3, 2025 · Updated 9 months ago
- An MCP stdio toolpack for local LLMs ☆32 · Apr 6, 2026 · Updated last month
- ☆43 · Aug 2, 2025 · Updated 9 months ago
- ☆16 · Feb 1, 2025 · Updated last year
- LLamaHTML is a simple HTML file for communicating with a running llama.cpp llama-server ☆24 · Aug 5, 2025 · Updated 9 months ago
- A sleek web interface for Ollama, making local LLM management and usage simple. WebOllama provides an intuitive UI to manage Ollama model… ☆79 · Oct 8, 2025 · Updated 7 months ago
- A tool that can be used to measure the sequential performance of any OpenAI-compatible LLM API ☆24 · Aug 1, 2024 · Updated last year
- A bytebot variant that uses Holo 1.5 7b to control the desktop ☆25 · Nov 4, 2025 · Updated 6 months ago
- A hackable library for running and fine-tuning modern transformer models on commodity and alternative GPUs, powered by tinygrad. ☆29 · Feb 10, 2026 · Updated 3 months ago
- A high-performance FastAPI-based server that provides OpenAI-compatible Text-to-Speech (TTS) endpoints using the Orpheus TTS https://gith… ☆31 · Nov 15, 2025 · Updated 6 months ago
- ☆47 · Mar 22, 2025 · Updated last year
- ☆41 · Mar 6, 2026 · Updated 2 months ago
- A prototype server to swarm multiple DATs for Webrecorder ☆14 · Apr 27, 2019 · Updated 7 years ago
- My first project applying machine learning to streaming data using Kafka and Docker. You can check out my GitHub repository f… ☆12 · Sep 14, 2022 · Updated 3 years ago
- Local voice assistant focused on banking ☆65 · Apr 10, 2026 · Updated last month
- ☆102 · Oct 3, 2025 · Updated 7 months ago
- Neural Network Genetic Algorithm library used for deep learning problems ☆18 · Jun 2, 2021 · Updated 4 years ago
- Better CI and CD using Gogs and Drone, based on Kubernetes ☆10 · Jul 31, 2021 · Updated 4 years ago
- Named Entity Recognition (NER) of diseases ☆13 · Feb 10, 2022 · Updated 4 years ago
- A modern, single-page web chat interface for local LLMs (Large Language Models), inspired by the visual style and UX of Anthropic's Claud… ☆32 · May 11, 2025 · Updated last year
- Tiny, composable Atomic CSS engine ☆13 · Apr 1, 2022 · Updated 4 years ago
- ☆11 · Feb 28, 2022 · Updated 4 years ago
- Demonstration of single sign-on with an OpenID provider. ☆12 · Oct 18, 2020 · Updated 5 years ago
- Code for "Can We Characterize Tasks Without Labels or Features?" (CVPR 2021) ☆11 · Aug 31, 2021 · Updated 4 years ago
- Collection of multiplayer WebGL games. ☆11 · Aug 28, 2015 · Updated 10 years ago
- Play Balatro with LLMs 🎯 ☆91 · Updated this week
- ☆16 · Aug 22, 2022 · Updated 3 years ago
- Data availability service for DAT ☆17 · Jun 10, 2024 · Updated last year
- A PoC in-game level editor for the Unity game engine ☆13 · Nov 13, 2018 · Updated 7 years ago
- Home Automation With MERN Stack ☆20 · Nov 18, 2024 · Updated last year
- A REST API service for manipulating blog content, implemented with FastAPI in Python ☆12 · Feb 14, 2023 · Updated 3 years ago
- [TIP 2022] Deep Posterior Distribution-based Embedding for Hyperspectral Image Super-resolution ☆13 · Nov 21, 2022 · Updated 3 years ago
- Like System Requirements Lab, but for LLMs ☆31 · Jun 10, 2023 · Updated 2 years ago
- npm package template with TypeScript and tsup ☆11 · Nov 27, 2025 · Updated 6 months ago
- Open WebUI tool: give your LLM a persistent workspace with file storage, SQLite, archives, and collaboration. ☆115 · Feb 2, 2026 · Updated 3 months ago
- Code for the paper "Adaptive Cross-Layer Attention for Image Restoration" ☆14 · Nov 6, 2025 · Updated 6 months ago
- "Pacha" TUI (Text User Interface) is a JavaScript application that utilizes the "blessed" library. It serves as a frontend for llama.cpp … ☆37 · Aug 3, 2023 · Updated 2 years ago
- Yet another frontend for LLMs, written using .NET and WinUI 3 ☆11 · Sep 14, 2025 · Updated 8 months ago
- SvelteKit GraphQL queries using fetch only: how you can drop Apollo client and urql dependencies altogether to make your Svelte app leane… ☆16 · Jul 23, 2025 · Updated 10 months ago