h9-tec / Qwen_MOE_CView external linksLinks
☆42Aug 2, 2025Updated 6 months ago
Alternatives and similar repositories for Qwen_MOE_C
Users that are interested in Qwen_MOE_C are comparing it to the libraries listed below
Sorting:
- ☆23Jan 14, 2025Updated last year
- A MCP stdio toolpack for local LLMs☆19Oct 6, 2025Updated 4 months ago
- Single-file, pure CUDA C implementation for running inference on Qwen3 0.6B GGUF. No Dependencies.☆22Nov 26, 2025Updated 2 months ago
- ☆21Aug 9, 2025Updated 6 months ago
- ☆12May 11, 2024Updated last year
- Protocol for Augmented Memory of Project Artifacts (MCP compatible) - extended☆24Jan 24, 2026Updated 3 weeks ago
- LLamaHTML is a simple html file to communicate with a running llamacpp llama-server☆22Aug 5, 2025Updated 6 months ago
- AI Based "Happiness Optimizer"☆12Oct 20, 2024Updated last year
- A sleek web interface for Ollama, making local LLM management and usage simple. WebOllama provides an intuitive UI to manage Ollama model…☆67Oct 8, 2025Updated 4 months ago
- ☆38Jan 15, 2025Updated last year
- ☆15Feb 1, 2025Updated last year
- A curated collection of persona-based mcp server & tool groupings.☆34Sep 11, 2025Updated 5 months ago
- SwiftLet is a lightweight Python framework for running open-source Large Language Models (LLMs) locally using safetensors☆28Aug 6, 2025Updated 6 months ago
- ☆36Updated this week
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- High-Performance Text Deduplication Toolkit☆61Aug 25, 2025Updated 5 months ago
- ☆19Jul 4, 2025Updated 7 months ago
- These agents work based on any local model. You ask your question and simply indicate the number of agents and experts who will answer it…☆19Feb 25, 2024Updated last year
- A powerful system for crawling documentation websites, extracting code snippets, and providing fast search capabilities via MCP (Model C…☆27Dec 25, 2025Updated last month
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search …☆50Feb 10, 2026Updated last week
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing.☆23Sep 1, 2025Updated 5 months ago
- ☆17Jan 27, 2025Updated last year
- Llama.cui is a small llama.cpp-based chat application for Node.js☆20Jul 10, 2025Updated 7 months ago
- Lightweight Llama 3 8B Inference Engine in CUDA C☆53Mar 21, 2025Updated 10 months ago
- ☆23Dec 9, 2025Updated 2 months ago
- Service for testing out the new Qwen2.5 omni model☆63Apr 30, 2025Updated 9 months ago
- Metal GPU implementation of the Qwen3 transformer model on macOS with complete Apple Silicon compute shader acceleration.☆42Oct 6, 2025Updated 4 months ago
- ☆23Sep 27, 2024Updated last year
- Game Companion AI is an advanced application designed to enhance the gaming experience by providing real-time analysis and interpretation…☆54Sep 30, 2024Updated last year
- A professional-grade interface for Qwen3-TTS, designed to unlock the model's full potential with fine-grained control and intuitive workf…☆177Feb 10, 2026Updated last week
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Jan 30, 2025Updated last year
- An API for VoiceCraft.☆25Jun 27, 2024Updated last year
- A persistent local memory for AI, LLMs, or Copilot in VS Code.☆194Oct 27, 2025Updated 3 months ago
- ☆100Jul 26, 2025Updated 6 months ago
- ☆34Mar 22, 2025Updated 10 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆66Aug 21, 2024Updated last year
- This project provides code to accompany the "AI and ML for Web Devs" video series, focusing on teaching AI and ML concepts through hands-…☆36Oct 12, 2025Updated 4 months ago
- Spotlight-like client for Ollama on Windows.☆28May 18, 2024Updated last year
- Text-to-Speech (TTS) engine for the Armenian language☆12Sep 29, 2024Updated last year