h9-tec / Qwen_MOE_C
☆41 Updated 5 months ago
Alternatives and similar repositories for Qwen_MOE_C
Users who are interested in Qwen_MOE_C are comparing it to the libraries listed below.
- SwiftLet is a lightweight Python framework for running open-source Large Language Models (LLMs) locally using safetensors ☆28 Updated 5 months ago
- ☆122 Updated 7 months ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search … ☆47 Updated 4 months ago
- Load and run Llama from safetensors files in C ☆15 Updated last year
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing. ☆21 Updated 4 months ago
- ☆109 Updated 6 months ago
- ☆18 Updated 4 months ago
- Orchestrator Kit for Agentic Reasoning - OrKa is a modular AI orchestration system that transforms Large Language Models (LLMs) into comp… ☆83 Updated last week
- ☆51 Updated 3 months ago
- Local Qwen3 LLM inference. One easy-to-understand file of C source with no dependencies. ☆154 Updated 6 months ago
- Running Microsoft's BitNet via Electron, React & Astro ☆49 Updated 3 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe… ☆87 Updated this week
- ☆63 Updated 6 months ago
- High-Performance Text Deduplication Toolkit ☆61 Updated 4 months ago
- Sherpa-onnx-tts-stt source for a Home Assistant add-on with Kroko ONNX streaming STT integration. ☆23 Updated last month
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆31 Updated 8 months ago
- A little(lil) Language Model (LM). A tiny reproduction of LLaMA 3's model architecture. ☆54 Updated 8 months ago
- Enhancing LLMs with LoRA ☆205 Updated 2 months ago
- automatically quant GGUF models ☆220 Updated 3 weeks ago
- ☆23 Updated last year
- ☆19 Updated 3 months ago
- ☆23 Updated last month
- ☆51 Updated 10 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API ☆72 Updated last year
- Something similar to Apple Intelligence? ☆59 Updated last year
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨ ☆101 Updated 6 months ago
- ☆100 Updated 7 months ago
- an AI interaction tool with RAG hybrid search, conversation context, web content processing and structured data analysis with LLM / GPT ☆210 Updated 7 months ago
- Learn the building blocks of how to build gpt-oss from scratch ☆110 Updated 3 months ago
- run ollama & gguf easily with a single command ☆52 Updated last year