h9-tec / Qwen_MOE_C
☆35Updated last week
Alternatives and similar repositories for Qwen_MOE_C
Users that are interested in Qwen_MOE_C are comparing it to the libraries listed below
- ☆113Updated last month
- SwiftLet is a lightweight Python framework for running open-source Large Language Models (LLMs) locally using safetensors☆20Updated last week
- Load and run Llama from safetensors files in C☆12Updated 9 months ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools.☆32Updated last month
- SPLAA is an AI assistant framework that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversati…☆28Updated 3 months ago
- Local Qwen3 LLM inference. One easy-to-understand file of C source with no dependencies.☆98Updated last month
- Lightweight C inference for Qwen3 GGUF with the smallest (0.6B) at the fullest (FP32)☆15Updated last week
- ☆93Updated last month
- ☆56Updated last month
- ☆16Updated 2 weeks ago
- ☆14Updated 6 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆67Updated last month
- Video+code lecture on building nanoGPT from scratch☆69Updated last year
- Service for testing out the new Qwen2.5 omni model☆55Updated 3 months ago
- ☆101Updated 2 months ago
- ☆23Updated 9 months ago
- ☆57Updated last month
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆31Updated 3 months ago
- LexiCrawler is a powerful Go-based web crawling API meticulously designed to extract, clean, and transform web page content into a pristi…☆47Updated 5 months ago
- Locally running LLM with internet access☆96Updated last month
- InferX is an Inference Function as a Service Platform☆123Updated 2 weeks ago
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…☆49Updated 2 months ago
- Running Microsoft's BitNet via Electron, React & Astro☆43Updated 2 months ago
- ☆24Updated 6 months ago
- an AI interaction tool with RAG hybrid search, conversation context, web content processing and structured data analysis with LLM / GPT☆196Updated last month
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆41Updated 3 weeks ago
- run ollama & gguf easily with a single command☆52Updated last year
- One click templates for inferencing Language Models☆203Updated this week
- Tiny Llama model trained to play chess☆24Updated 2 weeks ago
- ☆132Updated 3 months ago