☆93 · Updated this week
Alternatives and similar repositories for nccl-mesh-plugin
Users who are interested in nccl-mesh-plugin are comparing it to the libraries listed below.
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing. ☆23 · Sep 1, 2025 · Updated 6 months ago
- ☆11 · Sep 18, 2023 · Updated 2 years ago
- Yet another `llama.cpp` Rust wrapper ☆12 · Jun 19, 2024 · Updated last year
- A realtime speech to text diarization system to gather and interleave speech from multiple speaker audio. ☆25 · Jan 29, 2026 · Updated last month
- ☆33 · Feb 6, 2026 · Updated 3 weeks ago
- Desktop application for instant AI-powered text transformation. Translate, correct, summarize, and change the tone of any text, anywhere,… ☆28 · Dec 29, 2025 · Updated 2 months ago
- One-Click RAG Implementation, Simple and Portable ☆30 · Oct 5, 2025 · Updated 4 months ago
- A c++ framework on efficient training & fine-tuning LLMs ☆28 · Feb 10, 2026 · Updated 2 weeks ago
- LLM FX: A LLM Server Desktop Client free for everyone! ☆36 · Updated this week
- Inference Llama3.2 1B/3B base/instruct models in 1 file of pure C ☆21 · Jul 22, 2025 · Updated 7 months ago
- fully local, temporally aware natural language file search on your pc! even without a GPU. find relevant files using natural language i… ☆171 · Dec 15, 2025 · Updated 2 months ago
- *NIX SHELL with Local AI/LLM integration ☆24 · Feb 26, 2025 · Updated last year
- An educational Rust project for exporting and running inference on Qwen3 LLM family ☆40 · Aug 3, 2025 · Updated 6 months ago
- Google Bot Guard Request ☆19 · Dec 13, 2019 · Updated 6 years ago
- Simple, Fast, Parallel Huggingface GGML model downloader written in python ☆24 · Jul 26, 2023 · Updated 2 years ago
- Demo of an "always-on" AI assistant. ☆24 · Feb 14, 2024 · Updated 2 years ago
- Code intelligence for AI assistants - MCP server, CLI, and HTTP API with symbol navigation, impact analysis, and architecture mapping ☆59 · Feb 17, 2026 · Updated last week
- A miniaturized version of the Kimi-K2 model optimized for deployment on single H100 GPUs. ☆36 · Jul 16, 2025 · Updated 7 months ago
- Run LLaMA inference on CPU, with Rust 🦀🚀🦙 ☆35 · Jan 5, 2025 · Updated last year
- Neural Audio Codecs implemented in C# - DAC, SNAC, Encodec, Dia ☆45 · Jun 11, 2025 · Updated 8 months ago
- Samples of good AI generated CUDA kernels ☆100 · May 30, 2025 · Updated 9 months ago
- Robust, privacy focused home AI assistant in Rust. ☆42 · Sep 21, 2025 · Updated 5 months ago
- Natural language control for Python CLI tools using locally-trained SLMs (CPU inference) ☆30 · Feb 21, 2026 · Updated last week
- Like system requirements lab but for LLMs ☆31 · Jun 10, 2023 · Updated 2 years ago
- AdaLLM is an NVFP4-first inference runtime for Ada Lovelace (RTX 4090) with FP8 KV cache and custom decode kernels. This repo targets NVF… ☆94 · Feb 15, 2026 · Updated 2 weeks ago
- Modification of daveshap/ChromaDB_Chatbot_Public that allows for end-users to customize the behavior/memories of the chatbot ☆13 · Jun 30, 2023 · Updated 2 years ago
- Store Terraform state for your GitHub Actions as an encrypted artifact or repository file. ☆10 · Jul 12, 2024 · Updated last year
- Your universal AI text processor, powered by local and cloud LLMs. Edit, refactor, and transform text in any application on Windows, macO… ☆72 · Nov 9, 2025 · Updated 3 months ago
- Helper package to spin-up a Qdrant instance without Docker ☆13 · Dec 24, 2023 · Updated 2 years ago
- ☆11 · Mar 1, 2023 · Updated 3 years ago
- "Pacha" TUI (Text User Interface) is a JavaScript application that utilizes the "blessed" library. It serves as a frontend for llama.cpp … ☆36 · Aug 3, 2023 · Updated 2 years ago
- MVC fastify decorator Dependency injection Inversion of Control Typescript ☆11 · Jan 5, 2023 · Updated 3 years ago
- A comprehensive technical review agent inspired by Bertrand Gilfoyle - providing code quality, security, architecture, and UX analysis wi… ☆13 · Aug 20, 2025 · Updated 6 months ago
- LCM Drawing app ☆12 · Dec 1, 2023 · Updated 2 years ago
- A desktop GUI for Flux 1.1 Pro built using DelphiFMX For Python ☆11 · Oct 5, 2024 · Updated last year
- Inference RWKV v7 in pure C. ☆44 · Oct 10, 2025 · Updated 4 months ago
- A novel media player that allows you to navigate by speaker ☆89 · Dec 22, 2025 · Updated 2 months ago
- deep hermes, but decides how to respond based on its OWN decision, no need for system prompts. ☆40 · Apr 1, 2025 · Updated 11 months ago
- Real-time Vision Language Model interaction via webcam - WebRTC-based web interface ☆231 · Dec 17, 2025 · Updated 2 months ago