☆103 · Mar 6, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for nccl-mesh-plugin
Users that are interested in nccl-mesh-plugin are comparing it to the libraries listed below.
- A dedicated effort to make an optimized, bleeding edge vLLM image using Docker to support DGX comprehensively ☆58 · Feb 22, 2026 · Updated last month
- A realtime speech to text diarization system to gather and interleave speech from multiple speaker audio. ☆27 · Jan 29, 2026 · Updated last month
- A tiny model that teaches itself to code better. On your laptop. No cloud. No teacher model. No human feedback. ☆56 · Mar 10, 2026 · Updated 2 weeks ago
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing. ☆25 · Sep 1, 2025 · Updated 6 months ago
- Yet another `llama.cpp` Rust wrapper ☆12 · Jun 19, 2024 · Updated last year
- ☆33 · Feb 6, 2026 · Updated last month
- Loader extension for tabbyAPI in SillyTavern ☆26 · Jun 30, 2025 · Updated 8 months ago
- ☆11 · Sep 18, 2023 · Updated 2 years ago
- ☆14 · Dec 6, 2023 · Updated 2 years ago
- A pure and fast NumPy implementation of Mamba with cache support. ☆18 · Jun 16, 2024 · Updated last year
- ☆19 · Oct 13, 2025 · Updated 5 months ago
- Multiturn VLM Bulk captioning using your api service ☆35 · Mar 15, 2026 · Updated last week
- Desktop application for instant AI-powered text transformation. Translate, correct, summarize, and change the tone of any text, anywhere,… ☆29 · Dec 29, 2025 · Updated 2 months ago
- ☆27 · Jan 2, 2026 · Updated 2 months ago
- Ultimate Persona is an all-in-one persona generator and plot hook creator for SillyTavern. It uses pre-existing character cards to shape … ☆30 · Dec 30, 2025 · Updated 2 months ago
- Simple, Fast, Parallel Huggingface GGML model downloader written in python ☆24 · Jul 26, 2023 · Updated 2 years ago
- sparkrun - launch, manage, and stop LLM inference workloads on NVIDIA DGX Spark systems ☆44 · Mar 14, 2026 · Updated last week
- Professional desktop app for converting text to audiobooks with local TTS ☆31 · Oct 6, 2025 · Updated 5 months ago
- ☆43 · Jan 26, 2026 · Updated last month
- A c++ framework on efficient training & fine-tuning LLMs ☆27 · Mar 14, 2026 · Updated last week
- Inference Llama3.2 1B/3B base/instruct models in 1 file of pure C ☆22 · Jul 22, 2025 · Updated 8 months ago
- Automated multi-account farming tool for Kite AI decentralized payment network with faucet claims, token staking, DEX swaps, daily quiz c… ☆253 · Mar 13, 2026 · Updated last week
- Docker configuration for running VLLM on dual DGX Sparks ☆648 · Updated this week
- A tool-call based memory system for SillyTavern ☆30 · Dec 30, 2025 · Updated 2 months ago
- Orchestrator Kit for Agentic Reasoning - OrKa is a modular AI orchestration system that transforms Large Language Models (LLMs) into comp… ☆91 · Feb 25, 2026 · Updated last month
- Produce your own Dynamic 3.0 Quants and achieve optimum accuracy & SOTA quantization performance! Input your VRAM and RAM and the toolcha… ☆82 · Updated this week
- Neural Audio Codecs implemented in C# - DAC, SNAC, Encodec, Dia ☆45 · Jun 11, 2025 · Updated 9 months ago
- An educational Rust project for exporting and running inference on Qwen3 LLM family ☆40 · Aug 3, 2025 · Updated 7 months ago
- *NIX SHELL with Local AI/LLM integration ☆24 · Feb 26, 2025 · Updated last year
- minimal C implementation of speculative decoding based on llama2.c ☆28 · Jul 15, 2024 · Updated last year
- ☆29 · Feb 18, 2025 · Updated last year
- fully local, temporally aware natural language file search on your pc! even without a GPU. find relevant files using natural language i… ☆176 · Mar 13, 2026 · Updated last week
- The rag pipeline for optimizing dynamic data editing. ☆21 · Oct 30, 2025 · Updated 4 months ago
- ☆24 · Mar 12, 2026 · Updated last week
- Implements harmful/harmless refusal removal using pure HF Transformers ☆22 · May 8, 2025 · Updated 10 months ago
- Run LLaMA inference on CPU, with Rust 🦀🚀🦙 ☆35 · Jan 5, 2025 · Updated last year
- Rewritten frontend for SillyTavern ☆68 · Feb 28, 2026 · Updated 3 weeks ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆31 · May 1, 2025 · Updated 10 months ago
- ☆16 · Feb 12, 2026 · Updated last month