☆106 Mar 6, 2026 Updated last month
Alternatives and similar repositories for nccl-mesh-plugin
Users that are interested in nccl-mesh-plugin are comparing it to the libraries listed below.
- A dedicated effort to make an optimized, bleeding edge vLLM image using Docker to support DGX comprehensively ☆86 Feb 22, 2026 Updated last month
- Historical Language Model for London - A specialized LLM trained on 1500-1850 historical English text ☆29 Nov 1, 2025 Updated 5 months ago
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing. ☆25 Sep 1, 2025 Updated 7 months ago
- Yet another `llama.cpp` Rust wrapper ☆12 Jun 19, 2024 Updated last year
- ☆33 Feb 6, 2026 Updated 2 months ago
- Loader extension for tabbyAPI in SillyTavern ☆26 Jun 30, 2025 Updated 9 months ago
- ☆11 Sep 18, 2023 Updated 2 years ago
- ☆14 Dec 6, 2023 Updated 2 years ago
- A pure and fast NumPy implementation of Mamba with cache support. ☆18 Jun 16, 2024 Updated last year
- ☆22 Oct 13, 2025 Updated 6 months ago
- A high-performance FastAPI-based server that provides OpenAI-compatible Text-to-Speech (TTS) endpoints using the Orpheus TTS https://gith… ☆30 Nov 15, 2025 Updated 4 months ago
- Ultimate Persona is an all-in-one persona generator and plot hook creator for SillyTavern. It uses pre-existing character cards to shape … ☆34 Dec 30, 2025 Updated 3 months ago
- Simple, Fast, Parallel Huggingface GGML model downloader written in Python ☆24 Jul 26, 2023 Updated 2 years ago
- Store Terraform state for your GitHub Actions as an encrypted artifact or repository file. ☆10 Jul 12, 2024 Updated last year
- KV Cache & LoRA for minGPT ☆62 Mar 4, 2026 Updated last month
- ☆44 Jan 26, 2026 Updated 2 months ago
- PyTorch wheels for linux riscv64 ☆18 Sep 21, 2024 Updated last year
- A C++ framework for efficient training & fine-tuning of LLMs ☆27 Updated this week
- Inference Llama3.2 1B/3B base/instruct models in 1 file of pure C ☆22 Jul 22, 2025 Updated 8 months ago
- Orchestrator Kit for Agentic Reasoning - OrKa is a modular AI orchestration system that transforms Large Language Models (LLMs) into comp… ☆93 Mar 25, 2026 Updated 2 weeks ago
- An educational Rust project for exporting and running inference on Qwen3 LLM family ☆42 Aug 3, 2025 Updated 8 months ago
- Demo of an "always-on" AI assistant. ☆24 Feb 14, 2024 Updated 2 years ago
- ☆30 Feb 18, 2025 Updated last year
- Samples of good AI generated CUDA kernels ☆103 May 30, 2025 Updated 10 months ago
- The RAG pipeline for optimizing dynamic data editing. ☆20 Oct 30, 2025 Updated 5 months ago
- Fully local, temporally aware natural language file search on your PC! Even without a GPU. Find relevant files using natural language i… ☆183 Updated this week
- Implements harmful/harmless refusal removal using pure HF Transformers ☆22 May 8, 2025 Updated 11 months ago
- *NIX SHELL with Local AI/LLM integration ☆26 Feb 26, 2025 Updated last year
- Run LLaMA inference on CPU, with Rust 🦀🚀🦙 ☆34 Jan 5, 2025 Updated last year
- Rewritten frontend for SillyTavern ☆70 Feb 28, 2026 Updated last month
- Automate things, visualize your flows. ☆39 Jan 16, 2026 Updated 2 months ago
- ☆24 Mar 12, 2026 Updated last month
- Pretty fast, pretty simple K3S clusters for Raspberry Pi ☆14 Jul 25, 2024 Updated last year
- A SillyTavern extension that fixes schizo markdown. Also some HTML/JS stuff. ☆41 Oct 17, 2025 Updated 5 months ago
- A handy plugin for copying requests/responses directly from Burp, some extra magic included. ☆13 Oct 15, 2021 Updated 4 years ago
- A miniaturized version of the Kimi-K2 model optimized for deployment on single H100 GPUs. ☆36 Jul 16, 2025 Updated 8 months ago
- LLM FX: An LLM Server Desktop Client free for everyone! ☆38 Mar 8, 2026 Updated last month
- Embedding Inversion via Conditional Masked Diffusion: recover original text from embedding vectors using parallel denoising. Live demo + … ☆49 Mar 7, 2026 Updated last month
- A novel media player that allows you to navigate by speaker ☆91 Mar 25, 2026 Updated 2 weeks ago