autoscriptlabs / nccl-mesh-pluginLinks
☆83Updated 3 weeks ago
Alternatives and similar repositories for nccl-mesh-plugin
Users that are interested in nccl-mesh-plugin are comparing it to the libraries listed below
Sorting:
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆88Updated last week
- Enhancing LLMs with LoRA☆206Updated 3 months ago
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…☆50Updated 8 months ago
- The DPAB-α Benchmark☆32Updated last year
- InferX: Inference as a Service Platform☆156Updated this week
- ☆62Updated 6 months ago
- Editor with LLM generation tree exploration☆83Updated 11 months ago
- ☆113Updated 3 months ago
- FamilyBench evaluation tool for testing the relational reasoning capabilities of Large Language Models (LLMs).☆40Updated 4 months ago
- Pivotal Token Search☆144Updated last month
- fully local, temporally aware natural language file search on your pc! even without a GPU. find relevant files using natural language i…☆166Updated last month
- Plug-and-play memory for LLMs in 3 lines of code. Add persistent, intelligent, human-like memory and recall to any model in minutes.☆252Updated 3 weeks ago
- LLM powered drawio live editor☆51Updated last month
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work☆282Updated last month
- interactive semantic search demo using Qwen3-0.6B-Embedding in your browser☆55Updated 7 months ago
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13Updated last year
- Generate a llama-quantize command to copy the quantization parameters of any GGUF☆30Updated 2 weeks ago
- High-performance FlashAttention-2 for AMD, Intel, and Apple GPUs. Drop-in replacement for PyTorch SDPA. Triton backend for ROCm (MI300X, …☆146Updated last week
- No-messing-around sh client for llama.cpp's server☆30Updated last year
- Information Processing Evaluation for Large Language Models☆37Updated 2 weeks ago
- ☆440Updated 2 months ago
- Test your local LLMs on the AIME problems☆32Updated 8 months ago
- Orchestrator Kit for Agentic Reasoning - OrKa is a modular AI orchestration system that transforms Large Language Models (LLMs) into comp…☆88Updated 3 weeks ago
- Running Microsoft's BitNet via Electron, React & Astro☆52Updated 4 months ago
- Implements harmful/harmless refusal removal using pure HF Transformers☆18Updated 9 months ago
- ☆51Updated 4 months ago
- ☆64Updated 7 months ago
- RetroChat is a powerful command-line interface for interacting with various AI language models. It provides a seamless experience for eng…☆84Updated 6 months ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search …☆49Updated 5 months ago
- synthetic dataset generation workflow using local file resources for finetuning llms.☆82Updated 4 months ago