A dedicated effort to make an optimized, bleeding edge vLLM image using Docker to support DGX comprehensively
☆114Feb 22, 2026Updated 3 months ago
Alternatives and similar repositories for dgx-vllm
Users that are interested in dgx-vllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Kiwix ZIM-to-vector RAG system for local, offline LLM knowledge retrieval☆20Mar 24, 2026Updated 2 months ago
- ☆25Oct 13, 2025Updated 7 months ago
- Mixed-precision quantization for LLMs. Every layer refracts into a different format based on its sensitivity. Native compressed-tensors e…☆72Updated this week
- ☆32Jan 2, 2026Updated 4 months ago
- Professional desktop app for converting text to audiobooks with local TTS☆33Oct 6, 2025Updated 7 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆45Jan 26, 2026Updated 4 months ago
- ☆14Mar 8, 2025Updated last year
- A tool-call based memory system for SillyTavern☆36Dec 30, 2025Updated 4 months ago
- Loader extension for tabbyAPI in SillyTavern☆26Jun 30, 2025Updated 10 months ago
- Jekyll theme for translated articles☆13Apr 3, 2019Updated 7 years ago
- Implements harmful/harmless refusal removal using pure HF Transformers☆21May 8, 2025Updated last year
- ☆30Feb 18, 2025Updated last year
- ICML2019 Accepted Paper. Overcoming Multi-Model Forgetting☆14Jun 5, 2019Updated 6 years ago
- Rewritten frontend for SillyTavern☆74Feb 28, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Automate things, visualize your flows.☆39Jan 16, 2026Updated 4 months ago
- ☆49May 20, 2025Updated last year
- ☆19Jul 9, 2021Updated 4 years ago
- Code repository for ICLR 2025 paper "LeanQuant: Accurate and Scalable Large Language Model Quantization with Loss-error-aware Grid"☆28Mar 2, 2025Updated last year
- 152 open-source tools to run LLMs 100% locally – no cloud, no API keys, no censorship☆74Nov 30, 2025Updated 5 months ago
- Easy MCP (Model Context Protocol) servers and AI agents, defined as YAML.☆19Dec 9, 2025Updated 5 months ago
- this repo holds the software and hardware files for my animatronic eye project☆59Nov 5, 2024Updated last year
- CLI tool that applies an ASCII filter to video or image.☆13Jun 20, 2023Updated 2 years ago
- Python module to help in exploitation of the FILE structure in C☆27Dec 2, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- MCP Server for RSS, Atom, and JSON Feeds☆25May 17, 2026Updated last week
- Python code implementing the algorithm designed by Mueen at UC Riverside. The description of the paper can be found in the paper - "Searc…☆13Oct 13, 2014Updated 11 years ago
- 🖼️ A program that makes a photo mosaic out of any image.☆11Apr 25, 2021Updated 5 years ago
- Smart code context extractor for AI assistants☆22Apr 12, 2026Updated last month
- A SillyTavern extension that helps you to make decisions about the story. It could give an idea.☆70Oct 26, 2025Updated 7 months ago
- A modern desktop application built with Tauri 2.0 for creating professional audiobooks using advanced text-to-speech and voice cloning te…☆86Jan 4, 2026Updated 4 months ago
- " End-to-End Efficient Representation Learning via Cascading Combinatorial Optimization" accepted at CVPR2019☆23May 10, 2019Updated 7 years ago
- ☆23Jun 8, 2019Updated 6 years ago
- Docker configuration for running VLLM on dual DGX Sparks☆1,426Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A local MCP server providing tools for exploring code change history and developer insights.☆28Jul 24, 2025Updated 10 months ago
- Creating diff that supports wildcard produced by LLMs☆16Sep 18, 2024Updated last year
- Orchestrator Kit for Agentic Reasoning - OrKa is a modular AI orchestration system that transforms Large Language Models (LLMs) into comp…☆96Apr 12, 2026Updated last month
- Video generator from youtube guitar tabs tutorials☆14Dec 28, 2022Updated 3 years ago
- Use twitter to get live and trending stock sentiment!☆15Aug 20, 2024Updated last year
- Optimized FP16/BF16 x FP4 GPU kernels for AMD GPUs☆53May 9, 2026Updated 2 weeks ago
- A system prompt approach enabling Large Language Models (LLMs) to perform post-response reasoning without additional training☆17Feb 13, 2025Updated last year