A dedicated effort to make an optimized, bleeding edge vLLM image using Docker to support DGX comprehensively
☆105 · Feb 22, 2026 · Updated 2 months ago
Alternatives and similar repositories for dgx-vllm
Users who are interested in dgx-vllm are comparing it to the repositories listed below.
- ☆108 · Mar 6, 2026 · Updated 2 months ago
- Kiwix ZIM-to-vector RAG system for local, offline LLM knowledge retrieval ☆18 · Mar 24, 2026 · Updated last month
- Learn faster with the power of AI ☆17 · Apr 29, 2026 · Updated last week
- A dynamic multi-expert AI architecture running on a single consumer GPU (RTX 3060). ☆36 · Dec 2, 2025 · Updated 5 months ago
- ☆24 · Oct 13, 2025 · Updated 6 months ago
- ☆31 · Jan 2, 2026 · Updated 4 months ago
- Professional desktop app for converting text to audiobooks with local TTS ☆32 · Oct 6, 2025 · Updated 7 months ago
- ☆29 · Feb 18, 2025 · Updated last year
- Automate things, visualize your flows. ☆39 · Jan 16, 2026 · Updated 3 months ago
- A SillyTavern extension that fixes schizo markdown. Also some HTML/JS stuff. ☆42 · Oct 17, 2025 · Updated 6 months ago
- ☆22 · May 14, 2024 · Updated last year
- 152 open-source tools to run LLMs 100% locally – no cloud, no API keys, no censorship ☆59 · Nov 30, 2025 · Updated 5 months ago
- Easy MCP (Model Context Protocol) servers and AI agents, defined as YAML. ☆19 · Dec 9, 2025 · Updated 4 months ago
- A Keras-based recommendation engine for subreddits, channels on the popular social media site Reddit ☆10 · Feb 24, 2024 · Updated 2 years ago
- this repo holds the software and hardware files for my animatronic eye project ☆59 · Nov 5, 2024 · Updated last year
- MCP Server for RSS, Atom, and JSON Feeds ☆23 · Updated this week
- A SillyTavern extension that allows you to create and play interactive character cards. Want to customize the scenario before starting? … ☆39 · Jul 18, 2025 · Updated 9 months ago
- A modern desktop application built with Tauri 2.0 for creating professional audiobooks using advanced text-to-speech and voice cloning te… ☆84 · Jan 4, 2026 · Updated 4 months ago
- Chrome extension that displays motivational startup quotes ☆11 · Oct 4, 2018 · Updated 7 years ago
- Creating diff that supports wildcard produced by LLMs ☆16 · Sep 18, 2024 · Updated last year
- Fork to run instances from SWE-rebench ☆24 · Apr 22, 2026 · Updated 2 weeks ago
- FastAPI + MLX offline-first voice agent with <1s latency. Minimal UI ☆53 · Oct 21, 2025 · Updated 6 months ago
- A system prompt approach enabling Large Language Models (LLMs) to perform post-response reasoning without additional training ☆17 · Feb 13, 2025 · Updated last year
- Vibe Coded Project Management System ☆21 · Apr 19, 2025 · Updated last year
- LlamaTor: Decentralized AI model sharing via BitTorrent for efficient, user-friendly distribution and collaboration. ☆58 · Jan 5, 2025 · Updated last year
- All components responsible for providing the service to the Reception System. This is the Prodigy Reloaded “Server”. ☆14 · Oct 20, 2025 · Updated 6 months ago
- A suite of Model Context Protocol (MCP) servers designed to enhance AI agent capabilities. Provides tools for media search/understanding … ☆20 · Oct 31, 2025 · Updated 6 months ago
- Intuitive graphical representation of source code ☆14 · Mar 15, 2023 · Updated 3 years ago
- TOON as DSPy adapter ☆26 · Feb 1, 2026 · Updated 3 months ago
- ARCHIVED - Materials for running a Team-Based Inquiry Learning linear algebra course ☆10 · Jul 30, 2024 · Updated last year
- This custom nodes helps to auto download models from huggingface ☆21 · Apr 5, 2025 · Updated last year
- Make Qwen3 Think like Gemini 2.5 Pro | Open webui function ☆25 · May 10, 2025 · Updated 11 months ago
- 🦀 I'm learning Rust and publishing exercises and small projects I've completed! ☆12 · Oct 15, 2024 · Updated last year
- A gui tool written in Dioxus to make it easy to release a workspace of crates to crates.io ☆14 · Feb 22, 2023 · Updated 3 years ago
- Visual card-based snippets for 99 AI agent design patterns. Fork of awesome-agentic-patterns. ☆94 · Jan 8, 2026 · Updated 3 months ago
- ☆15 · Aug 28, 2025 · Updated 8 months ago
- Microprocessor 2 Lab Template ☆11 · Apr 29, 2024 · Updated 2 years ago
- ☆16 · Dec 3, 2024 · Updated last year
- Anonymize people in images and videos using yolov5-crowdhuman. ☆26 · Jan 5, 2022 · Updated 4 years ago