☆15Dec 22, 2023Updated 2 years ago
Alternatives and similar repositories for runpod-vllm
Users that are interested in runpod-vllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A high-throughput and memory-efficient inference and serving engine for LLMs☆11Sep 4, 2025Updated 8 months ago
- Open-source AI for voice control, rivaling Alexa and Siri☆13Mar 9, 2024Updated 2 years ago
- Send images, captions and text to Telegram channels and DM's from comfyui☆12Apr 22, 2024Updated 2 years ago
- The Swift Programming Language☆12Updated this week
- ☆11Dec 23, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆11Nov 8, 2023Updated 2 years ago
- Android wrapper for Inference Llama 2 in one file of pure C☆18Aug 21, 2023Updated 2 years ago
- [WIP] Transformer to embed Danbooru labelsets☆13Mar 31, 2024Updated 2 years ago
- Inference Llama 2 in one file of pure Cuda☆17Aug 20, 2023Updated 2 years ago
- a version of baby agi using dspy and typed predictors☆16Mar 9, 2024Updated 2 years ago
- ☆20May 30, 2025Updated 11 months ago
- Build visualizations live!☆22Jan 5, 2023Updated 3 years ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Apr 20, 2024Updated 2 years ago
- the small distributed language model toolkit; fine-tune state-of-the-art LLMs anywhere, rapidly☆32Oct 19, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- CMake and other scripts to help build process of FlyEM software☆27Jun 9, 2022Updated 3 years ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆26Mar 6, 2025Updated last year
- Arabic Speech Recognition with Whisper: Fine-tune the Whisper model from OpenAI for Arabic speech recognition tasks. This repository prov…☆21Feb 28, 2024Updated 2 years ago
- Makes all PVH Disposable VM's fully ephemeral☆17Jun 16, 2022Updated 3 years ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆23Mar 12, 2024Updated 2 years ago
- The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github…☆33Feb 21, 2026Updated 3 months ago
- Simple example showing how to run an entire desktop environment inside of a docker container☆17Sep 14, 2023Updated 2 years ago
- Computer Vision and Machine Learning Jupyter Notebooks for Educational Purposes☆83Nov 7, 2025Updated 6 months ago
- ☆23Dec 30, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Beautifully-designed, accessible search components☆37Jul 15, 2025Updated 10 months ago
- Comfy UI in Telegram☆30Nov 28, 2024Updated last year
- A salesforce library designed to provide idiomatic clojure representations of salesforce data and metadata☆11Jan 14, 2020Updated 6 years ago
- My Langchain Code archive maybe☆24Dec 25, 2023Updated 2 years ago
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.☆55Oct 13, 2025Updated 7 months ago
- Paxos in Python, tested with Jepsen☆32Dec 10, 2021Updated 4 years ago
- Give langchain access to the terminal☆33Apr 10, 2023Updated 3 years ago
- LLM plugin for models hosted on Replicate☆66Apr 18, 2024Updated 2 years ago
- Radix Primitives Cheatsheet☆12Mar 11, 2022Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- converts url content into JSON with a simple prefix☆73May 8, 2024Updated 2 years ago
- A full-featured, hackable Next.js AI chatbot built by Vercel but running solely on a VPS, no outside APIs except for LLMs☆12Apr 16, 2024Updated 2 years ago
- Schema-aware JSON compression with millisecond lookups — cut transfer/storage while enabling exists /pos queries. (Demo + wheels; core is…☆24Feb 21, 2026Updated 3 months ago
- All-in-one car management and tuning hub for DIY mechanics and car enthusiasts.☆20Aug 26, 2025Updated 9 months ago
- 🛸 A SvelteKit implementation of Hoppscotch.☆12May 3, 2025Updated last year
- A universal messaging library for cross-platform applications (Chrome extension, Web, Mobile, Iframe,...)☆15Oct 10, 2025Updated 7 months ago
- Interval Treeset based on finger trees☆11Oct 28, 2020Updated 5 years ago