☆15Dec 22, 2023Updated 2 years ago
Alternatives and similar repositories for runpod-vllm
Users that are interested in runpod-vllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust☆40Aug 2, 2023Updated 2 years ago
- Inference Llama 2 in one file of pure Haskell (A port of llama2.c from Andrej Karpathy)☆14Oct 17, 2025Updated 7 months ago
- AI Gateway Provider for Vercel AI SDK☆34May 29, 2025Updated last year
- ☆11Dec 23, 2023Updated 2 years ago
- Inference Llama 2 in one file of pure C☆13Nov 17, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Inference Llama 2 in one file of pure Cuda☆17Aug 20, 2023Updated 2 years ago
- a version of baby agi using dspy and typed predictors☆16Mar 9, 2024Updated 2 years ago
- ☆20May 30, 2025Updated last year
- Build visualizations live!☆22Jan 5, 2023Updated 3 years ago
- the small distributed language model toolkit; fine-tune state-of-the-art LLMs anywhere, rapidly☆33Oct 19, 2024Updated last year
- A wasmCloud provider for the ollama API☆12Apr 23, 2024Updated 2 years ago
- CMake and other scripts to help build process of FlyEM software☆27Jun 9, 2022Updated 4 years ago
- A simple shortcut to have access to chatgpt anywhere on your computer☆15Mar 26, 2023Updated 3 years ago
- ☆17Feb 2, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆26Mar 6, 2025Updated last year
- Inference Llama 2 in one file of pure C & one file with CUDA☆32Oct 14, 2023Updated 2 years ago
- Simple example showing how to run an entire desktop environment inside of a docker container☆17Sep 14, 2023Updated 2 years ago
- Computer Vision and Machine Learning Jupyter Notebooks for Educational Purposes☆83Nov 7, 2025Updated 7 months ago
- ☆23Dec 30, 2023Updated 2 years ago
- ☆37May 23, 2025Updated last year
- A salesforce library designed to provide idiomatic clojure representations of salesforce data and metadata☆11Jan 14, 2020Updated 6 years ago
- My Langchain Code archive maybe☆24Dec 25, 2023Updated 2 years ago
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.☆55Oct 13, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Clojure library for parsing and seamless working with native C structs/structured byte buffers☆14May 26, 2015Updated 11 years ago
- LLM plugin for models hosted on Replicate☆66Apr 18, 2024Updated 2 years ago
- ☆14Jun 5, 2026Updated last week
- Radix Primitives Cheatsheet☆12Mar 11, 2022Updated 4 years ago
- converts url content into JSON with a simple prefix☆73May 8, 2024Updated 2 years ago
- A full-featured, hackable Next.js AI chatbot built by Vercel but running solely on a VPS, no outside APIs except for LLMs☆12Apr 16, 2024Updated 2 years ago
- Explorations into specification-as-a-value☆42Mar 15, 2013Updated 13 years ago
- Schema-aware JSON compression with millisecond lookups — cut transfer/storage while enabling exists /pos queries. (Demo + wheels; core is…☆24Feb 21, 2026Updated 3 months ago
- All-in-one car management and tuning hub for DIY mechanics and car enthusiasts.☆20Aug 26, 2025Updated 9 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 🛸 A SvelteKit implementation of Hoppscotch.☆12May 3, 2025Updated last year
- LLM Building Blocks for Python Course☆17Nov 17, 2025Updated 6 months ago
- JavaFX micro-framework that follows MVVM Pattern with Google Guice dependency Injection☆11Jan 11, 2022Updated 4 years ago
- Simple and extensible framework for watching and handling file system events using the Java 7 Watch Service API.☆41Jan 10, 2013Updated 13 years ago
- A conversational UI for chatbots using the llama.cpp server☆14May 26, 2025Updated last year
- A simple frontend page to interact with an OpenAI like API☆16Jan 31, 2025Updated last year
- This is a Next.js, Tailwind CSS blogging starter template. Comes out of the box configured with the latest technologies to make technical…☆17Apr 22, 2025Updated last year