☆50Oct 10, 2023Updated 2 years ago
Alternatives and similar repositories for exllama-runpod-serverless
Users that are interested in exllama-runpod-serverless are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆17Aug 18, 2023Updated 2 years ago
- ☆54Jun 11, 2023Updated 3 years ago
- TheBloke's Dockerfiles☆309Mar 8, 2024Updated 2 years ago
- The Runpod worker template for serving our large language model endpoints. Powered by vLLM.☆453Jun 12, 2026Updated last week
- ☆19Dec 2, 2025Updated 6 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- RunPod Serverless Worker for the Automatic1111 Stable Diffusion API☆15Feb 16, 2026Updated 4 months ago
- A curated list of amazing Runpod projects, libraries, and resources☆130Apr 4, 2026Updated 2 months ago
- RunPod Serverless Worker for Real-ESRGAN Restoration and Upscaling☆13Feb 13, 2026Updated 4 months ago
- A guidance compatibility layer for llama-cpp-python☆37Sep 11, 2023Updated 2 years ago
- This repository contains a Python implementation that allows you to use gorilla-llm/gorilla-openfunctions-v2 LLM to perform function call…☆17Apr 7, 2024Updated 2 years ago
- 🐍 | Python library for Runpod API and serverless worker SDK.☆299Jun 5, 2026Updated 2 weeks ago
- Next Auth IORedisAdapter example☆21Feb 21, 2023Updated 3 years ago
- ☆14Mar 30, 2024Updated 2 years ago
- Legal document analysis using BERT and FlanT5☆16Aug 21, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The public repository for developers to build and publish apps on the Awiros AppStack☆12Dec 11, 2022Updated 3 years ago
- (WIP) Foundry scripting template for deploying contracts to deterministic addresses on any network☆31Jul 9, 2022Updated 3 years ago
- Lennard-Jones Molecular Dynamics for beginners☆15Sep 20, 2021Updated 4 years ago
- ☆16Feb 15, 2023Updated 3 years ago
- interact with Runpod via the cli☆414Updated this week
- Design and implementation of a threshold-cryptography library☆24Jan 24, 2025Updated last year
- ☆22Jul 25, 2023Updated 2 years ago
- Opinionated Langchain setup with Qdrant vector store and Kong gateway☆33Apr 7, 2023Updated 3 years ago
- An application that helps you summarize your meetings in real time using OpenAI's ChatGPT APIs.☆12Mar 14, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆16Jul 12, 2024Updated last year
- An MCP tool server that provides stateful, TUI-compatible terminal sessions.☆15Feb 3, 2025Updated last year
- WWDC slackbot in Golang for http://asciiwwdc.com☆11Apr 21, 2016Updated 10 years ago
- Host for WebGL NZ:P builds.☆25Jun 14, 2026Updated last week
- Pytorch implementation of Ex3 : Automatic Novel Writing by Extracting, Excelsior and Expanding which proposes a framework to automaticall…☆20Sep 1, 2024Updated last year
- ☆12Dec 13, 2023Updated 2 years ago
- sync put.io to a local directory☆19Apr 3, 2017Updated 9 years ago
- Design of Optimal PID Controller for the Speed Control of DC Motor by Using Artificial Neural Network.☆18Aug 9, 2023Updated 2 years ago
- ☆12Aug 15, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- This repository contains the code snippets used in "LLM Prompt Engineering For Developers"☆14Apr 22, 2024Updated 2 years ago
- Immersed boundary fractional step method written in FORTRAN 90☆17Aug 14, 2012Updated 13 years ago
- Docker image for Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation☆11Apr 14, 2024Updated 2 years ago
- 1-click launcher for AUTOMATIC1111/stable-diffusion-webui with full SDXL 1.0 support.☆24Nov 6, 2023Updated 2 years ago
- An offline CPU-first low-resource chat application to perform RAG on your corpus of data. Powered by OpenChat and CTranslate2.☆15May 14, 2025Updated last year
- Implementation of StyleTTS for Mandarin☆11Jun 22, 2023Updated 2 years ago
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago