generativelabs/exllama-runpod-serverless

generativelabs / exllama-runpod-serverless

☆17

Alternatives and similar repositories for exllama-runpod-serverless

Users who are interested in exllama-runpod-serverless are comparing it to the libraries listed below.

ashleykleynhans / faceswap-api
GPU-accelerated face swapping API with 13 models, CodeFormer restoration, and VRAM-safe serial queue. Powered by insightface and FaceFusi…
☆24 · Updated this week
JohnZolton / snorkle
100% Local Document deep search with LLMs
☆26 · Sep 5, 2024 · Updated last year
ashleykleynhans / runpod-worker-a1111
RunPod Serverless Worker for the Automatic1111 Stable Diffusion API
☆16 · Feb 16, 2026 · Updated 5 months ago
bryanchrist / llama2-70b
Codebase for fine-tuning Llama2 70B to generate math test questions and answers.
☆11 · Aug 30, 2024 · Updated last year
dynamiccreator / lora_scripts
This repo helps to transform text into a better form for lora training
☆12 · Apr 9, 2023 · Updated 3 years ago
OthersideAI / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆12 · Nov 27, 2023 · Updated 2 years ago
sger / wwdc-bot
WWDC slackbot in Golang for http://asciiwwdc.com
☆11 · Apr 21, 2016 · Updated 10 years ago
runpod-workers / worker-template
Starting point to build your own custom serverless endpoint
☆134 · May 9, 2025 · Updated last year
jamescalam / pod-gpt
☆12 · Aug 15, 2023 · Updated 2 years ago
lambdaconcept / openocd
Spen's Official OpenOCD Mirror (no pull requests)
☆12 · Jan 27, 2020 · Updated 6 years ago
ashleykleynhans / rerender-a-video-docker
Docker image for Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
☆11 · Apr 14, 2024 · Updated 2 years ago
Zabriskije / CoreML-VAEs
Ready to use Core ML VAEs in MLMODELC format
☆13 · Dec 25, 2023 · Updated 2 years ago
prabaljainn / Linkedin-Messaging-Bot
An efficient LinkedIn Messaging Bot that Messages a group of interests, YouTube link in Documentation for guidance.
☆12 · Aug 25, 2021 · Updated 4 years ago
bench-ai / benchkit
Pytorch directly integrated to the cloud all through Bench AI!
☆10 · Dec 10, 2023 · Updated 2 years ago
houseofbaud / doug
doug is an ai experiment with openai, llama-cpp-python, and langchain
☆16 · Sep 2, 2025 · Updated 10 months ago
timonmat / ChatObsidian
AI QA interface to your obsidian notes
☆18 · May 31, 2023 · Updated 3 years ago
marcusschiesser / vectorstores
Vectorstores is a framework for using vector databases in your AI applications
☆17 · Mar 16, 2026 · Updated 4 months ago
deepily / genie-in-the-box
Genie in the Box: Distill Whisper STT => Mistral-7B => Phind/Phind-CodeLlama-34B-v2 => GPT 3.5 => Coqui's TTS/OpenAI TTS
☆15 · Apr 24, 2024 · Updated 2 years ago
qwerdf4 / InstantID-swapface-multiple_in_out
Unofficial InstantID,support multiple inputs and outputs, face fusion, swap face.
☆12 · Feb 27, 2024 · Updated 2 years ago
AllAboutAI-YT / ai-engineer-project1
AI Engineer Skills Beginners Project 1 - Chat with YouTube
☆13 · Nov 6, 2023 · Updated 2 years ago
kiri-art / docker-diffusers-api-runpod
Docker template for running docker-diffusers-api on runpod.io
☆13 · Jun 10, 2023 · Updated 3 years ago
agentsea / toolfuse
A common protocol for AI agent tools
☆10 · Oct 21, 2024 · Updated last year
sam-s10s / pipecat-guess-who
Pipecat Guess Who?
☆16 · Aug 1, 2025 · Updated 11 months ago
aptlin / posterlens
A dataset of posters for movies from MovieLens-25M
☆11 · May 17, 2021 · Updated 5 years ago
memsy-io / memsy
Monorepo for SDKs, Connectors and Docs
☆30 · Jul 3, 2026 · Updated 3 weeks ago
IzumiSatoshi / Tune-A-Video
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
☆12 · Feb 23, 2023 · Updated 3 years ago
kj3moraes / movieclip
An experiment with movie scenes and contrastive learning
☆11 · Feb 1, 2025 · Updated last year
Noctem / MonoGen
Automated and Manual creation of PTC accts
☆10 · Sep 24, 2017 · Updated 8 years ago
wtlow003 / investment-advisor-gpt
Your friendly investment advisor has now turned into an LLM chatbot!
☆15 · Mar 24, 2024 · Updated 2 years ago
cocoonlife / s3-log-parse
Tools for parsing s3 logs, either from python or via cli
☆12 · Aug 10, 2023 · Updated 2 years ago
kaalam / Jazz
☆11 · Mar 29, 2026 · Updated 3 months ago
runpod / runpod-python
🐍 | Python library for Runpod API and serverless worker SDK.
☆304 · Updated this week
derekarends / openai-demos
☆10 · Jul 29, 2023 · Updated 2 years ago
20Sunny / chatgpt-clone
☆10 · Oct 1, 2023 · Updated 2 years ago
Raymo111 / voiceprint
Voice biometric authentication PAM module for Linux
☆49 · Sep 18, 2022 · Updated 3 years ago
ashleykleynhans / tts-webui-docker
Docker image for TTS Generation ALL IN ONE
☆20 · May 15, 2026 · Updated 2 months ago
vladmandic / insightface
InsightFace for TFJS
☆12 · Sep 18, 2022 · Updated 3 years ago
j-webtek / Local-LLM_FineTune
Finetune Your Local LLM
☆18 · Sep 23, 2023 · Updated 2 years ago
anthonykasza / beginner_brogramming
scripts to help beginners program in Bro
☆21 · Aug 10, 2013 · Updated 12 years ago