runpod / runpod-pythonLinks
π | Python library for RunPod API and serverless worker SDK.
β232Updated last week
Alternatives and similar repositories for runpod-python
Users that are interested in runpod-python are comparing it to the libraries listed below
Sorting:
- π§° | RunPod CLI for pod managementβ313Updated 4 months ago
- Starting point to build your own custom serverless endpointβ107Updated 3 weeks ago
- π³ | Dockerfiles for the RunPod container images used for our official templates.β187Updated 3 weeks ago
- A curated list of amazing RunPod projects, libraries, and resourcesβ112Updated 9 months ago
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.β318Updated 3 weeks ago
- Examples of models deployable with Trussβ174Updated this week
- Automatic1111 serverless worker.β89Updated 2 weeks ago
- βοΈ | REPLACED BY https://github.com/runpod-workers | Official set of serverless worker provided by RunPod as endpoints.β57Updated 2 weeks ago
- RunPod Serverless Worker for Oobabooga Text Generation API for LLMsβ2Updated last year
- A fast batching API to serve LLM modelsβ181Updated last year
- TheBloke's Dockerfilesβ303Updated last year
- The code we currently use to fine-tune models.β114Updated last year
- A pipeline parallel training script for LLMs.β147Updated last month
- β52Updated last year
- LoRA inference model packaged with Cogβ74Updated last year
- Low-Rank adapter extraction for fine-tuned transformers modelsβ171Updated last year
- LoRA training model packaged with Cogβ115Updated last year
- A collection of cog models for use on Replicateβ23Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMsβ78Updated last year
- β83Updated last year
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.β115Updated last year
- Templating language for generating prompts for text to image generators such as Stable Diffusionβ139Updated 9 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.β255Updated 3 months ago
- A prompt/context management systemβ170Updated 2 years ago
- An unsupervised model merging algorithm for Transformers-based language models.β104Updated last year
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.β37Updated last year
- Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x fastβ¦β266Updated 7 months ago
- Gradio UI for a Cog APIβ66Updated last year
- β114Updated 5 months ago
- Some models defined with Cog to show you how it worksβ161Updated last week