vast-ai / vast-cliLinks
Vast.ai python and cli api client
☆159Updated this week
Alternatives and similar repositories for vast-cli
Users that are interested in vast-cli are comparing it to the libraries listed below
Sorting:
- 🐍 | Python library for RunPod API and serverless worker SDK.☆248Updated this week
- 🧰 | RunPod CLI for pod management☆331Updated last month
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆64Updated last year
- ☆121Updated last year
- My swiftsknife for vast.ai service☆145Updated 7 months ago
- 🏥 Health monitor for a Petals swarm☆39Updated last year
- A community list of common phrases generated by GPT and Claude models☆78Updated last year
- ☆63Updated 8 months ago
- Examples of models deployable with Truss☆200Updated this week
- ☆141Updated last year
- inference code for mixtral-8x7b-32kseqlen☆101Updated last year
- 🐳 | Dockerfiles for the RunPod container images used for our official templates.☆202Updated 2 weeks ago
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆365Updated this week
- ☆53Updated 3 weeks ago
- ☆86Updated 2 years ago
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Updated last year
- ☆171Updated 6 months ago
- Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API.☆90Updated 2 years ago
- Pipeline is an open source python SDK for building AI/ML workflows☆137Updated 11 months ago
- Starting point to build your own custom serverless endpoint☆121Updated 3 months ago
- faster-whisper as serverless endpoint☆116Updated 3 months ago
- Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp☆158Updated 4 months ago
- DiffusionWithAutoscaler☆29Updated last year
- Chat to Compose Video☆193Updated last year
- ☆116Updated 8 months ago
- Plug n Play GBNF Compiler for llama.cpp☆27Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆124Updated 2 years ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆37Updated 2 years ago
- GPU prices aggregator for cloud providers☆40Updated this week
- An OpenAI-like LLaMA inference API☆113Updated last year