vast-ai / vast-cli
Vast.ai Python and CLI API client
☆166 Updated this week
Alternatives and similar repositories for vast-cli
Users who are interested in vast-cli are comparing it to the libraries listed below:
- My swiftsknife for vast.ai service ☆148 Updated this week
- 🐍 | Python library for RunPod API and serverless worker SDK. ☆254 Updated last week
- 🧰 | RunPod CLI for pod management ☆339 Updated last month
- ☆54 Updated last month
- Examples of models deployable with Truss ☆205 Updated this week
- ☆120 Updated last year
- 🐳 | Dockerfiles for the RunPod container images used for our official templates. ☆208 Updated last week
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM. ☆371 Updated 3 weeks ago
- A template to run LLaMA in Cog ☆66 Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆52 Updated last year
- An OpenAI API compatible LLM inference server based on ExLlamaV2. ☆25 Updated last year
- 🏥 Health monitor for a Petals swarm ☆39 Updated last year
- ☆63 Updated 10 months ago
- ☆116 Updated 10 months ago
- inference code for mixtral-8x7b-32kseqlen ☆102 Updated last year
- ☆141 Updated last year
- Distributed Inference for mlx LLm ☆97 Updated last year
- Run AI models anywhere. https://muna.ai/explore ☆68 Updated last week
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆65 Updated last year
- 💬 Chatbot web app + HTTP and Websocket endpoints for LLM inference with the Petals client ☆315 Updated last year
- Plug n Play GBNF Compiler for llama.cpp ☆27 Updated last year
- Pipeline is an open source python SDK for building AI/ML workflows ☆138 Updated last year
- Web page with political compass quiz results for open LLMs ☆35 Updated last year
- Starting point to build your own custom serverless endpoint ☆124 Updated 5 months ago
- Using langchain, deeplake and openai to create a Q&A on the Mojo lang programming manual ☆22 Updated last year
- Chat to Compose Video ☆195 Updated last year
- DiffusionWithAutoscaler ☆29 Updated last year
- 2D Positional Embeddings for Webpage Structural Understanding 🦙👀 ☆94 Updated last year
- ☆162 Updated 2 months ago
- Command-line script for inferencing from models such as MPT-7B-Chat ☆99 Updated 2 years ago