jjziets / vasttools
My Swiss Army knife for the vast.ai service
☆134 · Updated 3 months ago
Alternatives and similar repositories for vasttools
Users who are interested in vasttools are comparing it to the repositories listed below.
- Collection of tools I've made for hosts of vast.ai ☆15 · Updated 3 years ago
- Vast.ai python and cli api client ☆142 · Updated this week
- Prometheus Grafana nvidia gpu monitoring systems ☆28 · Updated 6 months ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights. ☆64 · Updated last year
- Running SXM2/SXM3/SXM4 NVidia data center GPUs in consumer PCs ☆106 · Updated last year
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit ☆31 · Updated last year
- ☆54 · Updated last year
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts ☆111 · Updated last year
- Linux based GDDR6/GDDR6X VRAM temperature reader for NVIDIA RTX 3000/4000 series GPUs. ☆99 · Updated 3 weeks ago
- Improving transcription performance of OpenAI Whisper for CPU based deployment ☆243 · Updated 2 years ago
- A curated list of amazing RunPod projects, libraries, and resources ☆112 · Updated 8 months ago
- A prompt/context management system ☆170 · Updated 2 years ago
- 4 bits quantization of SantaCoder using GPTQ ☆51 · Updated last year
- 🐳 | Dockerfiles for the RunPod container images used for our official templates. ☆181 · Updated 2 weeks ago
- Run hugging face spaces locally with one command! ☆58 · Updated 2 years ago
- 4 bits quantization of LLMs using GPTQ ☆49 · Updated last year
- Framework agnostic python runtime for RWKV models ☆146 · Updated last year
- Wheels for llama-cpp-python compiled with cuBLAS support ☆97 · Updated last year
- ☆26 · Updated 2 years ago
- Pipeline is an open source python SDK for building AI/ML workflows ☆133 · Updated 7 months ago
- RunPod Serverless Worker for Oobabooga Text Generation API for LLMs ☆2 · Updated 11 months ago
- ☆1 · Updated 2 months ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆123 · Updated last year
- ⚙️ | REPLACED BY https://github.com/runpod-workers | Official set of serverless worker provided by RunPod as endpoints. ☆58 · Updated last year
- XTTSv2 Extension for oobabooga text-generation-webui ☆153 · Updated last year
- 💬 Chatbot web app + HTTP and Websocket endpoints for LLM inference with the Petals client ☆311 · Updated last year
- Performant and accurate speech recognition built on Pytorch ☆253 · Updated 2 years ago
- faster-whisper as serverless endpoint ☆97 · Updated last week
- ☆41 · Updated last year
- simple prompt script to convert hf/ggml files to gguf, and to quantize ☆26 · Updated last year