jjziets / vasttoolsLinks

My swiftsknife for vast.ai service

☆138

Alternatives and similar repositories for vasttools

Users that are interested in vasttools are comparing it to the libraries listed below

Sorting:

vast-ai / vast-cli
Vast.ai python and cli api client
☆152Updated this week
jllllll / exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
☆64Updated last year
mystic-ai / pipeline
Pipeline is an open source python SDK for building AI/ML workflows
☆134Updated 9 months ago
modular-ml / wrapyfi-examples_llama
Inference code for facebook LLaMA models with Wrapyfi support
☆129Updated 2 years ago
runpod / runpod-python
🐍 | Python library for RunPod API and serverless worker SDK.
☆239Updated this week
arrmansa / Basic-UI-for-GPT-J-6B-with-low-vram
A repository to run gpt-j-6b on low vram machines (4.2 gb minimum vram for 2000 token context, 3.5 gb for 1000 token context). Model load…
☆114Updated 3 years ago
krea-ai / prompt-search
☆50Updated 2 years ago
eugenepentland / landmark-attention-qlora
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123Updated 2 years ago
jllllll / llama-cpp-python-cuBLAS-wheels
Wheels for llama-cpp-python compiled with cuBLAS support
☆97Updated last year
kaiokendev / superbig
A prompt/context management system
☆170Updated 2 years ago
aigoopy / llm-jeopardy
Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts
☆110Updated last year
huggingface / api-inference-community
☆171Updated 5 months ago
ashleykleynhans / runpod-worker-oobabooga
RunPod Serverless Worker for Oobabooga Text Generation API for LLMs
☆2Updated last year
bjoernpl / llama_gradio_interface
Inference code for LLaMA models with Gradio Interface and rolling generation like ChatGPT
☆48Updated 2 years ago
serp-ai / LLaMA-8bit-LoRA
Repository for Chat LLaMA - training a LoRA for the LLaMA (1 or 2) models on HuggingFace with 8-bit or 4-bit quantization. Research only.
☆149Updated last year
jina-ai / big_creative_ai
BIG: Back In the Game of Creative AI
☆27Updated 2 years ago
shawwn / llama
Inference code for LLaMA models
☆188Updated 2 years ago
TheBloke / AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
☆38Updated last year
bananaml / serverless-template
☆85Updated 2 years ago
Stability-AI / awesome-stability
Awesome Stability List
☆112Updated 2 years ago
kodxana / Awesome-RunPod
A curated list of amazing RunPod projects, libraries, and resources
☆117Updated 10 months ago
mayank31398 / GPTQ-for-SantaCoder
4 bits quantization of SantaCoder using GPTQ
☆51Updated 2 years ago
Gryphe / MergeMonster
An unsupervised model merging algorithm for Transformers-based language models.
☆105Updated last year
harrisonvanderbyl / rwkvstic
Framework agnostic python runtime for RWKV models
☆147Updated last year
petals-infra / chat.petals.dev
💬 Chatbot web app + HTTP and Websocket endpoints for LLM inference with the Petals client
☆313Updated last year
FrancescoSaverioZuppichini / my-spaces
Run hugging face spaces locally with one command!
☆58Updated 2 years ago
qnguyen3 / hermes-llava
☆54Updated last year
labmlai / neox
Simple Annotated implementation of GPT-NeoX in PyTorch
☆110Updated 2 years ago
shawwn / openai-server
OpenAI API webserver
☆188Updated 3 years ago
Lightning-Universe / DiffusionWithAutoscaler
DiffusionWithAutoscaler
☆29Updated last year