jjziets / vasttoolsLinks
My swiftsknife for vast.ai service
☆138Updated 5 months ago
Alternatives and similar repositories for vasttools
Users that are interested in vasttools are comparing it to the libraries listed below
Sorting:
- Vast.ai python and cli api client☆152Updated this week
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆64Updated last year
- Pipeline is an open source python SDK for building AI/ML workflows☆134Updated 9 months ago
- Inference code for facebook LLaMA models with Wrapyfi support☆129Updated 2 years ago
- 🐍 | Python library for RunPod API and serverless worker SDK.☆239Updated this week
- A repository to run gpt-j-6b on low vram machines (4.2 gb minimum vram for 2000 token context, 3.5 gb for 1000 token context). Model load…☆114Updated 3 years ago
- ☆50Updated 2 years ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated 2 years ago
- Wheels for llama-cpp-python compiled with cuBLAS support☆97Updated last year
- A prompt/context management system☆170Updated 2 years ago
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆110Updated last year
- ☆171Updated 5 months ago
- RunPod Serverless Worker for Oobabooga Text Generation API for LLMs☆2Updated last year
- Inference code for LLaMA models with Gradio Interface and rolling generation like ChatGPT☆48Updated 2 years ago
- Repository for Chat LLaMA - training a LoRA for the LLaMA (1 or 2) models on HuggingFace with 8-bit or 4-bit quantization. Research only.☆149Updated last year
- BIG: Back In the Game of Creative AI☆27Updated 2 years ago
- Inference code for LLaMA models☆188Updated 2 years ago
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.☆38Updated last year
- ☆85Updated 2 years ago
- Awesome Stability List☆112Updated 2 years ago
- A curated list of amazing RunPod projects, libraries, and resources☆117Updated 10 months ago
- 4 bits quantization of SantaCoder using GPTQ☆51Updated 2 years ago
- An unsupervised model merging algorithm for Transformers-based language models.☆105Updated last year
- Framework agnostic python runtime for RWKV models☆147Updated last year
- 💬 Chatbot web app + HTTP and Websocket endpoints for LLM inference with the Petals client☆313Updated last year
- Run hugging face spaces locally with one command!☆58Updated 2 years ago
- ☆54Updated last year
- Simple Annotated implementation of GPT-NeoX in PyTorch☆110Updated 2 years ago
- OpenAI API webserver☆188Updated 3 years ago
- DiffusionWithAutoscaler☆29Updated last year