xor2k / gpu_undervoltLinks
☆49Updated 2 years ago
Alternatives and similar repositories for gpu_undervolt
Users that are interested in gpu_undervolt are comparing it to the libraries listed below
Sorting:
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆88Updated last week
- Linux based GDDR6/GDDR6X VRAM temperature reader for NVIDIA RTX 3000/4000 series GPUs.☆109Updated 9 months ago
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆81Updated last week
- Core, Junction, and VRAM temperature reader for Linux + GDDR6/GDDR6X GPUs☆67Updated 3 months ago
- ☆90Updated last month
- GPU Power and Performance Manager☆66Updated last year
- Prometheus exporter for Linux based GDDR6/GDDR6X VRAM and GPU Core Hot spot temperature reader for NVIDIA RTX 3000/4000 series GPUs.☆24Updated last year
- A daemon that automatically manages the performance states of NVIDIA GPUs.☆110Updated 3 months ago
- Experimental LLM Inference UX to aid in creative writing☆128Updated last year
- LlamaTor: Decentralized AI model sharing via BitTorrent for efficient, user-friendly distribution and collaboration.☆56Updated last year
- Run stable-diffusion-webui with Radeon RX 580 8GB on Ubuntu 22.04.2 LTS☆68Updated 2 years ago
- Easily view and modify JSON datasets for large language models☆87Updated 8 months ago
- No-messing-around sh client for llama.cpp's server☆30Updated last year
- "Pacha" TUI (Text User Interface) is a JavaScript application that utilizes the "blessed" library. It serves as a frontend for llama.cpp …☆36Updated 2 years ago
- A community list of common phrases generated by GPT and Claude models☆79Updated 2 years ago
- My personal fork of koboldcpp where I hack in experimental samplers.☆44Updated last year
- Self-contained, minimalistic implementation of a language model that generates coherent and normal sounding names. It uses an input datas…☆51Updated last year
- Stable Diffusion and Flux in pure C/C++☆24Updated this week
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆125Updated 2 years ago
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Updated 2 years ago
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆109Updated 2 years ago
- A frontend for creative writing with LLMs☆146Updated last year
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆165Updated last year
- An unsupervised model merging algorithm for Transformers-based language models.☆108Updated last year
- 5X faster 60% less memory QLoRA finetuning☆21Updated last year
- Writing Extension for Text Generation WebUI☆64Updated 6 months ago
- Text WebUI extension to add clever Notebooks to Chat mode☆146Updated 6 months ago
- ☆22Updated last year
- Web UI for ExLlamaV2☆513Updated last year
- Like system requirements lab but for LLMs☆31Updated 2 years ago