xor2k / gpu_undervoltLinks
☆42Updated 2 years ago
Alternatives and similar repositories for gpu_undervolt
Users that are interested in gpu_undervolt are comparing it to the libraries listed below
Sorting:
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆77Updated 11 months ago
- Prometheus exporter for Linux based GDDR6/GDDR6X VRAM and GPU Core Hot spot temperature reader for NVIDIA RTX 3000/4000 series GPUs.☆23Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated 2 years ago
- Linux based GDDR6/GDDR6X VRAM temperature reader for NVIDIA RTX 3000/4000 series GPUs.☆104Updated 6 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆81Updated last week
- My personal fork of koboldcpp where I hack in experimental samplers.☆46Updated last year
- Core, Junction, and VRAM temperature reader for Linux + GDDR6/GDDR6X GPUs☆55Updated last week
- A proxy that hosts multiple single-model runners such as LLama.cpp and vLLM☆11Updated 5 months ago
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.☆70Updated 2 years ago
- GPU Power and Performance Manager☆60Updated last year
- ☆84Updated 3 weeks ago
- Easily view and modify JSON datasets for large language models☆83Updated 5 months ago
- Text WebUI extension to add clever Notebooks to Chat mode☆143Updated 2 months ago
- Self-contained, minimalistic implementation of a language model that generates coherent and normal sounding names. It uses an input datas…☆51Updated last year
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit☆31Updated 2 years ago
- Experimental LLM Inference UX to aid in creative writing☆123Updated 10 months ago
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆108Updated 2 years ago
- NVIDIA Linux open GPU with P2P support☆66Updated 3 weeks ago
- A community list of common phrases generated by GPT and Claude models☆78Updated last year
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Updated last year
- Stable Diffusion and Flux in pure C/C++☆21Updated this week
- An Extension for oobabooga/text-generation-webui☆36Updated 2 years ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆63Updated 2 years ago
- Simple monkeypatch to boost AMD Navi 3 GPUs☆48Updated 6 months ago
- An OpenAI API compatible images server to generate or manipulate images.☆17Updated 8 months ago
- Falcon LLM ggml framework with CPU and GPU support☆247Updated last year
- An unsupervised model merging algorithm for Transformers-based language models.☆106Updated last year
- A KoboldAI-like memory extension for oobabooga's text-generation-webui☆107Updated last year
- ☆20Updated last year
- Makes llama.cpp easy to use.☆11Updated 5 months ago