WapaMario63 / GPTQ-for-LLaMa-ROCmLinks
4 bits quantization of LLaMA using GPTQ, ported to HIP for use in AMD GPUs.
☆32Updated 2 years ago
Alternatives and similar repositories for GPTQ-for-LLaMa-ROCm
Users that are interested in GPTQ-for-LLaMa-ROCm are comparing it to the libraries listed below
Sorting:
- DEPRECATED!☆50Updated last year
- AMD (Radeon GPU) ROCm based setup for popular AI tools on Ubuntu 24.04.1☆216Updated last month
- Web UI for ExLlamaV2☆514Updated 10 months ago
- Falcon LLM ggml framework with CPU and GPU support☆248Updated last year
- Multi AMD GPU Setup for AI Development on Ubuntu with ROCM☆43Updated 2 months ago
- A prompt/context management system☆170Updated 2 years ago
- A fork of textgen that kept some things like Exllama and old GPTQ.☆22Updated last year
- An autonomous AI agent extension for Oobabooga's web ui☆173Updated 2 years ago
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆109Updated 2 years ago
- Lord of LLMS☆293Updated 3 months ago
- ☆157Updated 2 years ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆124Updated 2 years ago
- An extension for oobabooga/text-generation-webui that enables the LLM to search the web☆274Updated last month
- A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.☆309Updated 2 years ago
- TheBloke's Dockerfiles☆308Updated last year
- Erudito: Easy API/CLI to ask questions about your documentation☆99Updated 2 years ago
- A manual for helping using tesla p40 gpu☆139Updated last year
- A multimodal, function calling powered LLM webui.☆217Updated last year
- Creates an Langchain Agent which uses the WebUI's API and Wikipedia to work☆74Updated 2 years ago
- A free AI text generation interface based on KoboldAI☆33Updated last year
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆64Updated 2 years ago
- Experimental LLM Inference UX to aid in creative writing☆127Updated last year
- Prometheus exporter for Linux based GDDR6/GDDR6X VRAM and GPU Core Hot spot temperature reader for NVIDIA RTX 3000/4000 series GPUs.☆24Updated last year
- AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models☆168Updated last year
- A fast batching API to serve LLM models☆189Updated last year
- ☆535Updated 2 years ago
- Memoir+ a persona memory extension for Text Gen Web UI.☆223Updated last month
- A community list of common phrases generated by GPT and Claude models☆79Updated 2 years ago
- ☆48Updated 2 years ago
- 8-bit CUDA functions for PyTorch, ported to HIP for use in AMD GPUs☆52Updated 2 years ago