WapaMario63 / GPTQ-for-LLaMa-ROCmLinks
4 bits quantization of LLaMA using GPTQ, ported to HIP for use in AMD GPUs.
☆32Updated 2 years ago
Alternatives and similar repositories for GPTQ-for-LLaMa-ROCm
Users that are interested in GPTQ-for-LLaMa-ROCm are comparing it to the libraries listed below
Sorting:
- 8-bit CUDA functions for PyTorch, ported to HIP for use in AMD GPUs☆53Updated 2 years ago
- Falcon LLM ggml framework with CPU and GPU support☆249Updated 2 years ago
- Web UI for ExLlamaV2☆513Updated last year
- A prompt/context management system☆168Updated 2 years ago
- A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.☆310Updated 2 years ago
- A fork of textgen that kept some things like Exllama and old GPTQ.☆22Updated last year
- AMD (Radeon GPU) ROCm based setup for popular AI tools on Ubuntu 24.04.1☆217Updated last week
- Wheels for llama-cpp-python compiled with cuBLAS support☆102Updated 2 years ago
- ☆156Updated 2 years ago
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆109Updated 2 years ago
- ☆535Updated 2 years ago
- ☆50Updated 2 years ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆125Updated 2 years ago
- LLM that combines the principles of wizardLM and vicunaLM☆716Updated 2 years ago
- Experimental LLM Inference UX to aid in creative writing☆128Updated last year
- A KoboldAI-like memory extension for oobabooga's text-generation-webui☆108Updated last year
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆64Updated 2 years ago
- A manual for helping using tesla p40 gpu☆142Updated last year
- 8-bit CUDA functions for PyTorch Rocm compatible☆41Updated last year
- My personal fork of koboldcpp where I hack in experimental samplers.☆44Updated last year
- LLaMa retrieval plugin script using OpenAI's retrieval plugin☆323Updated 2 years ago
- Inference on CPU code for LLaMA models☆137Updated 2 years ago
- CHAracter State Management - a generative text adventure☆55Updated 8 months ago
- A simple extension that uses Bark Text-to-Speech for audio output☆11Updated 2 years ago
- Extend the original llama.cpp repo to support redpajama model.☆118Updated last year
- A free AI text generation interface based on KoboldAI☆33Updated last year
- Text WebUI extension to add clever Notebooks to Chat mode☆146Updated 6 months ago
- A community list of common phrases generated by GPT and Claude models☆79Updated 2 years ago
- An autonomous AI agent extension for Oobabooga's web ui☆173Updated 2 years ago
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.☆71Updated 2 years ago