WapaMario63 / GPTQ-for-LLaMa-ROCm
4-bit quantization of LLaMA using GPTQ, ported to HIP for use on AMD GPUs.
☆32Updated last year
Alternatives and similar repositories for GPTQ-for-LLaMa-ROCm
Users who are interested in GPTQ-for-LLaMa-ROCm are comparing it to the libraries listed below.
- DEPRECATED!☆51Updated last year
- Web UI for ExLlamaV2☆513Updated 7 months ago
- AMD (Radeon GPU) ROCm based setup for popular AI tools on Ubuntu 24.04.1☆210Updated last week
- ☆535Updated last year
- 8-bit CUDA functions for PyTorch, ported to HIP for use in AMD GPUs☆51Updated 2 years ago
- Falcon LLM ggml framework with CPU and GPU support☆247Updated last year
- A community list of common phrases generated by GPT and Claude models☆78Updated last year
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆64Updated last year
- A fast batching API to serve LLM models☆187Updated last year
- ☆37Updated 2 years ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆124Updated 2 years ago
- A manual to help with using the Tesla P40 GPU☆130Updated 10 months ago
- Lord of LLMS☆294Updated this week
- A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.☆309Updated 2 years ago
- TheBloke's Dockerfiles☆307Updated last year
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆110Updated 2 years ago
- ☆158Updated last year
- A prompt/context management system☆170Updated 2 years ago
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.☆71Updated 2 years ago
- A multimodal, function calling powered LLM webui.☆216Updated last year
- Docker configuration for koboldcpp☆36Updated last year
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆129Updated 2 years ago
- 8-bit CUDA functions for PyTorch, ROCm compatible☆41Updated last year
- An autonomous AI agent extension for Oobabooga's web ui☆176Updated 2 years ago
- A fork of textgen that kept some things like Exllama and old GPTQ.☆22Updated last year
- The official API server for Exllama. OAI compatible, lightweight, and fast.☆1,057Updated last month
- An OpenAI API compatible REST server for llama.☆208Updated 7 months ago
- Generate Large Language Model text in a container.☆20Updated 2 years ago
- A simple webui for stable-diffusion.cpp☆40Updated this week
- Merge Transformers language models by use of gradient parameters.☆208Updated last year