WapaMario63 / GPTQ-for-LLaMa-ROCmLinks

4 bits quantization of LLaMA using GPTQ, ported to HIP for use in AMD GPUs.

☆32

Alternatives and similar repositories for GPTQ-for-LLaMa-ROCm

Users that are interested in GPTQ-for-LLaMa-ROCm are comparing it to the libraries listed below

Sorting:

evshiron / rocm_lab
DEPRECATED!
☆50Updated last year
Ph0rk0z / text-generation-webui-testing
A fork of textgen that kept some things like Exllama and old GPTQ.
☆22Updated last year
turboderp-org / exui
Web UI for ExLlamaV2
☆511Updated 8 months ago
cmp-nct / ggllm.cpp
Falcon LLM ggml framework with CPU and GPU support
☆247Updated last year
agrocylo / bitsandbytes-rocm
8-bit CUDA functions for PyTorch, ported to HIP for use in AMD GPUs
☆51Updated 2 years ago
JingShing / How-to-use-tesla-p40
A manual for helping using tesla p40 gpu
☆135Updated 11 months ago
ParisNeo / lollms_legacy
Lord of LLMS
☆294Updated last month
flurb18 / AgentOoba
An autonomous AI agent extension for Oobabooga's web ui
☆173Updated 2 years ago
nktice / AMD-AI
AMD (Radeon GPU) ROCm based setup for popular AI tools on Ubuntu 24.04.1
☆212Updated this week
wawawario2 / long_term_memory
A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.
☆308Updated 2 years ago
kaiokendev / superbig
A prompt/context management system
☆170Updated 2 years ago
johnsmith0031 / alpaca_lora_4bit
☆534Updated last year
aigoopy / llm-jeopardy
Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts
☆108Updated 2 years ago
0cc4m / KoboldAI
☆157Updated 2 years ago
broncotc / bitsandbytes-rocm
☆37Updated 2 years ago
theroyallab / tabbyAPI-gradio-loader
A simple Gradio WebUI for loading/unloading models and loras in tabbyAPI.
☆20Updated 11 months ago
QuixiAI / dolphin-system-messages
Dolphin System Messages
☆353Updated 8 months ago
jllllll / exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
☆63Updated 2 years ago
henk717 / KoboldAI
KoboldAI is generative AI software optimized for fictional use, but capable of much more!
☆417Updated 9 months ago
eugenepentland / landmark-attention-qlora
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123Updated 2 years ago
melodysdreamj / WizardVicunaLM
LLM that combines the principles of wizardLM and vicunaLM
☆716Updated 2 years ago
mzbac / wizardCoder-vsc
Visual Studio Code extension for WizardCoder
☆148Updated 2 years ago
daniandtheweb / sd.cpp-webui
A simple webui for stable-diffusion.cpp
☆47Updated last week
mamei16 / LLM_Web_search
An extension for oobabooga/text-generation-webui that enables the LLM to search the web
☆268Updated this week
flurb18 / babyagi4all-api
BabyAGI to run with locally hosted models using the API from https://github.com/oobabooga/text-generation-webui
☆87Updated 2 years ago
itsme2417 / PolyMind
A multimodal, function calling powered LLM webui.
☆216Updated last year
TheBlokeAI / dockerLLM
TheBloke's Dockerfiles
☆306Updated last year
theubie / complex_memory
A KoboldAI-like memory extension for oobabooga's text-generation-webui
☆107Updated last year
jllllll / llama-cpp-python-cuBLAS-wheels
Wheels for llama-cpp-python compiled with cuBLAS support
☆97Updated last year
xr4dsh / CodeRunner
☆29Updated last year