WapaMario63 / GPTQ-for-LLaMa-ROCm
4 bits quantization of LLaMA using GPTQ, ported to HIP for use in AMD GPUs.
☆32Updated last year
Alternatives and similar repositories for GPTQ-for-LLaMa-ROCm:
Users that are interested in GPTQ-for-LLaMa-ROCm are comparing it to the libraries listed below
- DEPRECATED!☆52Updated 10 months ago
- A prompt/context management system☆170Updated last year
- ☆37Updated last year
- An extension for oobabooga's text-generation-webui that adds syntax highlighting to code snippets☆67Updated 10 months ago
- 8-bit CUDA functions for PyTorch, ported to HIP for use in AMD GPUs☆49Updated 2 years ago
- My personal fork of koboldcpp where I hack in experimental samplers.☆45Updated 11 months ago
- ☆156Updated last year
- Text WebUI extension to add clever Notebooks to Chat mode☆139Updated last year
- Creates an Langchain Agent which uses the WebUI's API and Wikipedia to work☆74Updated last year
- A fork of textgen that kept some things like Exllama and old GPTQ.☆22Updated 8 months ago
- A KoboldAI-like memory extension for oobabooga's text-generation-webui☆108Updated 5 months ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆64Updated last year
- Extension for Text Generation Webui based on EdgeGPT, a reverse engineered API of Microsoft's Bing Chat AI☆126Updated last year
- An autonomous AI agent extension for Oobabooga's web ui☆175Updated last year
- Simple monkeypatch to boost AMD Navi 3 GPUs☆38Updated this week
- A simple webui for stable-diffusion.cpp☆25Updated last week
- ☆29Updated last year
- Memoir+ a persona memory extension for Text Gen Web UI.☆197Updated 3 weeks ago
- A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.☆310Updated last year
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit☆31Updated last year
- 4 bits quantization of LLaMa using GPTQ☆130Updated last year
- C/C++ implementation of PygmalionAI/pygmalion-6b☆56Updated 2 years ago
- A community list of common phrases generated by GPT and Claude models☆78Updated last year
- A TavernUI Character extension for oobabooga's Text Generation WebUI☆65Updated 10 months ago
- A simple batch file to make the oobabooga one click installer compatible with llama 4bit models and able to run on cuda☆21Updated 2 years ago
- An extension to Oobabooga to add a simple memory function for chat☆24Updated last year
- ☆60Updated this week
- Dynamic parameter modulation for oobabooga's text-generation-webui that adjusts generation parameters to better mirror user affect.☆35Updated last year
- Falcon LLM ggml framework with CPU and GPU support☆246Updated last year
- Use local llama LLM or openai to chat, discuss/summarize your documents, youtube videos, and so on.☆152Updated 4 months ago