broncotc / bitsandbytes-rocm
☆37Updated last year
Alternatives and similar repositories for bitsandbytes-rocm:
Users that are interested in bitsandbytes-rocm are comparing it to the libraries listed below
- 8-bit CUDA functions for PyTorch Rocm compatible☆39Updated last year
- 4 bits quantization of LLMs using GPTQ☆48Updated last year
- ☆156Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated last year
- An unsupervised model merging algorithm for Transformers-based language models.☆104Updated 11 months ago
- Efficient 3bit/4bit quantization of LLaMA models☆19Updated last year
- A KoboldAI-like memory extension for oobabooga's text-generation-webui☆108Updated 5 months ago
- C/C++ implementation of PygmalionAI/pygmalion-6b☆56Updated last year
- A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.☆310Updated last year
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit☆31Updated last year
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆65Updated last year
- Conversational Language model toolkit for training against human preferences.☆42Updated 11 months ago
- 4 bits quantization of SantaCoder using GPTQ☆51Updated last year
- ChatGPT-like Web UI for RWKVstic☆100Updated last year
- Text WebUI extension to add clever Notebooks to Chat mode☆139Updated last year
- Train Llama Loras Easily☆31Updated last year
- mikugg is a Frontend for "Generative Visual Novels"☆144Updated last week
- SD-based Anifusion☆17Updated 2 years ago
- 8-bit CUDA functions for PyTorch, ported to HIP for use in AMD GPUs☆49Updated last year
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆111Updated last year
- Falcon LLM ggml framework with CPU and GPU support☆246Updated last year
- Framework agnostic python runtime for RWKV models☆145Updated last year
- Oobabooga extension for Bark TTS☆118Updated last year
- A community list of common phrases generated by GPT and Claude models☆78Updated last year
- Falcon7B + Falcon40B support - in branch falcon40b. Now all good and working. But main action now in https://github.com/cmp-nct/ggllm.cpp☆11Updated last year
- Linux based GDDR6/GDDR6X VRAM temperature reader for NVIDIA RTX 3000/4000 series GPUs.☆97Updated 7 months ago
- ☆32Updated last month
- Traing PRO extension for oobabooga WebUI - recent dev version☆48Updated 2 months ago
- A prompt/context management system☆169Updated last year
- Where we keep our notes about model training runs.☆16Updated 2 years ago