broncotc / bitsandbytes-rocmLinks

☆37

Alternatives and similar repositories for bitsandbytes-rocm

Users that are interested in bitsandbytes-rocm are comparing it to the libraries listed below

Sorting:

arlo-phoenix / bitsandbytes-rocm-5.6
8-bit CUDA functions for PyTorch Rocm compatible
☆41Updated last year
johnsmith0031 / alpaca_lora_4bit
☆534Updated last year
olealgoritme / gddr6
Linux based GDDR6/GDDR6X VRAM temperature reader for NVIDIA RTX 3000/4000 series GPUs.
☆104Updated 6 months ago
wawawario2 / long_term_memory
A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.
☆308Updated 2 years ago
cmp-nct / ggllm.cpp
Falcon LLM ggml framework with CPU and GPU support
☆247Updated last year
shawwn / llama
Inference code for LLaMA models
☆189Updated 2 years ago
0cc4m / KoboldAI
☆156Updated 2 years ago
eugenepentland / landmark-attention-qlora
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123Updated 2 years ago
pointnetwork / point-alpaca
☆403Updated 2 years ago
harrisonvanderbyl / rwkv-cpp-accelerated
A torchless, c++ rwkv implementation using 8bit quantization, written in cuda/hip/vulkan for maximum compatibility and minimum dependenci…
☆313Updated last year
0cc4m / GPTQ-for-LLaMa
4 bits quantization of LLMs using GPTQ
☆49Updated 2 years ago
henk717 / KoboldAI
KoboldAI is generative AI software optimized for fictional use, but capable of much more!
☆417Updated 10 months ago
RedTopper / Text-Generation-Webui-Podman
Generate Large Language Model text in a container.
☆19Updated 2 years ago
NolanoOrg / cformers
SoTA Transformers with C-backend for fast inference on your CPU.
☆308Updated last year
darkhemic / stable-diffusion-cpuonly
a fork that installs runs on pytorch cpu-only
☆214Updated 2 years ago
AlpinDale / RPTQ-for-LLaMA
Efficient 3bit/4bit quantization of LLaMA models
☆19Updated 2 years ago
venuatu / llama
Inference code for LLaMA models
☆46Updated 2 years ago
hizkifw / WebChatRWKVstic
ChatGPT-like Web UI for RWKVstic
☆100Updated 2 years ago
aigoopy / llm-jeopardy
Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts
☆109Updated 2 years ago
AlpinDale / pygmalion.cpp
C/C++ implementation of PygmalionAI/pygmalion-6b
☆56Updated 2 years ago
PygmalionAI / gradio-ui
Prototype UI for chatting with the Pygmalion models.
☆235Updated 2 years ago
agrocylo / bitsandbytes-rocm
8-bit CUDA functions for PyTorch, ported to HIP for use in AMD GPUs
☆51Updated 2 years ago
bjoernpl / llama_gradio_interface
Inference code for LLaMA models with Gradio Interface and rolling generation like ChatGPT
☆48Updated 2 years ago
harrisonvanderbyl / rwkvstic
Framework agnostic python runtime for RWKV models
☆146Updated 2 years ago
l1na-forever / stable-diffusion-rocm-docker
Stable Diffusion Docker image preconfigured for usage with AMD Radeon cards
☆140Updated last year
wsippel / bark_tts
Oobabooga extension for Bark TTS
☆119Updated last year
xuhuisheng / rocm-build
build scripts for ROCm
☆188Updated last year
oobabooga / GPTQ-for-LLaMa
4 bits quantization of LLaMa using GPTQ
☆130Updated 2 years ago
EGjoni / DRUGS
Stop messing around with finicky sampling parameters and just use DRµGS!
☆358Updated last year
kaiokendev / superbig
A prompt/context management system
☆170Updated 2 years ago