jllllll / llama-cpp-python-cuBLAS-wheels

Wheels for llama-cpp-python compiled with cuBLAS support

☆98

Alternatives and similar repositories for llama-cpp-python-cuBLAS-wheels

Users who are interested in llama-cpp-python-cuBLAS-wheels are comparing it to the repositories listed below.

oobabooga / GPTQ-for-LLaMa
4-bit quantization of LLaMa using GPTQ
☆130 Updated 2 years ago
FartyPants / Playground
Text WebUI extension to add clever Notebooks to Chat mode
☆142 Updated this week
theubie / complex_memory
A KoboldAI-like memory extension for oobabooga's text-generation-webui
☆108 Updated 9 months ago
kaiokendev / superbig
A prompt/context management system
☆170 Updated 2 years ago
jllllll / exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
☆64 Updated last year
kanttouchthis / text_generation_webui_xtts
XTTSv2 Extension for oobabooga text-generation-webui
☆155 Updated last year
GiusTex / EdgeGPT
Extension for Text Generation Webui based on EdgeGPT, a reverse engineered API of Microsoft's Bing Chat AI
☆123 Updated last year
Trojaner / text-generation-webui-stable_diffusion
Integrate image generation capabilities to text-generation-webui using Stable Diffusion.
☆55 Updated last year
mamei16 / LLM_Web_search
An extension for oobabooga/text-generation-webui that enables the LLM to search the web
☆255 Updated this week
brucepro / Memoir
Memoir+, a persona memory extension for Text Gen Web UI.
☆210 Updated 3 weeks ago
aigoopy / llm-jeopardy
Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts
☆110 Updated 2 years ago
Gryphe / MergeMonster
An unsupervised model merging algorithm for Transformers-based language models.
☆106 Updated last year
danikhan632 / guidance_api
An Extension for oobabooga/text-generation-webui
☆36 Updated 2 years ago
p-e-w / chatbot_clinic
Science-driven chatbot development
☆58 Updated last year
wsippel / bark_tts
Oobabooga extension for Bark TTS
☆119 Updated last year
TheBloke / AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
☆37 Updated last year
ChuloAI / BrainChulo
Harnessing the Memory Power of the Camelids
☆146 Updated last year
DavG25 / text-generation-webui-code_syntax_highlight
An extension for oobabooga's text-generation-webui that adds syntax highlighting to code snippets
☆68 Updated last year
ChobPT / oobaboogas-webui-langchain_agent
Creates a LangChain agent that uses the WebUI's API and Wikipedia to work
☆74 Updated last year
SicariusSicariiStuff / Diffusion_TTS
Diffusion_TTS extension for booga
☆66 Updated last year
turboderp-org / exui
Web UI for ExLlamaV2
☆505 Updated 6 months ago
dibrale / webui_autonomics
Dynamic parameter modulation for oobabooga's text-generation-webui that adjusts generation parameters to better mirror user affect.
☆35 Updated 2 years ago
leafspark / AutoGGUF
Automatically quantize GGUF models
☆190 Updated last week
0cc4m / KoboldAI
☆158 Updated last year
epolewski / EricLLM
A fast batching API to serve LLM models
☆185 Updated last year
ouoertheo / silero-api-server
☆70 Updated last week
Keith-Hon / bitsandbytes-windows
8-bit CUDA functions for PyTorch in Windows 10
☆69 Updated last year
frapastique / frapuse-ai-companion-android-app
This repository represents my final assignment of "Module 3 - Android App Development" at Syntax Institut.
☆28 Updated last year
eugenepentland / landmark-attention-qlora
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123 Updated 2 years ago
jllllll / GPTQ-for-LLaMa-CUDA
A combination of Oobabooga's fork and the main cuda branch of GPTQ-for-LLaMa in a package format.
☆22 Updated last year