jllllll / llama-cpp-python-cuBLAS-wheels
Wheels for llama-cpp-python compiled with cuBLAS support
☆96Updated last year
Alternatives and similar repositories for llama-cpp-python-cuBLAS-wheels:
Users that are interested in llama-cpp-python-cuBLAS-wheels are comparing it to the libraries listed below
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆65Updated last year
- A KoboldAI-like memory extension for oobabooga's text-generation-webui☆108Updated 5 months ago
- Science-driven chatbot development☆56Updated 10 months ago
- 4 bits quantization of LLaMa using GPTQ☆130Updated last year
- Text WebUI extension to add clever Notebooks to Chat mode☆139Updated last year
- Extension for Text Generation Webui based on EdgeGPT, a reverse engineered API of Microsoft's Bing Chat AI☆126Updated last year
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆53Updated 5 months ago
- An unsupervised model merging algorithm for Transformers-based language models.☆104Updated 11 months ago
- Model REVOLVER, a human in the loop model mixing system.☆33Updated last year
- A prompt/context management system☆169Updated last year
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆112Updated last year
- Diffusion_TTS extension for booga☆67Updated 9 months ago
- Traing PRO extension for oobabooga WebUI - recent dev version☆48Updated 2 months ago
- An extension for oobabooga's text-generation-webui that adds syntax highlighting to code snippets☆66Updated 9 months ago
- Integrate image generation capabilities to text-generation-webui using Stable Diffusion.☆54Updated 10 months ago
- A fast batching API to serve LLM models☆183Updated 11 months ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated last year
- Creates an Langchain Agent which uses the WebUI's API and Wikipedia to work☆74Updated last year
- ☆156Updated last year
- An Extension for oobabooga/text-generation-webui☆36Updated last year
- oobaboga -text-generation-webui implementation of wafflecomposite - langchain-ask-pdf-local☆70Updated last year
- Dynamic parameter modulation for oobabooga's text-generation-webui that adjusts generation parameters to better mirror user affect.☆35Updated last year
- A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.☆310Updated last year
- 8-bit CUDA functions for PyTorch in Windows 10☆68Updated last year
- The code we currently use to fine-tune models.☆114Updated 10 months ago
- Web UI for ExLlamaV2☆486Updated last month
- Create amazing Stable Diffusion prompts with minimal prompt knowledge. A vicuna based prompt engineering tool for stable diffusion☆90Updated last year
- ☆27Updated last year
- Wheels for llama-cpp-python compiled with cuBLAS support☆21Updated 3 weeks ago
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆65Updated 5 months ago