jllllll / llama-cpp-python-cuBLAS-wheels
Wheels for llama-cpp-python compiled with cuBLAS support
☆95Updated 11 months ago
Alternatives and similar repositories for llama-cpp-python-cuBLAS-wheels:
Users that are interested in llama-cpp-python-cuBLAS-wheels are comparing it to the libraries listed below
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆112Updated last year
- A prompt/context management system☆167Updated last year
- A KoboldAI-like memory extension for oobabooga's text-generation-webui☆107Updated 2 months ago
- Science-driven chatbot development☆56Updated 8 months ago
- Creates an Langchain Agent which uses the WebUI's API and Wikipedia to work☆72Updated last year
- Harnessing the Memory Power of the Camelids☆146Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated last year
- An extension for oobabooga's text-generation-webui that adds syntax highlighting to code snippets☆65Updated 7 months ago
- Extension for Text Generation Webui based on EdgeGPT, a reverse engineered API of Microsoft's Bing Chat AI☆124Updated last year
- Falcon LLM ggml framework with CPU and GPU support☆245Updated 11 months ago
- automatically quant GGUF models☆151Updated this week
- 4 bits quantization of LLaMa using GPTQ☆131Updated last year
- An Extension for oobabooga/text-generation-webui☆36Updated last year
- An unsupervised model merging algorithm for Transformers-based language models.☆101Updated 8 months ago
- Text WebUI extension to add clever Notebooks to Chat mode☆140Updated last year
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆66Updated last year
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit☆31Updated last year
- GPT-2 small trained on phi-like data☆65Updated 11 months ago
- Web UI for ExLlamaV2☆461Updated 3 weeks ago
- A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.☆309Updated last year
- A fast batching API to serve LLM models☆177Updated 8 months ago
- This plugin forces models to output JSON of a specified schema using JSONFormer☆26Updated 2 months ago
- web search extension for text-generation-webui☆99Updated 10 months ago
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆50Updated 2 months ago
- A web search extension for Oobabooga's text-generation-webui (now with nougat)☆69Updated 6 months ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆35Updated last year
- A combination of Oobabooga's fork and the main cuda branch of GPTQ-for-LLaMa in a package format.☆22Updated last year
- An autonomous AI agent extension for Oobabooga's web ui☆176Updated last year
- Implements harmful/harmless refusal removal using pure HF Transformers☆112Updated 7 months ago