jllllll / llama-cpp-python-cuBLAS-wheels
Wheels for llama-cpp-python compiled with cuBLAS support
☆94Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for llama-cpp-python-cuBLAS-wheels
- An unsupervised model merging algorithm for Transformers-based language models.☆100Updated 6 months ago
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆111Updated last year
- A prompt/context management system☆165Updated last year
- Science-driven chatbot development☆55Updated 6 months ago
- Model REVOLVER, a human in the loop model mixing system.☆33Updated last year
- 4 bits quantization of LLaMa using GPTQ☆130Updated last year
- Text WebUI extension to add clever Notebooks to Chat mode☆133Updated 10 months ago
- A KoboldAI-like memory extension for oobabooga's text-generation-webui☆107Updated 3 weeks ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆124Updated last year
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit☆31Updated last year
- An Extension for oobabooga/text-generation-webui☆36Updated last year
- web search extension for text-generation-webui☆94Updated 8 months ago
- 4 bits quantization of LLMs using GPTQ☆47Updated last year
- Experimental sampler to make LLMs more creative☆30Updated last year
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆66Updated last year
- Merge Transformers language models by use of gradient parameters.☆201Updated 3 months ago
- Low-Rank adapter extraction for fine-tuned transformers model☆162Updated 6 months ago
- ☆150Updated last year
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆145Updated last year
- Web UI for ExLlamaV2☆445Updated last month
- An extension for oobabooga's text-generation-webui that adds syntax highlighting to code snippets☆64Updated 5 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆56Updated 3 months ago
- An extension for oobabooga/text-generation-webui that enables the LLM to search the web using DuckDuckGo☆173Updated this week
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆35Updated last year
- Memoir+ a persona extension for Text Gen Web UI. That includes memory, emotions, command handling and more.☆171Updated last month
- Wheels for llama-cpp-python compiled with cuBLAS support☆18Updated last month
- Integrate image generation capabilities to text-generation-webui using Stable Diffusion.☆51Updated 6 months ago
- 8-bit CUDA functions for PyTorch in Windows 10☆71Updated last year
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub☆155Updated last year