s4rduk4r / alpaca_lora_4bit_readmeLinks
Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit
☆31Updated 2 years ago
Alternatives and similar repositories for alpaca_lora_4bit_readme
Users that are interested in alpaca_lora_4bit_readme are comparing it to the libraries listed below
Sorting:
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated 2 years ago
- A Qt GUI for large language models☆43Updated last year
- A fork of textgen that kept some things like Exllama and old GPTQ.☆22Updated 10 months ago
- Model REVOLVER, a human in the loop model mixing system.☆33Updated last year
- 4 bits quantization of SantaCoder using GPTQ☆51Updated 2 years ago
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.☆71Updated 2 years ago
- ☆27Updated last year
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆110Updated last year
- Experimental sampler to make LLMs more creative☆31Updated last year
- An Extension for oobabooga/text-generation-webui☆36Updated last year
- ☆73Updated last year
- Simple and fast server for GPTQ-quantized LLaMA inference☆24Updated 2 years ago
- An unsupervised model merging algorithm for Transformers-based language models.☆105Updated last year
- Dynamic parameter modulation for oobabooga's text-generation-webui that adjusts generation parameters to better mirror user affect.☆35Updated last year
- GPT-2 small trained on phi-like data☆66Updated last year
- A KoboldAI-like memory extension for oobabooga's text-generation-webui☆108Updated 7 months ago
- Full finetuning of large language models without large memory requirements☆94Updated last year
- A simple batch file to make the oobabooga one click installer compatible with llama 4bit models and able to run on cuda☆21Updated 2 years ago
- oobaboga -text-generation-webui implementation of wafflecomposite - langchain-ask-pdf-local☆71Updated 2 years ago
- A gradio web UI for running Large Language Models like GPT-J 6B, OPT, GALACTICA, LLaMA, and Pygmalion.☆309Updated last year
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆36Updated 2 years ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆64Updated last year
- Instruct-tuning LLaMA on consumer hardware☆66Updated 2 years ago
- ChatGPT-like Web UI for RWKVstic☆100Updated 2 years ago
- Little AI roleplay program☆58Updated last year
- A discord bot that roleplays!☆149Updated last year
- 5X faster 60% less memory QLoRA finetuning☆21Updated last year
- Collection of various text datasets to assist ML researchers in training or fine-tuning their models☆20Updated 2 years ago
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Updated last year
- ☆16Updated 2 years ago