amrrs / llama-4bit-colab
llama-4bit-colab
☆65 Updated 2 years ago
Alternatives and similar repositories for llama-4bit-colab:
Users who are interested in llama-4bit-colab are comparing it to the repositories listed below.
- ☆65 Updated 2 years ago
- ☆61 Updated 2 years ago
- Conversational Language model toolkit for training against human preferences. ☆42 Updated last year
- A reverse engineered Python API wrapper for OpenPlayground (nat.dev) ☆76 Updated 2 years ago
- Command-line script for inferencing from models such as falcon-7b-instruct ☆76 Updated last year
- 4 bits quantization of LLaMa using GPTQ ☆130 Updated last year
- LoRA weights for Cerebras-GPT-2.7b finetuned on Alpaca dataset with shorter prompt ☆63 Updated 2 years ago
- Alpaca Lora ☆26 Updated last year
- ☆106 Updated last year
- A Simple Discord Bot for the Alpaca LLM ☆101 Updated last year
- ☆33 Updated last year
- Command-line script for inferencing from models such as MPT-7B-Chat ☆101 Updated last year
- A dead simple way to call the ChatGPT API from your machine ☆70 Updated 2 years ago
- 4 bits quantization of SantaCoder using GPTQ ☆51 Updated last year
- A collection of simple transformer based chatbots. ☆18 Updated 2 years ago
- ☆69 Updated 6 months ago
- Training & Implementation of chatbots leveraging GPT-like architecture with the aitextgen package to enable dynamic conversations. ☆49 Updated 2 years ago
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm. ☆37 Updated last year
- Instruct-tuning LLaMA on consumer hardware ☆66 Updated 2 years ago
- Inference code for LLaMA 2 models ☆30 Updated 9 months ago
- ☆147 Updated last year
- A discord bot that roleplays! ☆148 Updated last year
- Multi-Domain Expert Learning ☆67 Updated last year
- Merge LLM that are split in to parts ☆26 Updated last year
- 📖 — Notebooks related to RWKV ☆59 Updated last year
- Fast inference of Instruct tuned LLaMa on your personal devices. ☆22 Updated 2 years ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio ☆36 Updated last year
- ☆48 Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs ☆78 Updated last year
- ☆122 Updated last year