amrrs / llama-4bit-colab
llama-4bit-colab
☆65Updated 2 years ago
Alternatives and similar repositories for llama-4bit-colab:
Users that are interested in llama-4bit-colab are comparing it to the libraries listed below
- ☆64Updated 2 years ago
- A Simple Discord Bot for the Alpaca LLM☆101Updated last year
- A discord bot that roleplays!☆148Updated last year
- ☆33Updated 2 years ago
- 4 bits quantization of LLaMa using GPTQ☆130Updated last year
- ChatGPT API Usage using LangChain, LlamaIndex, Guardrails, AutoGPT and more☆125Updated 8 months ago
- Instruct-tuning LLaMA on consumer hardware☆66Updated 2 years ago
- Code for OpenAI Whisper Web App Demo☆93Updated 2 years ago
- 📖 — Notebooks related to RWKV☆59Updated last year
- TTS with The Massively Multilingual Speech (MMS) project☆230Updated 9 months ago
- ☆62Updated 2 years ago
- Command-line script for inferencing from models such as falcon-7b-instruct☆76Updated last year
- Command-line script for inferencing from models such as MPT-7B-Chat☆101Updated last year
- ☆34Updated 2 years ago
- A reverse engineered Python API wrapper for OpenPlayground (nat.dev)☆76Updated 2 years ago
- Conversational Language model toolkit for training against human preferences.☆42Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated last year
- ☆106Updated last year
- Patch for MPT-7B which allows using and training a LoRA☆58Updated last year
- Here is a Google Colab Notebook for fine-tuning Alpaca Lora (within 3 hours with a 40GB A100 GPU)☆38Updated 2 years ago
- ☆122Updated last year
- Reimplementation of the task generation part from the Alpaca paper☆119Updated 2 years ago
- 4 bits quantization of SantaCoder using GPTQ☆51Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆168Updated last year
- Extend the original llama.cpp repo to support redpajama model.☆117Updated 8 months ago
- Train llama with lora on one 4090 and merge weight of lora to work as stanford alpaca.☆51Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆78Updated last year
- Hosting the JSON for the GPT4 Tokenizer☆64Updated 2 years ago
- Create amazing Stable Diffusion prompts with minimal prompt knowledge. A vicuna based prompt engineering tool for stable diffusion☆90Updated last year
- A collection of simple transformer based chatbots.☆18Updated 2 years ago