amrrs / llama-4bit-colabLinks
llama-4bit-colab
☆63Updated 2 years ago
Alternatives and similar repositories for llama-4bit-colab
Users that are interested in llama-4bit-colab are comparing it to the libraries listed below
Sorting:
- React app implementing OpenAI and Google APIs to re-create behavior of the toolformer paper.☆233Updated 2 years ago
- Instruct-tuning LLaMA on consumer hardware☆66Updated 2 years ago
- A discord bot that roleplays!☆151Updated 2 years ago
- ☆123Updated 2 years ago
- ☆64Updated 2 years ago
- ☆14Updated 2 years ago
- 4 bits quantization of SantaCoder using GPTQ☆51Updated 2 years ago
- ☆103Updated 2 years ago
- 4 bits quantization of LLaMa using GPTQ☆131Updated 2 years ago
- Conversational Language model toolkit for training against human preferences.☆42Updated last year
- Command-line script for inferencing from models such as MPT-7B-Chat☆100Updated 2 years ago
- ☆22Updated 2 years ago
- minichatgpt - To Train ChatGPT In 5 Minutes☆169Updated 2 years ago
- ☆62Updated 2 years ago
- Code and documentation to train Stanford's Alpaca models, and generate the data.☆110Updated 2 years ago
- Framework agnostic python runtime for RWKV models☆147Updated 2 years ago
- Fine-tuning 6-Billion GPT-J (& other models) with LoRA and 8-bit compression☆68Updated 3 years ago
- 📖 — Notebooks related to RWKV☆58Updated 2 years ago
- Drop in replacement for OpenAI, but with Open models.☆153Updated 2 years ago
- Reimplementation of the task generation part from the Alpaca paper☆119Updated 2 years ago
- Instruct-tune LLaMA on consumer hardware with shareGPT data☆126Updated 2 years ago
- A collection of simple transformer based chatbots.☆18Updated 3 years ago
- Command-line script for inferencing from models such as falcon-7b-instruct☆75Updated 2 years ago
- ☆276Updated 2 years ago
- Extend the original llama.cpp repo to support redpajama model.☆118Updated last year
- A Simple Discord Bot for the Alpaca LLM☆99Updated 2 years ago
- TTS with The Massively Multilingual Speech (MMS) project☆231Updated last year
- Instruct-tune LLaMA on consumer hardware☆72Updated 2 years ago
- Code for OpenAI Whisper Web App Demo☆93Updated 3 years ago
- fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backe…☆412Updated 2 years ago