amrrs / llama-4bit-colab
llama-4bit-colab
☆64 · Updated 2 years ago
Alternatives and similar repositories for llama-4bit-colab
Users who are interested in llama-4bit-colab are comparing it to the libraries listed below.
- Code for OpenAI Whisper Web App Demo ☆93 · Updated 2 years ago
- Patch for MPT-7B which allows using and training a LoRA ☆58 · Updated 2 years ago
- ☆64 · Updated 2 years ago
- ☆61 · Updated 2 years ago
- Instruct-tuning LLaMA on consumer hardware ☆65 · Updated 2 years ago
- Reimplementation of the task generation part from the Alpaca paper ☆118 · Updated 2 years ago
- A discord bot that roleplays! ☆148 · Updated last year
- A Simple Discord Bot for the Alpaca LLM ☆100 · Updated last year
- Simple Annotated implementation of GPT-NeoX in PyTorch ☆110 · Updated 2 years ago
- A collection of simple transformer based chatbots. ☆18 · Updated 2 years ago
- ☆104 · Updated last year
- Command-line script for inferencing from models such as MPT-7B-Chat ☆101 · Updated last year
- 4 bits quantization of LLaMa using GPTQ ☆129 · Updated 2 years ago
- ChatGPT API Usage using LangChain, LlamaIndex, Guardrails, AutoGPT and more ☆124 · Updated 9 months ago
- ☆32 · Updated 2 years ago
- The Next Generation Multi-Modality Superintelligence ☆70 · Updated 9 months ago
- The first AI artist ☆32 · Updated 2 years ago
- 4 bits quantization of SantaCoder using GPTQ ☆50 · Updated last year
- Exploring finetuning public checkpoints on filter 8K sequences on Pile ☆114 · Updated 2 years ago
- ☆34 · Updated 2 years ago
- Conversational Language model toolkit for training against human preferences. ☆42 · Updated last year
- ☆122 · Updated last year
- Where we keep our notes about model training runs. ☆16 · Updated 2 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs ☆78 · Updated last year
- ☆46 · Updated 2 years ago
- Inference script for Meta's LLaMA models using Hugging Face wrapper ☆110 · Updated 2 years ago
- Command-line script for inferencing from models such as falcon-7b-instruct ☆74 · Updated 2 years ago
- A multi-agent mind implemented using LLMs engaged in ongoing conversation ☆26 · Updated 2 years ago
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts ☆109 · Updated last year
- fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backe… ☆408 · Updated 2 years ago