lxe / cerebras-lora-alpacaLinks
LoRA weights for Cerebras-GPT-2.7b finetuned on Alpaca dataset with shorter prompt
☆63Updated 2 years ago
Alternatives and similar repositories for cerebras-lora-alpaca
Users that are interested in cerebras-lora-alpaca are comparing it to the libraries listed below
Sorting:
- A simple LangChain-like implementation based on Sentence Embedding+local knowledge base, with Vicuna (FastChat) serving as the LLM. Suppo…☆95Updated 2 years ago
- Train llama with lora on one 4090 and merge weight of lora to work as stanford alpaca.☆52Updated 2 years ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆412Updated 2 years ago
- Instruct-tuning LLaMA on consumer hardware☆65Updated 2 years ago
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA☆301Updated 2 years ago
- ☆457Updated 2 years ago
- This repo contains the data preparation, tokenization, training and inference code for BLOOMChat. BLOOMChat is a 176 billion parameter mu…☆586Updated 2 years ago
- A dataset featuring diverse dialogues between two ChatGPT (gpt-3.5-turbo) instances with system messages written by GPT-4. Covering vario…☆164Updated 2 years ago
- ☆275Updated 2 years ago
- minichatgpt - To Train ChatGPT In 5 Minutes☆169Updated 2 years ago
- Run Alpaca LLM in LangChain☆215Updated last year
- Inference code for facebook LLaMA models with Wrapyfi support☆129Updated 2 years ago
- React app implementing OpenAI and Google APIs to re-create behavior of the toolformer paper.☆233Updated 2 years ago
- Langport is a language model inference service☆94Updated last year
- The data processing pipeline for the Koala chatbot language model☆118Updated 2 years ago
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs…☆184Updated 2 years ago
- Instruct-tune LLaMA on consumer hardware with shareGPT data☆126Updated 2 years ago
- ☆74Updated last year
- A command-line interface to generate textual and conversational datasets with LLMs.☆297Updated 2 years ago
- 4 bits quantization of LLaMa using GPTQ☆130Updated 2 years ago
- LLaMa retrieval plugin script using OpenAI's retrieval plugin☆323Updated 2 years ago
- ☆81Updated last year
- Patch for MPT-7B which allows using and training a LoRA☆58Updated 2 years ago
- Tune any FALCON in 4-bit☆464Updated 2 years ago
- Extend the original llama.cpp repo to support redpajama model.☆118Updated last year
- Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools☆142Updated 2 years ago
- 4 bits quantization of SantaCoder using GPTQ☆50Updated 2 years ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆103Updated 5 months ago
- Reimplementation of the task generation part from the Alpaca paper☆118Updated 2 years ago
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated 2 years ago