lxe / cerebras-lora-alpacaLinks
LoRA weights for Cerebras-GPT-2.7b finetuned on Alpaca dataset with shorter prompt
☆63Updated 2 years ago
Alternatives and similar repositories for cerebras-lora-alpaca
Users that are interested in cerebras-lora-alpaca are comparing it to the libraries listed below
Sorting:
- Extend the original llama.cpp repo to support redpajama model.☆118Updated last year
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆413Updated 2 years ago
- Extension for using alternative GitHub Copilot (StarCoder API) in VSCode☆100Updated last year
- A simple LangChain-like implementation based on Sentence Embedding+local knowledge base, with Vicuna (FastChat) serving as the LLM. Suppo…☆95Updated 2 years ago
- ☆33Updated 2 years ago
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA☆301Updated 2 years ago
- Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools☆143Updated 2 years ago
- minichatgpt - To Train ChatGPT In 5 Minutes☆169Updated 2 years ago
- ☆457Updated 2 years ago
- Instruct-tune LLaMA on consumer hardware with shareGPT data☆126Updated 2 years ago
- React app implementing OpenAI and Google APIs to re-create behavior of the toolformer paper.☆233Updated 2 years ago
- CodeGen2 models for program synthesis☆271Updated 2 years ago
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs…☆184Updated 2 years ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆104Updated 6 months ago
- starcoder server for huggingface-vscdoe custom endpoint☆179Updated 2 years ago
- Reimplementation of the task generation part from the Alpaca paper☆119Updated 2 years ago
- Visual Studio Code extension for WizardCoder☆149Updated 2 years ago
- Train llama with lora on one 4090 and merge weight of lora to work as stanford alpaca.☆52Updated 2 years ago
- ☆276Updated 2 years ago
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆129Updated 2 years ago
- A dataset featuring diverse dialogues between two ChatGPT (gpt-3.5-turbo) instances with system messages written by GPT-4. Covering vario…☆164Updated 2 years ago
- This repo contains the data preparation, tokenization, training and inference code for BLOOMChat. BLOOMChat is a 176 billion parameter mu…☆585Updated 2 years ago
- fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backe…☆412Updated 2 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆79Updated last year
- A command-line interface to generate textual and conversational datasets with LLMs.☆299Updated 2 years ago
- This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as…☆357Updated 2 years ago
- ☆81Updated last year
- Finetune ALL LLMs with ALL Adapeters on ALL Platforms!☆331Updated 4 months ago
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated 2 years ago
- Run Alpaca LLM in LangChain☆216Updated last year