lxe / cerebras-lora-alpacaLinks
LoRA weights for Cerebras-GPT-2.7b finetuned on Alpaca dataset with shorter prompt
☆63Updated 2 years ago
Alternatives and similar repositories for cerebras-lora-alpaca
Users that are interested in cerebras-lora-alpaca are comparing it to the libraries listed below
Sorting:
- ☆457Updated 2 years ago
- Instruct-tune LLaMA on consumer hardware with shareGPT data☆125Updated 2 years ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆412Updated 2 years ago
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA☆302Updated 2 years ago
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions☆824Updated 2 years ago
- Train llama with lora on one 4090 and merge weight of lora to work as stanford alpaca.☆52Updated 2 years ago
- Code and documentation to train Stanford's Alpaca models, and generate the data.☆110Updated 2 years ago
- Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools☆144Updated 2 years ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆146Updated 2 years ago
- React app implementing OpenAI and Google APIs to re-create behavior of the toolformer paper.☆233Updated 2 years ago
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"☆228Updated 2 years ago
- ☆276Updated 2 years ago
- CodeGen2 models for program synthesis☆271Updated 2 years ago
- LLaMa retrieval plugin script using OpenAI's retrieval plugin☆323Updated 2 years ago
- Extend the original llama.cpp repo to support redpajama model.☆118Updated last year
- A simple LangChain-like implementation based on Sentence Embedding+local knowledge base, with Vicuna (FastChat) serving as the LLM. Suppo…☆95Updated 2 years ago
- ☆33Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆132Updated last year
- SoTA Transformers with C-backend for fast inference on your CPU.☆308Updated 2 years ago
- ☆535Updated 2 years ago
- Patch for MPT-7B which allows using and training a LoRA☆58Updated 2 years ago
- Tune any FALCON in 4-bit☆465Updated 2 years ago
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆130Updated 2 years ago
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs…☆184Updated 2 years ago
- starcoder server for huggingface-vscdoe custom endpoint☆179Updated 2 years ago
- 4 bits quantization of LLaMa using GPTQ☆131Updated 2 years ago
- A lightweight, hackable, and efficient framework for training and fine-tuning language models☆186Updated last week
- Instruct-tune LLaMA on consumer hardware☆72Updated 2 years ago
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated 2 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆79Updated last year