declare-lab / flan-alpaca
This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as Flan-T5.
☆352Updated 2 years ago
Alternatives and similar repositories for flan-alpaca
Users who are interested in flan-alpaca are comparing it to the libraries listed below.
- ☆458Updated last year
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA☆302Updated 2 years ago
- Reverse Instructions to generate instruction tuning data with corpus examples☆214Updated last year
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions☆821Updated 2 years ago
- Tune any FALCON in 4-bit☆465Updated last year
- ☆444Updated 2 years ago
- Repo for fine-tuning Causal LLMs☆457Updated last year
- ☆367Updated 2 years ago
- This project is an attempt to create a common metric to test LLMs for progress in eliminating hallucinations, which is the most serious c…☆222Updated 2 years ago
- User-friendly LLaMA: Train or Run the model using PyTorch. Nothing else.☆339Updated 2 years ago
- Code for fine-tuning Platypus fam LLMs using LoRA☆628Updated last year
- Ask Me Anything language model prompting☆547Updated 2 years ago
- Reproduce results and replicate training for T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆462Updated 2 years ago
- ☆180Updated 2 years ago
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"☆227Updated 2 years ago
- Crosslingual Generalization through Multitask Finetuning☆537Updated 10 months ago
- Due to LLaMA's restrictions, we try to reimplement BLOOM-LoRA (the much less restrictive BLOOM license is here: https://huggingface.co/spaces/bigs…☆184Updated 2 years ago
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆219Updated last year
- Implementation of Reinforcement Learning from Human Feedback (RLHF)☆172Updated 2 years ago
- ☆270Updated 2 years ago
- PaL: Program-Aided Language Models (ICML 2023)☆503Updated 2 years ago
- Code and data for "Measuring and Narrowing the Compositionality Gap in Language Models"☆317Updated last year
- batched loras☆344Updated last year
- Salesforce open-source LLMs with 8k sequence length.☆721Updated 6 months ago
- 🥤🧑🏻🚀Code and dataset for our EMNLP 2023 paper - "SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization…☆232Updated last year
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆208Updated last year
- ☆172Updated 2 years ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆245Updated last year
- Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates☆458Updated last year
- Build, evaluate, understand, and fix LLM-based apps☆490Updated last year