Reason-Wang / flan-alpaca-loraLinks
This repository contains the code to train flan t5 with alpaca instructions and low rank adaptation.
☆51Updated 2 years ago
Alternatives and similar repositories for flan-alpaca-lora
Users that are interested in flan-alpaca-lora are comparing it to the libraries listed below
Sorting:
- A Multi-Turn Dialogue Corpus based on Alpaca Instructions☆177Updated 2 years ago
- ☆44Updated 2 years ago
- A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆60Updated 2 years ago
- Scripts for fine-tuning Llama2 via SFT and DPO.☆206Updated 2 years ago
- Datasets for Instruction Tuning of Large Language Models☆260Updated 2 years ago
- Retrieves parquet files from Hugging Face, identifies and quantifies junky data, duplication, contamination, and biased content in datase…☆53Updated 2 years ago
- Reverse Instructions to generate instruction tuning data with corpus examples☆216Updated last year
- ☆33Updated 2 years ago
- Benchmark baseline for retrieval qa applications☆118Updated last year
- This repository is the official implementation of our paper MVP: Multi-task Supervised Pre-training for Natural Language Generation.☆73Updated 3 years ago
- A dataset for training/evaluating Question Answering Retrieval models on ChatGPT responses with the possibility to training/evaluating on…☆141Updated last year
- ☆52Updated 2 years ago
- ☆123Updated 2 years ago
- Text classification with Foundation Language Model LLaMA☆113Updated 2 years ago
- ☆56Updated 2 years ago
- 首个中文心理咨询对话安全检测数据集☆22Updated 2 years ago
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd…☆63Updated last year
- A framework for human-readable prompt-based method with large language models. Specially designed for researchers. (Deprecated, check out…☆131Updated 2 years ago
- ☆70Updated 2 years ago
- ☆180Updated 2 years ago
- ☆173Updated 2 years ago
- Source code for the paper "Active Prompting with Chain-of-Thought for Large Language Models"☆248Updated last year
- LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers☆51Updated 2 years ago
- ☆98Updated 2 years ago
- Evaluating ChatGPT’s Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness☆144Updated last year
- Implementation of Reinforcement Learning from Human Feedback (RLHF)☆173Updated 2 years ago
- Unofficial implementation of AlpaGasus☆94Updated 2 years ago
- [ICLR 2023] Codebase for Copy-Generator model, including an implementation of kNN-LM☆190Updated 11 months ago
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels …☆284Updated 2 years ago
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs…☆184Updated 2 years ago