linhduongtuan / BLOOM-LORA
Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigscience/license) using Alpaca-LoRA and Alpaca_data_cleaned.json
☆184Updated last year
Alternatives and similar repositories for BLOOM-LORA:
Users that are interested in BLOOM-LORA are comparing it to the libraries listed below
- Crosslingual Generalization through Multitask Finetuning☆529Updated 6 months ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆208Updated last year
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA☆301Updated last year
- Multi-language Enhanced LLaMA☆301Updated last year
- All available datasets for Instruction Tuning of Large Language Models☆247Updated last year
- Reverse Instructions to generate instruction tuning data with corpus examples☆208Updated last year
- Inference script for Meta's LLaMA models using Hugging Face wrapper☆110Updated 2 years ago
- This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as…☆351Updated last year
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆89Updated last year
- The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1…☆176Updated 2 months ago
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.☆179Updated 2 years ago
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks☆178Updated last year
- minichatgpt - To Train ChatGPT In 5 Minutes☆167Updated last year
- Finetune BLOOM☆40Updated 2 years ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.☆162Updated last year
- Instruct-tune Open LLaMA / RedPajama / StableLM models on consumer hardware using QLoRA☆81Updated last year
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆212Updated 10 months ago
- A Multi-Turn Dialogue Corpus based on Alpaca Instructions☆169Updated last year
- The data processing pipeline for the Koala chatbot language model☆117Updated last year
- ☆457Updated last year
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions☆820Updated last year
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"☆226Updated last year
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆94Updated last year
- Scalable training for dense retrieval models.☆284Updated 3 weeks ago
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels …☆254Updated last year
- ☆124Updated last year
- DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI☆494Updated last month
- Code for fine-tuning Platypus fam LLMs using LoRA☆628Updated last year
- ☆268Updated last year
- [NIPS2023] RRHF & Wombat☆804Updated last year