linhduongtuan / BLOOM-LORA
Because of LLaMA's license restrictions, we reimplement BLOOM-LoRA (BLOOM's license is much less restrictive; see https://huggingface.co/spaces/bigscience/license) using Alpaca-LoRA and Alpaca_data_cleaned.json
☆184 Updated last year
Related projects
Alternatives and complementary repositories for BLOOM-LORA
- Crosslingual Generalization through Multitask Finetuning ☆516 Updated 2 months ago
- A minimum example of aligning language models with RLHF similar to ChatGPT ☆214 Updated last year
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks ☆206 Updated 10 months ago
- Instruct-tune Open LLaMA / RedPajama / StableLM models on consumer hardware using QLoRA ☆80 Updated 11 months ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning ☆86 Updated last year
- Reverse Instructions to generate instruction tuning data with corpus examples ☆207 Updated 8 months ago
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA ☆301 Updated last year
- Unofficial implementation of AlpaGasus ☆84 Updated last year
- Official repository for LongChat and LongEval ☆512 Updated 6 months ago
- Inference script for Meta's LLaMA models using Hugging Face wrapper ☆111 Updated last year
- All available datasets for Instruction Tuning of Large Language Models ☆237 Updated 11 months ago
- ☆263 Updated last year
- This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as… ☆349 Updated last year
- Multi-language Enhanced LLaMA ☆301 Updated last year
- A Multilingual Replicable Instruction-Following Model ☆94 Updated last year
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels … ☆240 Updated last year
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human… ☆208 Updated 6 months ago
- Code for fine-tuning Platypus fam LLMs using LoRA ☆623 Updated 9 months ago
- The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1… ☆169 Updated last year
- ☆454 Updated last year
- Scripts for fine-tuning Llama2 via SFT and DPO. ☆186 Updated last year
- Code and models for BERT on STILTs ☆53 Updated last year
- A Multi-Turn Dialogue Corpus based on Alpaca Instructions ☆164 Updated last year
- ☆175 Updated last year
- Open efforts to implement ChatGPT-like models and beyond. ☆105 Updated 4 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆64 Updated last month
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation" ☆220 Updated last year
- train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism ☆208 Updated last year
- This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning" ☆197 Updated 6 months ago
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions. ☆177 Updated 2 years ago