LAION-AI / Open-Instruction-Generalist
Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks
☆209Updated last year
Alternatives and similar repositories for Open-Instruction-Generalist
Users who are interested in Open-Instruction-Generalist are comparing it to the libraries listed below
- ☆179Updated 2 years ago
- Datasets for Instruction Tuning of Large Language Models☆257Updated last year
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.☆181Updated 2 years ago
- ☆98Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models.☆104Updated 2 years ago
- Pipeline for pulling and processing online language model pretraining data from the web☆177Updated 2 years ago
- ☆159Updated 2 years ago
- Inference script for Meta's LLaMA models using Hugging Face wrapper☆109Updated 2 years ago
- ☆72Updated 2 years ago
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆217Updated 2 years ago
- Reverse Instructions to generate instruction tuning data with corpus examples☆214Updated last year
- DSIR large-scale data selection framework for language model training☆261Updated last year
- Simple next-token-prediction for RLHF☆226Updated 2 years ago
- ☆105Updated 2 years ago
- Inspired by google c4, here is a series of colossal clean data cleaning scripts focused on CommonCrawl data processing. Including Chinese…☆131Updated 2 years ago
- Reproduce results and replicate training of T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆462Updated 2 years ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆94Updated 2 years ago
- This project studies the performance and robustness of language models and task-adaptation methods.☆154Updated last year
- An experimental implementation of the retrieval-enhanced language model☆75Updated 2 years ago
- A Multilingual Replicable Instruction-Following Model☆95Updated 2 years ago
- Scaling Data-Constrained Language Models☆342Updated 3 months ago
- The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper https://arxiv.org/abs/2212.01349)☆158Updated 2 years ago
- Multipack distributed sampler for fast padding-free training of LLMs☆201Updated last year
- ☆184Updated 2 years ago
- ☆141Updated 9 months ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.☆165Updated 2 years ago
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models☆82Updated last year
- ☆242Updated 2 years ago
- Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467☆296Updated 8 months ago
- Scalable training for dense retrieval models.☆297Updated 4 months ago