togethercomputer / OpenDataHub
☆124 Updated last year
Related projects
Alternatives and complementary repositories for OpenDataHub
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks ☆206 Updated 10 months ago
- ☆263 Updated last year
- Crosslingual Generalization through Multitask Finetuning ☆515 Updated last month
- The data processing pipeline for the Koala chatbot language model ☆117 Updated last year
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA ☆301 Updated last year
- This project is an attempt to create a common metric to test LLMs for progress in eliminating hallucinations, which is the most serious c… ☆220 Updated last year
- ☆343 Updated last year
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation ☆211 Updated last year
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions. ☆177 Updated 2 years ago
- ☆175 Updated last year
- This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as… ☆348 Updated last year
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs… ☆183 Updated last year
- A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs ☆113 Updated last year
- Instruct-tuning LLaMA on consumer hardware ☆66 Updated last year
- Reverse Instructions to generate instruction tuning data with corpus examples ☆206 Updated 8 months ago
- Fast Inference Solutions for BLOOM ☆560 Updated last month
- Dataset collection and preprocessing framework for NLP extreme multitask learning ☆149 Updated 4 months ago
- This project studies the performance and robustness of language models and task-adaptation methods. ☆141 Updated 5 months ago
- Pipeline for pulling and processing online language model pretraining data from the web ☆174 Updated last year
- All available datasets for Instruction Tuning of Large Language Models ☆236 Updated 11 months ago
- [ICLR 2023] Codebase for Copy-Generator model, including an implementation of kNN-LM ☆181 Updated last year
- Implementation of Reinforcement Learning from Human Feedback (RLHF) ☆169 Updated last year
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation" ☆220 Updated last year
- ☆147 Updated 3 years ago
- Experiments with generating open-source language model assistants ☆97 Updated last year
- Pre-training code for Amber 7B LLM ☆152 Updated 6 months ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L… ☆42 Updated last year
- LLaMA Tuning with Stanford Alpaca Dataset using DeepSpeed and Transformers ☆51 Updated last year
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al… ☆110 Updated last year
- Inference script for Meta's LLaMA models using Hugging Face wrapper ☆111 Updated last year