togethercomputer / OpenDataHub
☆124 · Updated last year
Related projects
Alternatives and complementary repositories for OpenDataHub
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks ☆206 · Updated 10 months ago
- The data processing pipeline for the Koala chatbot language model ☆117 · Updated last year
- ☆147 · Updated 3 years ago
- ☆175 · Updated last year
- This project is an attempt to create a common metric to test LLMs for progress in eliminating hallucinations, which is the most serious c… ☆221 · Updated last year
- Pipeline for pulling and processing online language model pretraining data from the web ☆174 · Updated last year
- Crosslingual Generalization through Multitask Finetuning ☆516 · Updated 2 months ago
- Used for adaptive human in the loop evaluation of language and embedding models. ☆304 · Updated last year
- ☆263 · Updated last year
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation" ☆220 · Updated last year
- This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as… ☆348 · Updated last year
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA ☆301 · Updated last year
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation ☆213 · Updated last year
- Reproduce results and replicate training for T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization) ☆457 · Updated 2 years ago
- Implementation of Reinforcement Learning from Human Feedback (RLHF) ☆169 · Updated last year
- All available datasets for Instruction Tuning of Large Language Models ☆237 · Updated 11 months ago
- Inference script for Meta's LLaMA models using Hugging Face wrapper ☆111 · Updated last year
- DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective. ☆164 · Updated 7 months ago
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs… ☆183 · Updated last year
- ☆171 · Updated last year
- A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs ☆113 · Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆64 · Updated last month
- ☆86 · Updated 2 years ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA ☆101 · Updated 3 months ago
- Code and documentation to train Stanford's Alpaca models, and generate the data. ☆110 · Updated last year
- Reverse Instructions to generate instruction tuning data with corpus examples ☆207 · Updated 8 months ago
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions. ☆177 · Updated 2 years ago
- SAIL: Search Augmented Instruction Learning ☆160 · Updated last year
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss. ☆112 · Updated last year
- Code and models for BERT on STILTs ☆53 · Updated last year