microsoft / GODEL
Large-scale pretrained models for goal-directed dialog
☆849 · Updated 9 months ago
Related projects:
- Large-scale pretraining for dialogue ☆2,343 · Updated last year
- Repo for fine-tuning Causal LLMs ☆449 · Updated 5 months ago
- Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT. ☆459 · Updated 6 months ago
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions ☆810 · Updated last year
- Crosslingual Generalization through Multitask Finetuning ☆510 · Updated last year
- ☆1,469 · Updated last year
- Expanding natural instructions ☆941 · Updated 9 months ago
- An open-source implementation of Google's PaLM models ☆804 · Updated 2 months ago
- Alpaca dataset from Stanford, cleaned and curated ☆1,493 · Updated last year
- ☆1,456 · Updated 3 weeks ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF) ☆4,442 · Updated 8 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl… ☆2,368 · Updated last month
- Toolkit for creating, sharing and using natural language prompts. ☆2,644 · Updated 10 months ago
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback" ☆1,561 · Updated last year
- Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI ☆1,941 · Updated last month
- The hub for EleutherAI's work on interpretability and learning dynamics ☆2,210 · Updated 3 weeks ago
- ☆2,635 · Updated last week
- Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpe… ☆428 · Updated last year
- Tune any FALCON in 4-bit ☆469 · Updated last year
- A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research … ☆867 · Updated 3 months ago
- Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models ☆2,815 · Updated 2 months ago
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer ☆1,601 · Updated last year
- ☆406 · Updated last year
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries ☆6,819 · Updated this week
- Chat with Meta's LLaMA models at home made easy ☆835 · Updated last year
- Ongoing research training transformer models at scale ☆371 · Updated 3 weeks ago
- ☆1,166 · Updated last year
- Fast Inference Solutions for BLOOM ☆556 · Updated last month
- A method to fix GPT-3 after deployment with user feedback, without re-training. ☆325 · Updated last year
- ☆1,411 · Updated last year