microsoft / GODEL
Large-scale pretrained models for goal-directed dialog
☆867Updated last year
Alternatives and similar repositories for GODEL:
Users that are interested in GODEL are comparing it to the libraries listed below
- Large-scale pretraining for dialogue☆2,383Updated 2 years ago
- Repo for fine-tuning Casual LLMs☆454Updated last year
- Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.☆472Updated last year
- Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpe…☆437Updated last year
- Ongoing research training transformer models at scale☆387Updated 8 months ago
- Crosslingual Generalization through Multitask Finetuning☆532Updated 7 months ago
- ☆1,563Updated 2 years ago
- ☆458Updated last year
- This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as…☆351Updated last year
- ☆535Updated last year
- Alpaca dataset from Stanford, cleaned and curated☆1,552Updated 2 years ago
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA☆302Updated last year
- simpleT5 is built on top of PyTorch-lightning⚡️ and Transformers🤗 that lets you quickly train your T5 models.☆393Updated last year
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions☆820Updated 2 years ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways☆820Updated 2 years ago
- The hub for EleutherAI's work on interpretability and learning dynamics☆2,469Updated last month
- Implementation of PersonaGPT Dialog Model☆106Updated 3 years ago
- Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-…☆558Updated 11 months ago
- Official supported Python bindings for llama.cpp + gpt4all☆1,020Updated last year
- Fast Inference Solutions for BLOOM☆561Updated 6 months ago
- Python bindings for the Transformer models implemented in C/C++ using GGML library.☆1,861Updated last year
- LLaMA: Open and Efficient Foundation Language Models☆2,802Updated last year
- Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆463Updated 2 years ago
- Tune any FALCON in 4-bit☆466Updated last year
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer☆1,630Updated last year
- Expanding natural instructions☆996Updated last year
- A method to fix GPT-3 after deployment with user feedback, without re-training.☆328Updated 2 years ago
- ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT☆1,203Updated 3 months ago
- An open collection of implementation tips, tricks and resources for training large language models☆472Updated 2 years ago
- ☆1,217Updated 2 years ago