skywalker023 / sodaverse
Code and dataset for our EMNLP 2023 paper - "SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization"
★232 · Updated last year
Alternatives and similar repositories for sodaverse
Users who are interested in sodaverse are comparing it to the libraries listed below.
- [ICLR 2024 & NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically d… ★300 · Updated last year
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al. ★163 · Updated last year
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners ★116 · Updated 2 weeks ago
- The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper https://arxiv.org/abs/2212.01349) ★157 · Updated 2 years ago
- [ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets ★217 · Updated last year
- Reproduce results and replicate training for T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization) ★463 · Updated 2 years ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks ★208 · Updated last year
- This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as… ★352 · Updated 2 years ago
- Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions. ★180 · Updated 2 years ago
- ★180 · Updated 2 years ago
- ★182 · Updated 2 years ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning ★244 · Updated last year
- Reverse Instructions to generate instruction tuning data with corpus examples ★214 · Updated last year
- ★96 · Updated 2 years ago
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following ★79 · Updated 10 months ago
- DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI ★504 · Updated 5 months ago
- A framework for few-shot evaluation of autoregressive language models. ★105 · Updated 2 years ago
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation ★219 · Updated last year
- Evolve LLM training instructions, from English instructions to any language. ★118 · Updated last year
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning ★93 · Updated last year
- Scalable training for dense retrieval models. ★298 · Updated last month
- Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions" ★55 · Updated last year
- Datasets collection and preprocessings framework for NLP extreme multitask learning ★184 · Updated this week
- ★172 · Updated 2 years ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO ★116 · Updated last year
- ★52 · Updated 2 years ago
- Data processing system for polyglot ★91 · Updated last year
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision ★89 · Updated 8 months ago
- An experimental implementation of the retrieval-enhanced language model ★74 · Updated 2 years ago
- [Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers ★131 · Updated last year