๐ฅค๐ง๐ปโ๐Code and dataset for our EMNLP 2023 paper - "SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization"
โ239Jan 23, 2026Updated last month
Alternatives and similar repositories for sodaverse
Users that are interested in sodaverse are comparing it to the libraries listed below
Sorting:
- ๐ฅ Code and Dataset for our EMNLP 2022 paper - "ProsocialDialog: A Prosocial Backbone for Conversational Agents"โ65Aug 2, 2023Updated 2 years ago
- ๐ค Code for our EMNLP 2020 paper: "Will I Sound Like Me? Improving Persona Consistency in Dialogues through Pragmatic Self-Consciousness"โ37Oct 12, 2020Updated 5 years ago
- ๐ธ Code and Dataset for our ACL 2023 paper: "MPCHAT: Towards Multimodal Persona-Grounded Conversation"โ22Sep 5, 2023Updated 2 years ago
- ๐ป Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"โ59May 31, 2024Updated last year
- ๋ชจ๋์ ๋ง๋ญ์น ๋ฐ์ดํฐ๋ฅผ ๋ถ์์ ํธ๋ฆฌํ ํํ๋ก ๋ณํํ๋ ๊ธฐ๋ฅ์ ์ ๊ณตํฉ๋๋ค.โ11Mar 2, 2022Updated 4 years ago
- Code and data for "KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark" (LREC-COLINGโฆโ17Apr 15, 2025Updated 10 months ago
- Code for the paper Code for the paper InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuningโ100May 6, 2023Updated 2 years ago
- ๐ค Code for our EMNLP 2021 paper: "Perspective-taking and Pragmatics for Generating Empathetic Responses Focused on Emotion Causes"โ76Mar 22, 2022Updated 3 years ago
- [EMNLP 2023] Official repository for Dialogue Chain-of-Thought Distillation (DONUT & DOCTOR)โ11Nov 15, 2023Updated 2 years ago
- Official code for ICML 2024 paper "Learning to Continually Learn with the Bayesian Principle"โ20May 27, 2024Updated last year
- NSMC, KorSTS ... fine-tuningsโ18Feb 23, 2022Updated 4 years ago
- โ180Feb 23, 2023Updated 3 years ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messagesโ53Aug 10, 2025Updated 6 months ago
- DSTC10 Track 2 - Knowledge-grounded Task-oriented Dialogue Modeling on Spoken Conversationsโ62Jul 25, 2023Updated 2 years ago
- ๐ง๐ป Code and benchmark for our Findings of ACL 2024 paper - "TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playingโฆโ21Dec 20, 2024Updated last year
- KOLD: Korean Offensive Language Datasetโ81Nov 13, 2022Updated 3 years ago
- Datasets collection and preprocessings framework for NLP extreme multitask learningโ193Jul 9, 2025Updated 7 months ago
- โ21Apr 16, 2022Updated 3 years ago
- Official datasets and pytorch implementation repository of SQuARe and KoSBi (ACL 2023)โ249Jun 29, 2023Updated 2 years ago
- Reward Model์ ์ด์ฉํ์ฌ ์ธ์ด๋ชจ๋ธ์ ๋ต๋ณ์ ํ๊ฐํ๊ธฐโ29Feb 23, 2024Updated 2 years ago
- โ22Oct 22, 2023Updated 2 years ago
- ๐ ์์ธ๋ ์ปดํจํฐ๊ณตํ๋ถ (์ปด๊ณต) ํ์ ๋ ผ๋ฌธ ํ ํ๋ฆฟ | Thesis template for SNU CSEโ16Jan 5, 2026Updated 2 months ago
- Korean text data preprocess toolkit for NLPโ18Jun 11, 2019Updated 6 years ago
- The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)โ119Oct 8, 2020Updated 5 years ago
- Experiments with generating opensource language model assistantsโ97May 14, 2023Updated 2 years ago
- [KO-Platy๐ฅฎ] Korean-Open-platypus๋ฅผ ํ์ฉํ์ฌ llama-2-ko๋ฅผ fine-tuningํ KO-platypus modelโ73Aug 24, 2025Updated 6 months ago
- Data and Code for Paper "Reflect Not Reflex: Inference-Based Common Ground Improves Dialogue Response Quality" (EMNLP 2022)โ11Nov 28, 2022Updated 3 years ago
- โ15May 15, 2021Updated 4 years ago
- โ11Sep 19, 2025Updated 5 months ago
- ๐ค Code for our EMNLP 2022 paper: "BotsTalk: Machine-sourced Framework for Automatic Curation of Large-scale Multi-skill Dialogue Datasetโฆโ16Oct 7, 2024Updated last year
- โ11Oct 3, 2021Updated 4 years ago
- โ How Robust are Fact Checking Systems on Colloquial Claims?. In NAACL-HLT, 2021.โ23Jul 1, 2021Updated 4 years ago
- Interview-based evaluation of LLMsโ25Jan 8, 2025Updated last year
- DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AIโ520Jan 27, 2025Updated last year
- Sotopia-ฯ: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)โ81May 7, 2024Updated last year
- huggingface๋ฅผ ์ด์ฉํ์ฌ downstream task ์ํํ๊ธฐโ62Dec 28, 2021Updated 4 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuningโ98Apr 26, 2023Updated 2 years ago
- The source code of ExFunTubeโ10Aug 8, 2025Updated 6 months ago
- baikal.ai's pre-trained BERT models: descriptions and sample codesโ12Jun 24, 2021Updated 4 years ago