👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"
☆59 · May 31, 2024 · Updated last year
Alternatives and similar repositories for fantom
Users who are interested in fantom are comparing it to the libraries listed below
- ☆37 · Jul 16, 2023 · Updated 2 years ago
- ☆22 · Nov 8, 2023 · Updated 2 years ago
- [NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search" · ☆23 · Oct 11, 2024 · Updated last year
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models · ☆15 · Mar 8, 2023 · Updated 3 years ago
- Code for our EMNLP 2020 paper: "Will I Sound Like Me? Improving Persona Consistency in Dialogues through Pragmatic Self-Consciousness" · ☆37 · Oct 12, 2020 · Updated 5 years ago
- Official code for ICML 2024 paper "Learning to Continually Learn with the Bayesian Principle" · ☆20 · May 27, 2024 · Updated last year
- ☆11 · Sep 19, 2025 · Updated 6 months ago
- Machine Theory of Mind Reading List. Built upon EMNLP Findings 2023 Paper: Towards A Holistic Landscape of Situated Theory of Mind in Lar… · ☆149 · Feb 18, 2025 · Updated last year
- 🥤 Code and dataset for our EMNLP 2023 paper - "SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization… · ☆240 · Jan 23, 2026 · Updated last month
- 🤫 Code and benchmark for our ICLR 2024 spotlight paper: "Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Con… · ☆50 · Dec 20, 2023 · Updated 2 years ago
- ☆12 · May 6, 2024 · Updated last year
- [EMNLP 2023] Official repository for Dialogue Chain-of-Thought Distillation (DONUT & DOCTOR) · ☆11 · Nov 15, 2023 · Updated 2 years ago
- Code and benchmark for our Findings of ACL 2024 paper - "TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing… · ☆21 · Dec 20, 2024 · Updated last year
- DSBA code study · ☆30 · Nov 7, 2023 · Updated 2 years ago
- Code and data for Koo et al.'s ACL 2024 paper "Benchmarking Cognitive Biases in Large Language Models as Evaluators" · ☆22 · Feb 16, 2024 · Updated 2 years ago
- Official code for our EMNLP2021 Outstanding Paper MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks · ☆21 · May 18, 2023 · Updated 2 years ago
- ☆24 · Dec 2, 2023 · Updated 2 years ago
- Hate speech detection corpus in Korean, shared with EMNLP 2023 paper · ☆17 · Apr 19, 2024 · Updated last year
- ☆13 · Mar 15, 2022 · Updated 4 years ago
- ToMBench: Benchmarking Theory of Mind in Large Language Models, ACL 2024. · ☆66 · Jun 24, 2024 · Updated last year
- Code and Dataset for our EMNLP 2022 paper - "ProsocialDialog: A Prosocial Backbone for Conversational Agents" · ☆65 · Aug 2, 2023 · Updated 2 years ago
- ☆49 · Apr 4, 2025 · Updated 11 months ago
- Official code and dataset repository of KoBBQ (TACL 2024) · ☆19 · May 13, 2024 · Updated last year
- The official implementation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022). · ☆44 · Dec 25, 2022 · Updated 3 years ago
- "CS224n 2021 winter" study - KoreaUniv. DSBA Lab · ☆15 · Apr 18, 2022 · Updated 3 years ago
- Official Repository for our CVPR2024 paper: ESR-NeRF: Emissive Source Reconstruction Using LDR Multi-view Images · ☆15 · Jun 13, 2024 · Updated last year
- ☆11 · Oct 3, 2021 · Updated 4 years ago
- Official code and dataset for our NAACL 2024 paper: DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Datase… · ☆13 · Jun 24, 2024 · Updated last year
- Official code and dataset for our CCGPK@COLING 2022 paper - "PersonaChatGen: Generating Personalized Dialogue using GPT-3" · ☆13 · Mar 26, 2024 · Updated last year
- ToMATO: Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of Mind (AAAI2025) · ☆19 · Apr 16, 2025 · Updated 11 months ago
- [AAAI 2025 Oral] MuMA-ToM: Multi-modal Multi-Agent Theory of Mind · ☆39 · Jan 23, 2025 · Updated last year
- CareCall for Seniors: Role Specified Open-Domain Dialogue dataset generated by leveraging LLMs (NAACL 2022). · ☆60 · May 3, 2022 · Updated 3 years ago
- Interview-based evaluation of LLMs · ☆25 · Jan 8, 2025 · Updated last year
- An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Comm… · ☆10 · May 9, 2024 · Updated last year
- CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean · ☆48 · Dec 23, 2024 · Updated last year
- Context-Sensitive Misspelling Correction of Clinical Text via Conditional Independence, CHIL 2022 · ☆17 · Apr 4, 2024 · Updated last year
- ☆13 · Jul 13, 2018 · Updated 7 years ago
- Code for the paper "CoS: Enhancing Personalization and Mitigating Bias with Context Steering" · ☆20 · Dec 13, 2024 · Updated last year
- InstructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc… · ☆32 · Jun 13, 2024 · Updated last year