π» Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"
β61May 31, 2024Updated last year
Alternatives and similar repositories for fantom
Users that are interested in fantom are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- β38Jul 16, 2023Updated 2 years ago
- πΈ Code and Dataset for our ACL 2023 paper: "MPCHAT: Towards Multimodal Persona-Grounded Conversation"β22Sep 5, 2023Updated 2 years ago
- [NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search"β23Oct 11, 2024Updated last year
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Modelsβ15Mar 8, 2023Updated 3 years ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with Lβ¦β45Jun 13, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official code for ICML 2024 paper "Learning to Continually Learn with the Bayesian Principle"β20May 27, 2024Updated last year
- π€« Code and benchmark for our ICLR 2024 spotlight paper: "Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Conβ¦β54Dec 20, 2023Updated 2 years ago
- β12May 6, 2024Updated last year
- β How Robust are Fact Checking Systems on Colloquial Claims?. In NAACL-HLT, 2021.β23Jul 1, 2021Updated 4 years ago
- [EMNLP 2023] Official repository for Dialogue Chain-of-Thought Distillation (DONUT & DOCTOR)β11Nov 15, 2023Updated 2 years ago
- π§π» Code and benchmark for our Findings of ACL 2024 paper - "TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playingβ¦β21Dec 20, 2024Updated last year
- Code and data for Koo et al's ACL 2024 paper "Benchmarking Cognitive Biases in Large Language Models as Evaluators"β22Feb 16, 2024Updated 2 years ago
- β24Dec 2, 2023Updated 2 years ago
- Hate speech detection corpus in Korean, shared with EMNLP 2023 paperβ17Apr 19, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- β13Mar 15, 2022Updated 4 years ago
- π₯ Code and Dataset for our EMNLP 2022 paper - "ProsocialDialog: A Prosocial Backbone for Conversational Agents"β65Aug 2, 2023Updated 2 years ago
- The source code of ExFunTubeβ10Aug 8, 2025Updated 8 months ago
- Official code and dataset repository of KoBBQ (TACL 2024)β19May 13, 2024Updated last year
- The official implemetation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).β44Dec 25, 2022Updated 3 years ago
- "CS224n 2021 winter" study - KoreaUniv. DSBA Labβ15Apr 18, 2022Updated 4 years ago
- Official Repository for our CVPR2024 paper: ESR-NeRF: Emissive Source Reconstruction Using LDR Multi-view Imagesβ15Jun 13, 2024Updated last year
- β11Oct 3, 2021Updated 4 years ago
- Official code and dataset for our NAACL 2024 paper: DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Dataseβ¦β13Jun 24, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ToMATO: Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of Mind (AAAI2025)β20Apr 16, 2025Updated last year
- [AAAI 2025 ππ«ππ₯] MuMA-ToM: Multi-modal Multi-Agent Theory of Mindβ40Jan 23, 2025Updated last year
- Interview-based evaluation of LLMsβ26Jan 8, 2025Updated last year
- β19Oct 11, 2025Updated 6 months ago
- Implementation of Variational Hierarchical User-based Conversation Modelβ10Jul 2, 2021Updated 4 years ago
- An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Commβ¦β10May 9, 2024Updated last year
- CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Koreanβ48Dec 23, 2024Updated last year
- Context-Sensitive Misspelling Correction of Clinical Text via Conditional Independence, CHIL 2022β17Apr 4, 2024Updated 2 years ago
- Official PyTorch implementation of "Text2Chart31: Instruction Tuning for Chart Generation with Automatic Feedback" (EMNLP 2024 Main Oral)β26Oct 15, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer β’ AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- β33Aug 30, 2023Updated 2 years ago
- Official datasets and pytorch implementation repository of SQuARe and KoSBi (ACL 2023)β251Jun 29, 2023Updated 2 years ago
- β24Oct 31, 2018Updated 7 years ago
- [NeurIPS 2025] Reasoning Models Better Express Their Confidence"β23Nov 19, 2025Updated 5 months ago
- Saving Dense Retriever from Shortcut Dependency in Conversational Search (EMNLP 2022)β18Nov 24, 2022Updated 3 years ago
- Public repository for "Think Twice: Perspective-Taking Improves Large Language Modelsβ Theory-of-Mind Capabilities".β23Aug 16, 2023Updated 2 years ago
- β20Mar 12, 2025Updated last year