facebookresearch / ToMi
Code accompanying our EMNLP 2019 paper: "Revisiting the Evaluation of Theory of Mind through Question Answering"
☆25 Updated 5 years ago
Alternatives and similar repositories for ToMi
Users who are interested in ToMi are comparing it to the libraries listed below
- Repository for Skill Set Optimization ☆14 Updated last year
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings ☆16 Updated 2 years ago
- Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing" ☆22 Updated 7 months ago
- DEMix Layers for Modular Language Modeling ☆54 Updated 4 years ago
- This repository contains data, code and models for contextual noncompliance. ☆24 Updated last year
- ☆13 Updated 3 years ago
- Self-Supervised Alignment with Mutual Information ☆21 Updated last year
- the instructions and demonstrations for building a formal logical reasoning capable GLM ☆54 Updated last year
- [EMNLP 2022] Code and data for "Controllable Dialogue Simulation with In-Context Learning" ☆35 Updated 2 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?" ☆59 Updated 2 years ago
- ☆35 Updated last year
- Adding new tasks to T0 without catastrophic forgetting ☆33 Updated 3 years ago
- ☆27 Updated last year
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation ☆24 Updated 3 years ago
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning ☆21 Updated 2 years ago
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs" ☆28 Updated 3 years ago
- Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304 ☆13 Updated 3 years ago
- ☆46 Updated last year
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning ☆99 Updated 2 years ago
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o… ☆27 Updated 3 months ago
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning ☆27 Updated last year
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023. ☆64 Updated 10 months ago
- Evaluate the Quality of Critique ☆36 Updated last year
- Code for "Tracing Knowledge in Language Models Back to the Training Data" ☆39 Updated 2 years ago
- InstructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc… ☆32 Updated last year
- Tasks for describing differences between text distributions. ☆17 Updated last year
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective ☆36 Updated 2 years ago
- ☆17 Updated 7 months ago
- ☆103 Updated last year
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se… ☆62 Updated 2 years ago