facebookresearch / ToMiLinks
Code accompanying our EMNLP 2019 paper: "Revisiting the Evaluation of Theory of Mind through Question Answering"
☆26Updated 5 years ago
Alternatives and similar repositories for ToMi
Users that are interested in ToMi are comparing it to the libraries listed below
Sorting:
- Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"☆22Updated 10 months ago
- Repository for Skill Set Optimization☆14Updated last year
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings☆15Updated 2 years ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆39Updated 2 years ago
- Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lo…☆16Updated last year
- Self-Supervised Alignment with Mutual Information☆20Updated last year
- Benchmarking Benchmark Leakage in Large Language Models☆58Updated last year
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 3 years ago
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning☆21Updated 2 years ago
- Teaching Models to Express Their Uncertainty in Words☆39Updated 3 years ago
- ☆27Updated last year
- Adding new tasks to T0 without catastrophic forgetting☆33Updated 3 years ago
- ☆15Updated last year
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆59Updated 3 years ago
- ☆24Updated last month
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆53Updated 8 months ago
- Evaluate the Quality of Critique☆36Updated last year
- ☆13Updated 3 years ago
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.☆64Updated last year
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆39Updated 3 years ago
- DEMix Layers for Modular Language Modeling☆54Updated 4 years ago
- Neuron Activation☆26Updated last year
- ☆23Updated 3 years ago
- ☆35Updated last year
- ☆38Updated 3 years ago
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆24Updated 3 years ago
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆55Updated last year
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆64Updated 2 years ago
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.☆11Updated 2 years ago
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"☆11Updated last year