Code for ExploreTom
☆91Jun 25, 2025Updated 8 months ago
Alternatives and similar repositories for ExploreToM
Users that are interested in ExploreToM are comparing it to the libraries listed below
Sorting:
- [AAAI 2025 𝐎𝐫𝐚𝐥] MuMA-ToM: Multi-modal Multi-Agent Theory of Mind☆39Jan 23, 2025Updated last year
- ☆16Oct 11, 2025Updated 5 months ago
- ☆22Nov 8, 2023Updated 2 years ago
- ☆12Jan 25, 2024Updated 2 years ago
- [NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search"☆23Oct 11, 2024Updated last year
- ☆43May 29, 2025Updated 9 months ago
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic…☆92Feb 5, 2026Updated last month
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆59May 31, 2024Updated last year
- Measuring and Controlling Persona Drift in Language Model Dialogs☆22Feb 26, 2024Updated 2 years ago
- Implementation of Monte Carlo Tree Search☆15Aug 4, 2022Updated 3 years ago
- Large Concept Models: Language modeling in a sentence representation space☆2,342Jan 29, 2025Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Jun 3, 2024Updated last year
- Official Code for M-RᴇᴡᴀʀᴅBᴇɴᴄʜ: Evaluating Reward Models in Multilingual Settings (ACL 2025 Main)☆41May 16, 2025Updated 10 months ago
- ☆29Nov 9, 2025Updated 4 months ago
- Code accompanying our EMNLP 2019 paper: "Revisiting the Evaluation of Theory of Mind through Question Answering"☆26Aug 9, 2020Updated 5 years ago
- Source code for GreaTer ICLR 2025 - Gradient Over Reasoning makes Smaller Language Models Strong Prompt Optimizers☆36Apr 18, 2025Updated 11 months ago
- ☆17Apr 7, 2025Updated 11 months ago
- [NeurIPS 2025 𝐒𝐩𝐨𝐭𝐥𝐢𝐠𝐡𝐭] AutoToM: Scaling Model-based Mental Inference via Automated Agent Modeling☆40Dec 26, 2025Updated 2 months ago
- When Reasoning Meets Its Laws☆36Jan 2, 2026Updated 2 months ago
- Code for verifying deep neural feature ansatz☆22May 3, 2023Updated 2 years ago
- Machine Theory of Mind Reading List. Built upon EMNLP Findings 2023 Paper: Towards A Holistic Landscape of Situated Theory of Mind in Lar…☆149Feb 18, 2025Updated last year
- [NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation☆71Oct 17, 2025Updated 5 months ago
- ☆26Mar 21, 2024Updated 2 years ago
- This is the repo for the paper "PANGEA: A FULLY OPEN MULTILINGUAL MULTIMODAL LLM FOR 39 LANGUAGES"☆119Jun 27, 2025Updated 8 months ago
- Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models"☆43Oct 1, 2024Updated last year
- Official implementation for "MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models"☆18Oct 26, 2024Updated last year
- Korean Benchmark for Korean Legal Language Understanding☆17Nov 16, 2024Updated last year
- Clue inspired puzzles for testing LLM deduction abilities☆46Updated this week
- 🎓 无需编写任何代码即可轻松创建漂亮的学术网站 Easily create a beautiful résumé and grow your followers using Hugo and GitHub. No code.☆26Feb 19, 2026Updated last month
- Aioli: A unified optimization framework for language model data mixing☆32Jan 17, 2025Updated last year
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 4 months ago
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization☆18Mar 7, 2025Updated last year
- Training GPTs to solve interaction nets☆18Aug 14, 2024Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR.☆112Apr 17, 2025Updated 11 months ago
- Code to train Sentence BERT Japanese model for Hugging Face Model Hub☆11Aug 8, 2021Updated 4 years ago
- ToMBench: Benchmarking Theory of Mind in Large Language Models, ACL 2024.☆66Jun 24, 2024Updated last year
- A summarizer for Japanese articles (but ChatGPT is better)☆10Aug 1, 2022Updated 3 years ago
- Pretraining Code for METAGENE-1☆70Jan 6, 2025Updated last year
- ☆10Nov 15, 2023Updated 2 years ago