Code for ExploreTom
โ93Jun 25, 2025Updated last year
Alternatives and similar repositories for ExploreToM
Users that are interested in ExploreToM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [AAAI 2025 ๐๐ซ๐๐ฅ] MuMA-ToM: Multi-modal Multi-Agent Theory of Mindโ41Jan 23, 2025Updated last year
- โ21Oct 11, 2025Updated 8 months ago
- โ23Nov 8, 2023Updated 2 years ago
- ๐ฆพ EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automaticโฆโ92Feb 5, 2026Updated 4 months ago
- ๐ป Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"โ62May 31, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways โข AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Large Concept Models: Language modeling in a sentence representation spaceโ2,366Jan 29, 2025Updated last year
- Measuring and Controlling Persona Drift in Language Model Dialogsโ25Feb 26, 2024Updated 2 years ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.โ20Jun 3, 2024Updated 2 years ago
- Evaluating Reward Models in Multilingual Settings (ACL Main '25)โ42May 16, 2025Updated last year
- ๅๅๆจๆญๅจๆ่ฒๆต้ๆจกๅไธญ็ๅบ็จ๏ผ้กน็ฎๅๅบ็่ฎบ๏ผ่ฎค็ฅ่ฏๆญๆจกๅ๏ผ variational inference for psychometrics model (item response theoy, cognitive diagnosis models)โ18Jul 6, 2023Updated 2 years ago
- Implementation of Monte Carlo Tree Searchโ15Aug 4, 2022Updated 3 years ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).โ375Updated this week
- Code accompanying our EMNLP 2019 paper: "Revisiting the Evaluation of Theory of Mind through Question Answering"โ29Aug 9, 2020Updated 5 years ago
- Source code for GreaTer ICLR 2025 - Gradient Over Reasoning makes Smaller Language Models Strong Prompt Optimizersโ36Apr 18, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer โข AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- โ17Apr 7, 2025Updated last year
- When Reasoning Meets Its Lawsโ37Jan 2, 2026Updated 5 months ago
- Code for verifying deep neural feature ansatzโ22May 3, 2023Updated 3 years ago
- [CVPR 2026] An official implementation of "Think Visually, Reason Textually: Vision-Language Synergy in ARC"โ46Nov 26, 2025Updated 7 months ago
- [NeurIPS 2025 ๐๐ฉ๐จ๐ญ๐ฅ๐ข๐ ๐ก๐ญ] AutoToM: Scaling Model-based Mental Inference via Automated Agent Modelingโ45Mar 28, 2026Updated 3 months ago
- [NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillationโ73Oct 17, 2025Updated 8 months ago
- โ26Mar 21, 2024Updated 2 years ago
- The first behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks.โ772Jun 10, 2025Updated last year
- This is the repo for the paper "PANGEA: A FULLY OPEN MULTILINGUAL MULTIMODAL LLM FOR 39 LANGUAGES"โ121Jun 27, 2025Updated last year
- Proton VPN Special Offer - Get 70% off โข AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- An open source implementation of CLIPโ22Nov 6, 2024Updated last year
- Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models"โ43Oct 1, 2024Updated last year
- โ11Sep 10, 2023Updated 2 years ago
- A deep research frameworkโ31Apr 21, 2026Updated 2 months ago
- Korean Benchmark for Korean Legal Language Understandingโ19Nov 16, 2024Updated last year
- Aioli: A unified optimization framework for language model data mixingโ32Jan 17, 2025Updated last year
- Official implementation for "MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models"โ20Oct 26, 2024Updated last year
- Clue inspired puzzles for testing LLM deduction abilitiesโ47Mar 19, 2026Updated 3 months ago
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"โ11Oct 27, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer โข AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- โ39Jul 16, 2023Updated 2 years ago
- โ12Nov 2, 2021Updated 4 years ago
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalizationโ19Mar 7, 2025Updated last year
- DPO, but faster ๐โ52Dec 6, 2024Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR.โ114Mar 26, 2026Updated 3 months ago
- โ11Feb 9, 2024Updated 2 years ago
- ToMBench: Benchmarking Theory of Mind in Large Language Models, ACL 2024.โ68Jun 24, 2024Updated 2 years ago