TRAM: Benchmarking Temporal Reasoning for Large Language Models (Findings of ACL 2024)
☆26Jun 21, 2024Updated last year
Alternatives and similar repositories for TRAM-Benchmark
Users that are interested in TRAM-Benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Unveiling and Mitigating Bias in Mental Health Analysis with Large Language Models☆12Jun 21, 2024Updated last year
- The repository for ACL 2024 paper "TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models"☆34Jun 29, 2024Updated last year
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"☆37Jan 3, 2024Updated 2 years ago
- ☆33Jan 11, 2024Updated 2 years ago
- Methods and evaluation for aligning language models temporally☆30Mar 2, 2024Updated 2 years ago
- Code and Data for EMNLP 2023 Paper "MenatQA: A New Dataset for Testing the Temporal Comprehension and Reasoning Abilities of Large Langu…☆14Apr 7, 2025Updated 11 months ago
- Materials for paper "Are Large Language Models Temporally Grounded?"☆13Nov 16, 2023Updated 2 years ago
- ☆49Oct 10, 2023Updated 2 years ago
- The official repo of TimeLlama, an instruction-finetuned Llama2 series that improve complex temporal reasoning ability.☆43Nov 13, 2023Updated 2 years ago
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"☆75Mar 3, 2022Updated 4 years ago
- The repo of the Doc2SoarGraph framework☆10Sep 17, 2024Updated last year
- ☆86Updated this week
- ☆13Aug 7, 2025Updated 7 months ago
- TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning☆23Sep 17, 2024Updated last year
- CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation☆14Aug 19, 2025Updated 7 months ago
- [ACL 24 main] Large Language Models Can Learn Temporal Reasoning☆67Dec 13, 2024Updated last year
- [ECCV 2022] Learning Instance-Specific Adaptation for Cross-Domain Segmentation☆14Jul 17, 2022Updated 3 years ago
- Code and data for "Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?" (ACL 2024)☆32Jul 3, 2024Updated last year
- ☆23Mar 8, 2024Updated 2 years ago
- ☆10Oct 25, 2025Updated 4 months ago
- Implementation of entropy of mixing algorithm in python☆10Oct 19, 2022Updated 3 years ago
- An exploration of LLM steering☆25Jun 15, 2024Updated last year
- Temporal Commonsense Reasoning in Dialog☆72Jun 9, 2021Updated 4 years ago
- Integrating temporal gene expression modalities for trajectory inference and disease prediction☆10Sep 20, 2022Updated 3 years ago
- Data and code for "Probing Spurious Correlations in Popular Event-Based Rumor Detection Benchmarks" (ECML-PKDD 2022)☆11Jun 12, 2023Updated 2 years ago
- Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight☆37May 23, 2023Updated 2 years ago
- ☆12Oct 4, 2023Updated 2 years ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆31Oct 9, 2025Updated 5 months ago
- An official implementation for the EMNLP 2023 Findings paper "Prompt-Based Editing for Text Style Transfer"☆13Dec 9, 2023Updated 2 years ago
- The official implemention of "Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration"☆24Feb 4, 2026Updated last month
- FORTRAN and IDL codes to analyze solar magnetic field observations and construct magnetic models☆22Jun 11, 2024Updated last year
- ☆12Apr 29, 2022Updated 3 years ago
- 多任务学习相关资料,论文,代码☆15Jan 22, 2019Updated 7 years ago
- ☆13Jan 9, 2022Updated 4 years ago
- The implementation for the paper `Byte-Pair Encoding for Text-to-SQL Generation`.☆14Feb 26, 2020Updated 6 years ago
- A framework for editing the CoTs for better factuality☆51Dec 9, 2023Updated 2 years ago
- ☆16Feb 1, 2024Updated 2 years ago
- [NeurIPS 2023] LMC: Large Model Collaboration with Cross-assessment for Training-Free Open-Set Object Recognition☆19May 26, 2024Updated last year
- ☆11Dec 6, 2020Updated 5 years ago