EternityYW/TRAM-Benchmark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/EternityYW/TRAM-Benchmark)

EternityYW / TRAM-Benchmark

TRAM: Benchmarking Temporal Reasoning for Large Language Models (Findings of ACL 2024)

☆26

Alternatives and similar repositories for TRAM-Benchmark

Users that are interested in TRAM-Benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

EternityYW / BiasEval-LLM-MentalHealth
View on GitHub
Unveiling and Mitigating Bias in Mental Health Analysis with Large Language Models
☆12Jun 21, 2024Updated 2 years ago
EternityYW / Gemini-Commonsense-Evaluation
View on GitHub
Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"
☆38Jan 3, 2024Updated 2 years ago
zchuz / TimeBench
View on GitHub
The repository for ACL 2024 paper "TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models"
☆36Jun 29, 2024Updated 2 years ago
EternityYW / Metacognitive-Prompting
View on GitHub
Metacognitive Prompting Improves Understanding in Large Language Models (NAACL 2024)
☆47Nov 8, 2023Updated 2 years ago
DAMO-NLP-SG / TempReason
View on GitHub
☆33Jan 11, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
yizhongw / llm-temporal-alignment
View on GitHub
Methods and evaluation for aligning language models temporally
☆31Mar 2, 2024Updated 2 years ago
weiyifan1023 / MenatQA
View on GitHub
Code and Data for EMNLP 2023 Paper "MenatQA: A New Dataset for Testing the Temporal Comprehension and Reasoning Abilities of Large Langu…
☆14Apr 7, 2025Updated last year
IBM / tempqa-wd
View on GitHub
Temporal question answering dataset for Wikidata
☆14Sep 17, 2025Updated 10 months ago
google-deepmind / streamingqa
View on GitHub
☆51Oct 10, 2023Updated 2 years ago
chenhan97 / TimeLlama
View on GitHub
The official repo of TimeLlama, an instruction-finetuned Llama2 series that improve complex temporal reasoning ability.
☆43Nov 13, 2023Updated 2 years ago
CHLee0801 / TemporalWikiDatasets
View on GitHub
☆13Apr 24, 2022Updated 4 years ago
wenhuchen / Time-Sensitive-QA
View on GitHub
Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"
☆77Mar 3, 2022Updated 4 years ago
fengbinzhu / Doc2SoarGraph
View on GitHub
The repo of the Doc2SoarGraph framework
☆10Sep 17, 2024Updated last year
chentong0 / copy-bench
View on GitHub
CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation
☆14Aug 19, 2025Updated 11 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
realtimeqa / realtimeqa_public
View on GitHub
☆87Updated this week
NExTplusplus / TAT-DQA
View on GitHub
TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning
☆24Sep 17, 2024Updated last year
Vinoground / Vinoground
View on GitHub
☆13Apr 13, 2026Updated 3 months ago
ITA-Solar / helita
View on GitHub
A Python library for solar physics from the Institute of Theoretical Astrophysics, University of Oslo
☆11Apr 17, 2026Updated 3 months ago
xiongsiheng / TG-LLM
View on GitHub
[ACL 24 main] Large Language Models Can Learn Temporal Reasoning
☆71Apr 11, 2026Updated 3 months ago
tsor13 / kaleido
View on GitHub
☆24Mar 8, 2024Updated 2 years ago
isShayulajiao / CCL25-Eval-ZhengMing
View on GitHub
☆10Oct 25, 2025Updated 8 months ago
sylvain-wei / TIME
View on GitHub
[NeurIPS 2025 D&B (Spotlight🌟)] TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenario
☆32Oct 5, 2025Updated 9 months ago
zsLin177 / camr
View on GitHub
The system of SUDA-HUAWEI submitted at CAMR2022.
☆12Nov 22, 2022Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
ScialdoneLab / CIARA_python
View on GitHub
Implementation of entropy of mixing algorithm in python
☆10Oct 19, 2022Updated 3 years ago
google-research-datasets / TimeDial
View on GitHub
Temporal Commonsense Reasoning in Dialog
☆73Jun 9, 2021Updated 5 years ago
HLR / SpartQA_generation
View on GitHub
Generating SpartQA dataset
☆16May 3, 2023Updated 3 years ago
1171-jpg / BrainTeaser
View on GitHub
☆17Feb 1, 2024Updated 2 years ago
jranek / EVI
View on GitHub
Integrating temporal gene expression modalities for trajectory inference and disease prediction
☆11Sep 20, 2022Updated 3 years ago
xiusic / MinPrompt
View on GitHub
MinPrompt: Graph-based Minimal Prompt Data Augmentation for Few-shot Question Answering
☆14May 3, 2024Updated 2 years ago
GeorgeLuImmortal / RDL-Rationales-centric-Double-robustness-Learning
View on GitHub
☆12Jun 29, 2024Updated 2 years ago
lancopku / Augmented_Data_for_FST
View on GitHub
The augmented data of the paper "Parallel Data Augmentation for Formality Style Transfer" (ACL 2020).
☆12May 14, 2020Updated 6 years ago
dtsoucas / GiniClust2
View on GitHub
☆12Jul 13, 2018Updated 8 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
MANGA-UOFA / Prompt-Edit
View on GitHub
An official implementation for the EMNLP 2023 Findings paper "Prompt-Based Editing for Text Style Transfer"
☆13Dec 9, 2023Updated 2 years ago
njuguoyang / magnetic_modeling_codes
View on GitHub
FORTRAN and IDL codes to analyze solar magnetic field observations and construct magnetic models
☆25Jun 11, 2024Updated 2 years ago
AnWang-AI / towe-eacl
View on GitHub
☆13Jan 9, 2022Updated 4 years ago
jiayingwu19 / PSA
View on GitHub
Data and code for "Probing Spurious Correlations in Popular Event-Based Rumor Detection Benchmarks" (ECML-PKDD 2022)
☆11Jun 12, 2023Updated 3 years ago
MikeWangWZHL / Paxion
View on GitHub
Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight
☆38May 23, 2023Updated 3 years ago
SamuelGabriel / sqlbpe
View on GitHub
The implementation for the paper `Byte-Pair Encoding for Text-to-SQL Generation`.
☆14Feb 26, 2020Updated 6 years ago
neuralmind-ai / information-extraction-t5
View on GitHub
☆12Apr 29, 2022Updated 4 years ago