Code and Data for EMNLP 2023 Paper "MenatQA: A New Dataset for Testing the Temporal Comprehension and Reasoning Abilities of Large Language Models"
β14Apr 7, 2025Updated 11 months ago
Alternatives and similar repositories for MenatQA
Users that are interested in MenatQA are comparing it to the libraries listed below
Sorting:
- NeurIPS 2025: Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMsβ64Nov 21, 2025Updated 3 months ago
- [NeurIPS 2025 D&B (Spotlightπ)] TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenarioβ30Oct 5, 2025Updated 5 months ago
- β25Dec 12, 2025Updated 2 months ago
- Code and Data for EMNLP 2024 Paper "Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent"β136Jul 23, 2025Updated 7 months ago
- Code and Data for Paper "AutoTIR: Autonomous Tools Integrated Reasoning via Reinforcement Learning"β50Sep 4, 2025Updated 6 months ago
- TRAM: Benchmarking Temporal Reasoning for Large Language Models (Findings of ACL 2024)β26Jun 21, 2024Updated last year
- β¨β¨Latest Papers about LLM-based Evaluatorsβ32Feb 26, 2026Updated last week
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"β75Mar 3, 2022Updated 4 years ago
- Repository for the paper "Automating App Review Response Generation"β11Nov 16, 2021Updated 4 years ago
- π κ°λκ³ κΈΈκ² κ°λ κ±Έ λͺ©νλ‘ νλ μ± μ€ν°λβ13Feb 24, 2026Updated last week
- β12Apr 24, 2024Updated last year
- ARI (Abstract Reasoning Induction) is an innovative framework designed to enhance the temporal reasoning capabilities of Large Language Mβ¦β13Dec 29, 2024Updated last year
- A temple run game controlled using face positions.β14Jul 29, 2021Updated 4 years ago
- A list of Numerical Multimodal reasoning papers and their implementationβ11May 13, 2024Updated last year
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Modelsβ15Mar 8, 2023Updated 3 years ago
- β60Feb 27, 2026Updated last week
- β11Dec 15, 2024Updated last year
- template CVβ10Feb 4, 2023Updated 3 years ago
- β12Apr 29, 2022Updated 3 years ago
- Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!β11Oct 16, 2024Updated last year
- Collection of lines of code for basics of clean plots in Plotly and Matplotlibβ13Feb 5, 2021Updated 5 years ago
- [EMNLP 2023] Official repository for Dialogue Chain-of-Thought Distillation (DONUT & DOCTOR)β11Nov 15, 2023Updated 2 years ago
- KLMS. Redesigned.β10Aug 28, 2022Updated 3 years ago
- The source code and dataset of paper "Time-sensitive Retrieval-Augmented Generation for Question Answering"β15Jan 3, 2025Updated last year
- The official implemention of "Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration"β24Feb 4, 2026Updated last month
- β19Jan 19, 2026Updated last month
- Temporal question answering dataset for Wikidataβ14Sep 17, 2025Updated 5 months ago
- Undergraduate Course Explorer for KAISTβ14Feb 28, 2023Updated 3 years ago
- [NeurIPS 2025] Scaling Language-centric Omnimodal Representation Learningβ33Feb 6, 2026Updated last month
- Welcome to PeriFlow CLI βοΈβ12Aug 3, 2023Updated 2 years ago
- β13Apr 24, 2022Updated 3 years ago
- CAMeL Datasetβ15Apr 15, 2025Updated 10 months ago
- key-value ε€ηΊΏη¨ζε‘ε¨β14Jul 19, 2015Updated 10 years ago
- β14May 7, 2021Updated 4 years ago
- Code for NeurIPS'23 paper "A Bayesian Approach To Analysing Training Data Attribution In Deep Learning"β17Jan 12, 2024Updated 2 years ago
- Compare a generic GPT-3 based chatbot with ChatGPTβ15Mar 1, 2026Updated last week
- ππ§ A minimalistic tool to fine-tune your LLMsβ18Aug 17, 2023Updated 2 years ago
- β23Mar 11, 2025Updated 11 months ago
- Test-driven implementation of nanoGPTβ16Dec 5, 2023Updated 2 years ago