Code and Data for EMNLP 2023 Paper "MenatQA: A New Dataset for Testing the Temporal Comprehension and Reasoning Abilities of Large Language Models"
☆14Apr 7, 2025Updated 11 months ago
Alternatives and similar repositories for MenatQA
Users that are interested in MenatQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- NeurIPS 2025: Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMs☆65Nov 21, 2025Updated 4 months ago
- Code and Data for EMNLP 2024 Paper "Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent"☆135Jul 23, 2025Updated 8 months ago
- [NeurIPS 2025 D&B (Spotlight🌟)] TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenario☆30Oct 5, 2025Updated 5 months ago
- Code and Data for Paper "AutoTIR: Autonomous Tools Integrated Reasoning via Reinforcement Learning"☆50Sep 4, 2025Updated 6 months ago
- Gaussian Splatting for Robotic Simulation☆23Nov 7, 2025Updated 4 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [NeurIPS 2025] Continual Multimodal Contrastive Learning☆24Dec 18, 2025Updated 3 months ago
- ✨✨Latest Papers about LLM-based Evaluators☆32Feb 26, 2026Updated last month
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"☆76Mar 3, 2022Updated 4 years ago
- ☆25Dec 12, 2025Updated 3 months ago
- A temple run game controlled using face positions.☆14Jul 29, 2021Updated 4 years ago
- Hyperbolic Structural Entropy for Graph Clustering☆25Apr 15, 2025Updated 11 months ago
- [NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models☆33Jun 10, 2024Updated last year
- Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!☆11Oct 16, 2024Updated last year
- TRAM: Benchmarking Temporal Reasoning for Large Language Models (Findings of ACL 2024)☆26Jun 21, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- The source code and dataset of paper "Time-sensitive Retrieval-Augmented Generation for Question Answering"☆15Jan 3, 2025Updated last year
- A list of Numerical Multimodal reasoning papers and their implementation☆11May 13, 2024Updated last year
- ☆11Dec 15, 2024Updated last year
- template CV☆10Feb 4, 2023Updated 3 years ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆31Oct 9, 2025Updated 5 months ago
- The official implemention of "Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration"☆24Feb 4, 2026Updated last month
- ☆12Apr 29, 2022Updated 3 years ago
- Code, Data and Model for Paper "Learning from Peers in Reasoning Models"☆27May 13, 2025Updated 10 months ago
- LMTuner: Make the LLM Better for Everyone☆38Sep 21, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- An approach to perform RAG while taking into account the dynamic evolution of the data. Helpful to detect emerging trends in the data☆32Dec 30, 2023Updated 2 years ago
- The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"☆22Jun 26, 2025Updated 9 months ago
- Word embedding via tensor decomposition.☆23Mar 27, 2018Updated 8 years ago
- ☆23Mar 11, 2025Updated last year
- Code of EMNLP 2025 paper 'UltraIF: Advancing Instruction Following from the Wild'.☆21Apr 3, 2025Updated 11 months ago
- CAMeL Dataset☆15Apr 15, 2025Updated 11 months ago
- Repository for the paper "Automating App Review Response Generation"☆11Nov 16, 2021Updated 4 years ago
- 这是为希望学习FAISS向量数据库的同学准备的全面入门指导,帮助你快速建立相关概念,更好地阅读官方文档。☆35Nov 6, 2025Updated 4 months ago
- Code for COLING 2022 long paper: Answering Numerical Reasoning Questions in Table-Text Hybrid Contents with Graph-based Encoder and Tree-…☆22Dec 15, 2022Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- [NeurIPS 2025] Scaling Language-centric Omnimodal Representation Learning☆38Feb 6, 2026Updated last month
- ☆60Oct 27, 2025Updated 5 months ago
- Experimental AST-Based Source Code Similarity Detection Tool☆25Apr 10, 2024Updated last year
- Temporal question answering dataset for Wikidata☆14Sep 17, 2025Updated 6 months ago
- ARI (Abstract Reasoning Induction) is an innovative framework designed to enhance the temporal reasoning capabilities of Large Language M…☆13Dec 29, 2024Updated last year
- [ACL 2024] PyTorch implementation for "Stealthy Attack on Large Language Model based Recommendation"☆20Jun 19, 2024Updated last year
- ☆13Apr 24, 2022Updated 3 years ago