Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024)
☆10Oct 16, 2024Updated last year
Alternatives and similar repositories for EvolvingQA_benchmark
Users that are interested in EvolvingQA_benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2025] Reasoning Models Better Express Their Confidence"☆23Nov 19, 2025Updated 7 months ago
- [ICLR 2022] Towards Continual Knowledge Learning of Language Models☆91Oct 11, 2022Updated 3 years ago
- ☆13Jul 31, 2023Updated 2 years ago
- The implementation of <Factual Consistency Evaluation for Text Summarization via Counterfactual Estimation> in PyTorch.☆17Nov 11, 2021Updated 4 years ago
- [EMNLP 2022] TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models☆75May 15, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- (CVPR 2023) Coreset Sampling from Open-Set for Fine-Grained Self-Supervised Learning☆30Oct 3, 2023Updated 2 years ago
- Methods and evaluation for aligning language models temporally☆31Mar 2, 2024Updated 2 years ago
- Statistics and Accepted paper list of ACL 2020 with arXiv link☆23May 30, 2020Updated 6 years ago
- SMART introduces a novel test-time framework where Small Language Models (SLMs) reason step-by-step, and Large Language Models (LLMs) pro…☆12Jul 9, 2025Updated 11 months ago
- Official implementation of AsmDepictor, "A Transformer-based Function Symbol Name Inference Model from an Assembly Language for Binary Re…☆29Apr 30, 2024Updated 2 years ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Mar 8, 2023Updated 3 years ago
- ☆12Feb 11, 2026Updated 4 months ago
- "Tail-Aware Sperm Analysis for Transparent Tracking of Spermatozoa" Official Implementation