kimyuji / EvolvingQA_benchmark
Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024)
☆10 · Oct 16, 2024 · Updated last year
Alternatives and similar repositories for EvolvingQA_benchmark
Users who are interested in EvolvingQA_benchmark are comparing it to the libraries listed below
- [NeurIPS 2025] Reasoning Models Better Express Their Confidence ☆22 · Nov 19, 2025 · Updated 2 months ago
- ☆13 · Jul 31, 2023 · Updated 2 years ago
- The implementation of <Factual Consistency Evaluation for Text Summarization via Counterfactual Estimation> in PyTorch. ☆17 · Nov 11, 2021 · Updated 4 years ago
- [ICLR 2022] Towards Continual Knowledge Learning of Language Models ☆92 · Oct 11, 2022 · Updated 3 years ago
- ☆11 · Mar 13, 2025 · Updated 11 months ago
- (CVPR 2023) Coreset Sampling from Open-Set for Fine-Grained Self-Supervised Learning ☆30 · Oct 3, 2023 · Updated 2 years ago
- "Tail-Aware Sperm Analysis for Transparent Tracking of Spermatozoa" Official Implementation ☆10 · Jan 21, 2026 · Updated 3 weeks ago
- KLUE Benchmark 1st place (2021.12) solutions. (RE, MRC, NLI, STS, TC) ☆25 · Apr 11, 2022 · Updated 3 years ago
- [EMNLP 2022] TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models ☆74 · May 15, 2024 · Updated last year
- Methods and evaluation for aligning language models temporally ☆30 · Mar 2, 2024 · Updated last year
- Code repository for ‘Adaptive Differential Denoising for Respiratory Sounds Classification’ ☆20 · Dec 19, 2025 · Updated last month
- Deno Library to upload files to GCS and obtain signed url ☆11 · Jan 16, 2024 · Updated 2 years ago
- Enable Next-sentence Prediction for Large Language Models with Faster Speed, Higher Accuracy and Longer Context ☆41 · Aug 16, 2024 · Updated last year
- ☆39 · Mar 25, 2024 · Updated last year
- The PyTorch implementation of paper "KERMIT: Knowledge Graph Completion of Enhanced Relation Modeling with Inverse Transformation" ☆15 · Jul 4, 2025 · Updated 7 months ago
- Modeling Harmonic Complexity using two models of Conditional Variational Autoencoders - MSc. Thesis ☆10 · May 16, 2023 · Updated 2 years ago
- ☆10 · Aug 6, 2022 · Updated 3 years ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based … ☆11 · Mar 18, 2023 · Updated 2 years ago
- Smart contracts for a home rental network with IoT doorlocks ☆11 · Jun 5, 2018 · Updated 7 years ago
- Python tool (and library) to sign/verify files with RSA, Ed25519, or EC/secp256k1 keys ☆14 · Apr 16, 2021 · Updated 4 years ago
- Curated LLM (ICML 2024) ☆14 · Oct 23, 2024 · Updated last year
- Code for the paper "Closing the Curious Case of Neural Text Degeneration" ☆11 · Apr 9, 2025 · Updated 10 months ago
- Revisiting End-to-End Speech-to-Text Translation From Scratch ☆13 · Feb 21, 2023 · Updated 2 years ago
- ☆12 · Mar 4, 2025 · Updated 11 months ago
- ☆11 · Nov 4, 2012 · Updated 13 years ago
- Repository for the paper 'CausalConceptTS: Causal Attributions for Time Series Classification using High Fidelity Diffusion Models'. ☆12 · Jul 15, 2025 · Updated 7 months ago
- ☆10 · Oct 21, 2022 · Updated 3 years ago
- A package for Hangul (Korean alphabet) ☆13 · Dec 19, 2022 · Updated 3 years ago
- ☆10 · Jun 5, 2025 · Updated 8 months ago
- Randomized algorithm class at CU ☆15 · Jul 8, 2025 · Updated 7 months ago
- Official implementation for "MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models" ☆17 · Oct 26, 2024 · Updated last year
- ☆11 · Sep 10, 2023 · Updated 2 years ago
- Formalizing Multimedia Recommendation through Multimodal Deep Learning, accepted in ACM Transactions on Recommender Systems. ☆19 · Jul 2, 2024 · Updated last year
- [ACM MM 2024] See or Guess: Counterfactually Regularized Image Captioning ☆16 · Feb 17, 2025 · Updated last year
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators" ☆12 · Mar 25, 2025 · Updated 10 months ago
- The code for "MultiWave: Multiresolution Deep Architectures through Wavelet Decomposition for Multivariate Time Series Prediction" ☆14 · Feb 7, 2025 · Updated last year
- Source code for EMNLP 2022 paper HEGEL: Hypergraph Transformer for Long Document Summarization ☆15 · Oct 24, 2022 · Updated 3 years ago
- ☆16 · Jul 19, 2024 · Updated last year
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models ☆15 · Mar 8, 2023 · Updated 2 years ago