Code for the paper "Studying Large Language Model Behaviors Under Context-Memory Conflicts With Real Documentss"
☆15Oct 8, 2024Updated last year
Alternatives and similar repositories for realistic_knowledge_conflicts
Users that are interested in realistic_knowledge_conflicts are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆21Jan 16, 2025Updated last year
- Entity-Based Knowledge Conflicts in Question Answering. Code repo for EMNLP2021 paper: https://aclanthology.org/2021.emnlp-main.565/☆77Aug 29, 2022Updated 3 years ago
- This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering.☆16Mar 20, 2023Updated 3 years ago
- Hierarchical Universal Modular ANotator☆12Mar 6, 2026Updated 2 weeks ago
- A simple and efficient baseline for data attribution☆11Nov 10, 2023Updated 2 years ago
- Enhanced Unsupervised Object Discoveries through Exhaustive Self-Supervised Transformers☆15Jun 25, 2024Updated last year
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"☆17Mar 2, 2026Updated 3 weeks ago
- ☆12Oct 4, 2021Updated 4 years ago
- Elegant and fast Material Design template for academics. Perfect 100/100 performance score.☆12Mar 21, 2025Updated last year
- ☆17Apr 11, 2025Updated 11 months ago
- explainable-machine-translation-metrics☆12Jul 15, 2022Updated 3 years ago
- PyTorch utilities for ML, specifically speech☆13Jan 30, 2024Updated 2 years ago
- Android releases of Clubhouse App☆14Apr 9, 2021Updated 4 years ago
- [TPAMI 2025] Revisiting Essential and Non-Essential Settings of Evidential Deep Learning☆25Jun 24, 2025Updated 9 months ago
- Code for COLING 2020 paper "Improving Document-level Sentiment Analysis with User and Product Context"☆11Apr 13, 2022Updated 3 years ago
- Code for "Training, Architecture, and Prior for Deterministic Uncertainty Methods" ICLR 2023 Workshop on Trustworthy ML☆12Jun 15, 2023Updated 2 years ago
- Deep Learning, University of Twente☆10Dec 16, 2020Updated 5 years ago
- SemEval2026 Task 3 DimABSA☆29Mar 13, 2026Updated last week
- This repository has been redirected into https://kuaisar.github.io/.☆11Oct 12, 2023Updated 2 years ago
- Resources for paper "DialSummEval: Revisiting summarization evaluation for dialogues"☆15Jul 22, 2025Updated 8 months ago
- [ACL 2025] Adaptive Retrieval without Self-Knowledge? Bringing Uncertainty Back Home☆17May 17, 2025Updated 10 months ago
- ☆14Dec 9, 2021Updated 4 years ago
- the open-source code of QAgent☆56Oct 14, 2025Updated 5 months ago
- ☆17May 19, 2023Updated 2 years ago
- mcp wrapper for openai built-in tools☆12Mar 13, 2025Updated last year
- Inference code in Pytorch for GPT-like models, such as PAGnol, a family of models with up to 1.5B parameters, trained on datasets in Fren…☆20Oct 18, 2022Updated 3 years ago
- AI model designed to test the effectiveness in handling external ethical attacks.☆11Feb 9, 2026Updated last month
- ☆28May 27, 2024Updated last year
- ☆23Sep 21, 2020Updated 5 years ago
- [ICLR 2026] Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents☆42Mar 7, 2026Updated 2 weeks ago
- One implementation of the paper "Coreference-Aware Dialogue Summarization".☆19Nov 9, 2023Updated 2 years ago
- Build ML pipelines with smart caching and remote execution. Develop locally, deploy to HPC clusters instantly. Track with Aim. 🎯☆13Feb 10, 2026Updated last month
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- ☆14Jul 18, 2022Updated 3 years ago
- Fine-tuning BART on COVID Dialogue Dataset☆17Apr 8, 2020Updated 5 years ago
- Constructing community of LLM-based Agent in the minecraft☆17Nov 27, 2025Updated 3 months ago
- CD20200004 from 01/01/2021 to 31/12/2023 - LIG UGA - Python Notebook and Models for the MT Lab @ ALPS 2022☆13Apr 1, 2024Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Aug 25, 2023Updated 2 years ago
- Active and Sample-Efficient Model Evaluation☆27May 22, 2025Updated 10 months ago