Evaluate QA models for consistency
☆20Nov 21, 2022Updated 3 years ago
Alternatives and similar repositories for qa_consistency
Users that are interested in qa_consistency are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Datasets for the paper "Improving the Robustness of Question Answering Systems to Question Paraphrasing" (ACL 2019)☆27Aug 7, 2019Updated 6 years ago
- ☆23Aug 1, 2024Updated last year
- This repository reproduces the results in the paper "How expressive are transformers in spectral domain for graphs?"(published in TMLR)☆11Jul 10, 2022Updated 3 years ago
- ☆12Feb 22, 2021Updated 5 years ago
- Evaluating and improving the faithfulness of the interpretations offered by Neural Module Networks☆13Jun 12, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Repository for Sparse Universal Transformers☆20Oct 23, 2023Updated 2 years ago
- LEAP is a novel tool for discovering latent temporal causal relations.☆17Oct 18, 2021Updated 4 years ago
- implement of paper 'Probabilistic End-to-end Noise Correction for Learning with Noisy Labels'☆16Jul 18, 2019Updated 6 years ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16May 3, 2022Updated 3 years ago
- Causal Reasoning for Membership Inference Attacks☆11Oct 21, 2022Updated 3 years ago
- ☆13Sep 27, 2022Updated 3 years ago
- ☆10Jul 24, 2023Updated 2 years ago
- The source code for Adaptive Kernel Graph Neural Network at AAAI2022☆14Feb 23, 2022Updated 4 years ago
- Official codes for NAACL 2025 paper "LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias …☆11Nov 25, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The Dataset and Official Implementation for <The ELCo Dataset: Bridging Emoji and Lexical Composition> @ LREC-COLING 2024☆16May 11, 2024Updated last year
- InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem☆21Apr 7, 2026Updated 3 weeks ago
- ☆11Mar 26, 2020Updated 6 years ago
- ☆15Mar 12, 2024Updated 2 years ago
- ToMATO: Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of Mind (AAAI2025)☆20Apr 16, 2025Updated last year
- (NeurIPS 2025) SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions☆34Nov 16, 2025Updated 5 months ago
- Direct preference optimization with f-divergences.☆16Nov 3, 2024Updated last year
- Tool to convert JSON formatted discussion posts on Canvas LMS into HTML files - similar to saving student text-entry assignments☆13May 20, 2022Updated 3 years ago
- A PyTorch implementation of visual interaction networks☆12Jul 1, 2019Updated 6 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Code for the paper "multi-hop paragraph retrieval for open-domain question answering"☆36Jun 21, 2022Updated 3 years ago
- Paper: Lexicon Learning for Few-Shot Neural Sequence Modeling☆16Jan 8, 2022Updated 4 years ago
- N/A☆18Aug 15, 2022Updated 3 years ago
- Hypergraph convolution and attention networks research☆15Jul 31, 2024Updated last year
- We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr…☆44Oct 12, 2022Updated 3 years ago
- Gmail Mail Merge that uses a separate sheet for metadata to control the merge process. Useful for connecting with Google Form output, wh…☆14Apr 17, 2021Updated 5 years ago
- The multi-modal sequence to sequence baseline neural models used in the Grounded SCAN paper.☆16Mar 21, 2021Updated 5 years ago
- Repository for the ACL'22 paper "So Different Yet So Alike! Constrained Unsupervised Text Style Transfer"☆16Jan 19, 2024Updated 2 years ago
- Code for AAAI 2020 paper "Select, Answer and Explain: Interpretable Multi-hop Reading Comprehension over Multiple Documents"☆41Jan 23, 2020Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆11Oct 14, 2023Updated 2 years ago
- Official code repository for the EMNLP 2021 paper☆26Jan 30, 2022Updated 4 years ago
- Tensor product decomposition network☆20Mar 5, 2021Updated 5 years ago
- Evaluation Pipeline for medical tasks.☆12Apr 8, 2026Updated 3 weeks ago
- 2019语言与智能技术竞赛第5名方案☆14Dec 2, 2019Updated 6 years ago
- Neural Relation Extraction in Pytorch☆19Mar 13, 2019Updated 7 years ago
- Source code for "Training Generative Adversarial Networks Via Turing Test".☆13May 29, 2020Updated 5 years ago