☆57 · May 19, 2025 · Updated 9 months ago
Alternatives and similar repositories for counterfactual-evaluation
Users who are interested in counterfactual-evaluation compare it to the libraries listed below.
- Implementation of our paper "Scaling Back-Translation with Domain Text Generation for Sign Language Gloss Translation". Accepted in EACL … ☆11 · May 22, 2023 · Updated 2 years ago
- Python package to download and use the SSB datasets ☆11 · Aug 3, 2023 · Updated 2 years ago
- Crawled Wikipedia Tables with Passages ☆13 · Aug 19, 2021 · Updated 4 years ago
- ☆13 · May 12, 2025 · Updated 9 months ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?" ☆12 · Sep 11, 2025 · Updated 5 months ago
- Original code base for On Pretraining Data Diversity for Self-Supervised Learning ☆14 · Dec 30, 2024 · Updated last year
- Source Code for "Adapters for Enhanced Modeling of Multilingual Knowledge and Text" ☆12 · Oct 28, 2022 · Updated 3 years ago
- Code and data for the paper "In or Out? Fixing ImageNet Out-of-Distribution Detection Evaluation" ☆26 · Aug 22, 2023 · Updated 2 years ago
- The SVO-Probes Dataset for Verb Understanding ☆30 · Jan 28, 2022 · Updated 4 years ago
- ☆12 · Jun 30, 2024 · Updated last year
- SpyGame: An interactive multi-agent framework to evaluate intelligence with large language models :D ☆15 · Nov 9, 2023 · Updated 2 years ago
- Data and code for EMNLP 2023 industry-track paper "Investigating Table-to-Text Generation Capabilities of Large Language Models in Real-W… ☆30 · Jan 5, 2024 · Updated 2 years ago
- ☆37 · Jul 16, 2023 · Updated 2 years ago
- The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (NeurIPS 2022) ☆16 · Feb 11, 2023 · Updated 3 years ago
- This is the official code repository for the paper 'Cross-modality Data Augmentation for End-to-End Sign Language Translation'. Accepted… ☆16 · Oct 18, 2023 · Updated 2 years ago
- This is the code for the ICLR 2023 paper "Leveraging Large Language Models for Multiple Choice Question Answering." ☆41 · Mar 1, 2023 · Updated 3 years ago
- (ICML 2023) Discover and Cure: Concept-aware Mitigation of Spurious Correlation ☆44 · Nov 17, 2025 · Updated 3 months ago
- Generating diverse counterfactual data for Natural Language Understanding tasks using Large Language Models (LLMs). The generator support… ☆38 · Jul 27, 2023 · Updated 2 years ago
- ☆22 · Mar 28, 2024 · Updated last year
- This repository contains source code for the PASTA model, a pre-trained language model for table-based fact verification. ☆18 · Dec 27, 2022 · Updated 3 years ago
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut… ☆23 · Dec 4, 2024 · Updated last year
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning… ☆22 · Nov 2, 2021 · Updated 4 years ago
- [COLM'24] "Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning" ☆21 · Jun 14, 2024 · Updated last year
- Submission to the inverse scaling prize ☆23 · Jul 23, 2023 · Updated 2 years ago
- Materials for "Prompting is not a substitute for probability measurements in large language models" (EMNLP 2023) ☆24 · Oct 24, 2023 · Updated 2 years ago
- MERA tensor network for tiny object image classification ☆16 · Mar 31, 2022 · Updated 3 years ago
- ☆26 · Nov 8, 2022 · Updated 3 years ago
- Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model" ☆23 · Jun 28, 2024 · Updated last year
- Data and Code Release for "On the Potential of Lexico-logical Alignments for Semantic Parsing to SQL Queries" ☆55 · Nov 9, 2020 · Updated 5 years ago
- ☆33 · Jul 8, 2024 · Updated last year
- Augmenting Statistical Models with Natural Language Parameters ☆29 · Sep 17, 2024 · Updated last year
- APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation. A system-level optimization for scalable LLM tra… ☆51 · Oct 11, 2025 · Updated 4 months ago
- ☆31 · Jun 12, 2024 · Updated last year
- ☆102 · Dec 7, 2023 · Updated 2 years ago
- DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems ☆68 · Sep 29, 2024 · Updated last year
- (NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights ☆27 · Oct 28, 2024 · Updated last year
- Source Code of Paper "GPTScore: Evaluate as You Desire" ☆259 · Feb 21, 2023 · Updated 3 years ago
- ☆28 · Feb 17, 2024 · Updated 2 years ago
- [Spotlight ICLR 2023 paper] Continual evaluation for lifelong learning with neural networks, identifying the stability gap. ☆35 · Apr 2, 2023 · Updated 2 years ago