ZhaofengWu / counterfactual-evaluation
☆57 · May 19, 2025 · Updated 8 months ago
Alternatives and similar repositories for counterfactual-evaluation
Users who are interested in counterfactual-evaluation are comparing it to the libraries listed below.
- Python package to download and use the SSB datasets ☆11 · Aug 3, 2023 · Updated 2 years ago
- Implementation of our paper "Scaling Back-Translation with Domain Text Generation for Sign Language Gloss Translation". Accepted in EACL … ☆11 · May 22, 2023 · Updated 2 years ago
- Source Code for "Adapters for Enhanced Modeling of Multilingual Knowledge and Text" ☆12 · Oct 28, 2022 · Updated 3 years ago
- Crawled Wikipedia Tables with Passages ☆13 · Aug 19, 2021 · Updated 4 years ago
- Original code base for On Pretraining Data Diversity for Self-Supervised Learning ☆14 · Dec 30, 2024 · Updated last year
- ☆13 · May 12, 2025 · Updated 9 months ago
- Code and data for the paper "In or Out? Fixing ImageNet Out-of-Distribution Detection Evaluation" ☆26 · Aug 22, 2023 · Updated 2 years ago
- How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges ☆30 · Sep 24, 2023 · Updated 2 years ago
- SpyGame: An interactive multi-agent framework to evaluate intelligence with large language models :D ☆15 · Nov 9, 2023 · Updated 2 years ago
- Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking ☆13 · Feb 5, 2023 · Updated 3 years ago
- ☆11 · Jul 17, 2021 · Updated 4 years ago
- Data and code for EMNLP 2023 industry-track paper "Investigating Table-to-Text Generation Capabilities of Large Language Models in Real-W… ☆30 · Jan 5, 2024 · Updated 2 years ago
- ☆37 · Jul 16, 2023 · Updated 2 years ago
- The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (NeurIPS 2022) ☆16 · Feb 11, 2023 · Updated 3 years ago
- {DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is} ☆14 · Jun 18, 2023 · Updated 2 years ago
- Cooperative Training of Descriptor and Generator Networks ☆16 · Oct 19, 2018 · Updated 7 years ago
- This is the official code repository for the paper 'Cross-modality Data Augmentation for End-to-End Sign Language Translation'. Accepted… ☆16 · Oct 18, 2023 · Updated 2 years ago
- This repository contains the code of our paper 'Skip \n: A simple method to reduce hallucination in Large Vision-Language Models'. ☆15 · Feb 12, 2024 · Updated 2 years ago
- This is the code for the ICLR 2023 paper "Leveraging Large Language Models for Multiple Choice Question Answering." ☆41 · Mar 1, 2023 · Updated 2 years ago
- ☆25 · Oct 17, 2023 · Updated 2 years ago
- (ICML 2023) Discover and Cure: Concept-aware Mitigation of Spurious Correlation ☆44 · Nov 17, 2025 · Updated 2 months ago
- Generating diverse counterfactual data for Natural Language Understanding tasks using Large Language Models (LLMs). The generator support… ☆38 · Jul 27, 2023 · Updated 2 years ago
- ☆22 · Mar 28, 2024 · Updated last year
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut… ☆23 · Dec 4, 2024 · Updated last year
- This repository contains source code for the PASTA model, a pre-trained language model for table-based fact verification. ☆18 · Dec 27, 2022 · Updated 3 years ago
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning… ☆22 · Nov 2, 2021 · Updated 4 years ago
- [COLM'24] "Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning" ☆21 · Jun 14, 2024 · Updated last year
- Submission to the inverse scaling prize ☆23 · Jul 23, 2023 · Updated 2 years ago
- Materials for "Prompting is not a substitute for probability measurements in large language models" (EMNLP 2023) ☆24 · Oct 24, 2023 · Updated 2 years ago
- ☆27 · Oct 30, 2023 · Updated 2 years ago
- Summary of recent news recommendation papers. ☆25 · Feb 2, 2022 · Updated 4 years ago
- ☆26 · Nov 8, 2022 · Updated 3 years ago
- The OccludedPASCAL3D+ is a dataset generated via superimposing occluder to PASCAL3D+ dataset for multiple computer vision tasks. ☆27 · Jun 5, 2022 · Updated 3 years ago
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought" ☆96 · Jan 21, 2024 · Updated 2 years ago
- Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model" ☆23 · Jun 28, 2024 · Updated last year
- Code for Findings of ACL 2021 paper: Logic-Consistency Text Generation from Semantic Parses ☆26 · Aug 3, 2021 · Updated 4 years ago
- ☆31 · Jun 12, 2024 · Updated last year
- ☆33 · Jul 8, 2024 · Updated last year
- APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation. A system-level optimization for scalable LLM tra… ☆48 · Oct 11, 2025 · Updated 4 months ago