sroy9 / mawps
Code for MAWPS: A Math Word Problem Repository
☆40Updated last year
Related projects ⓘ
Alternatives and complementary repositories for mawps
- ☆80Updated last year
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆39Updated 10 months ago
- A unified benchmark for math reasoning☆87Updated last year
- An official repository for MIA 2022 (NAACL 2022 Workshop) Shared Task on Cross-lingual Open-Retrieval Question Answering.☆31Updated 2 years ago
- ☆42Updated 9 months ago
- [EMNLP 2022] Code and data for "Controllable Dialogue Simulation with In-Context Learning"☆34Updated last year
- ☆43Updated last year
- The official code of TACL 2021, "Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies".☆64Updated 2 years ago
- ☆48Updated last year
- Detect hallucinated tokens for conditional sequence generation.☆63Updated 2 years ago
- This repository contains the code for "How many data points is a prompt worth?"☆49Updated 3 years ago
- EMNLP 2022: Generating Natural Language Proofs with Verifier-Guided Search https://arxiv.org/abs/2205.12443☆81Updated last month
- ☆35Updated last year
- PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)☆71Updated 2 years ago
- Code for ACL 21: Generating Query Focused Summaries from Query-Free Resources☆33Updated 2 years ago
- ☆42Updated 3 years ago
- DEMix Layers for Modular Language Modeling☆53Updated 3 years ago
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"☆58Updated 7 months ago
- ☆97Updated 2 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆84Updated 2 years ago
- NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks☆20Updated 2 years ago
- ☆57Updated 2 years ago
- The official implemetation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).☆43Updated last year
- Code for ACL2021 long paper: Knowledgeable or Educated Guess? Revisiting Language Models as Knowledge Bases☆29Updated 2 years ago
- First explanation metric (diagnostic report) for text generation evaluation☆60Updated 3 months ago
- Automatic metrics for GEM tasks☆61Updated 2 years ago
- ☆41Updated last year
- code associated with ACL 2021 DExperts paper☆113Updated last year
- ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhi…☆49Updated 3 years ago
- [EMNLP 2020] Collective HumAn OpinionS on Natural Language Inference Data☆33Updated 2 years ago
- FRANK: Factuality Evaluation Benchmark☆52Updated last year