qipeng / wikiextractorLinks
A tool for extracting plain text from Wikipedia dumps
☆15Updated 7 years ago
Alternatives and similar repositories for wikiextractor
Users that are interested in wikiextractor are comparing it to the libraries listed below
Sorting:
- ACL'2020: Contextualized Sparse Representations for Real-Time Open-Domain Question Answering☆49Updated 5 years ago
- Code for papers "A Surprisingly Robust Trick for Winograd Schema Challenge" and "WikiCREM: A Large Unsupervised Corpus for Coreference Re…☆71Updated 3 years ago
- Code for paper "Interactive Machine Comprehension with Information Seeking Agents" -- public version☆23Updated 6 years ago
- ☆123Updated 2 years ago
- An original implementation of ACL 2019, "Compositional Questions Do Not Necessitate Multi-hop Reasoning" (Single-hop Reading Comprehensio…☆58Updated 6 years ago
- Code and data for the paper: Answer-based Adversarial Training for Generating Clarification Questions☆43Updated 4 years ago
- Dockerized code for E3: Entailment-driven Extracting and Editing for Conversational Machine Reading.☆48Updated 2 years ago
- Code to create pre-training data for a span selection pre-training task inspired by reading comprehension and an effort to avoid encoding…☆30Updated 3 years ago
- ☆59Updated 7 years ago
- Repository for NLI models (EMNLP 2018)☆61Updated 6 years ago
- Progressively Pretrained Dense Corpus Index for Open-Domain QA and Information Retrieval☆43Updated 2 years ago
- Datasets for the paper "Improving the Robustness of Question Answering Systems to Question Paraphrasing" (ACL 2019)☆27Updated 6 years ago
- Implementation of NeurIPS 20 paper: Latent Template Induction with Gumbel-CRFs☆57Updated 4 years ago
- The official implementation of ACL 2020, "Logic-Guided Data Augmentation and Regularization for Consistent Question Answering".☆71Updated last year
- Evidence-based QA system for community question answering.☆109Updated 4 years ago
- NABERT model for solving the DROP dataset☆26Updated 6 years ago
- Code and Data for our EMNLP 2020 paper titled 'Learning to Explain: Datasets and Models for Identifying Valid Reasoning Chains in Multiho…☆28Updated 3 years ago
- Tools and datasets for Aristo Leaderboards☆42Updated 4 years ago
- Source code for: On the Effect of Low-Frequency Terms on Neural-IR Models, SIGIR'19☆47Updated 6 years ago
- A novel method of constrained decoding for neural NLG (NNLG) models☆84Updated 5 years ago
- A repository for converting between CoQA, SQuAD2, and QuAC and visualizing the data.☆25Updated 6 years ago
- This is the repo for the paper "Revealing the Importance of Semantic Retrieval for Machine Reading at Scale".☆60Updated 5 years ago
- Phrase-Indexed Question Answering (PIQA)☆94Updated 6 years ago
- AIR retriever for Multi-Hop QA (ACL 2020 paper)☆30Updated 5 years ago
- ☆12Updated 5 years ago
- ☆86Updated 5 years ago
- Code for ACL2021 paper: "GLGE: A New General Language Generation Evaluation Benchmark"☆57Updated 3 years ago
- Generalizing Natural Language Analysis through Span-relation Representations☆91Updated last month
- ☆63Updated 3 years ago
- WebConf 2020 paper Leading Conversational Search by Suggesting Useful Questions☆32Updated 5 years ago