neelguha / simple-wikidata-db
A set of Python scripts for preprocessing the Wikidata JSON dump and running simple queries in an efficient manner.
☆102Updated last month
Related projects ⓘ
Alternatives and complementary repositories for simple-wikidata-db
- Mapping Wikipedia pages to Wikidata IDs and vice versa.☆153Updated last year
- The data and the PyTorch implementation for the models and experiments in the paper "Exploiting Asymmetry for Synthetic Training Data Gen…☆58Updated last year
- Code repo for ACL22 paper "DeepStruct: Pretraining of Language Models for Structure Prediction"☆81Updated last year
- You can find the most recent KGQA benchmark numbers from publications here.☆112Updated 4 months ago
- PyTorch implementation and pre-trained models for ASP - Autoregressive Structured Prediction with Language Models, EMNLP 22. https://arxi…☆100Updated 9 months ago
- Structured Prediction for Entity Linking☆27Updated 3 months ago
- Code repo for EMNLP21 paper "Zero-Shot Information Extraction as a Unified Text-to-Triple Translation"☆107Updated 6 months ago
- This is the code for our KILT leaderboard submissions (KGI + Re2G models).☆148Updated last year
- The autoregressive information extraction system GenIE (Generative Information Extraction) implemented in PyTorch.☆99Updated last year
- ReFinED is an efficient and accurate entity linking (EL) system.☆191Updated 10 months ago
- Source code for paper "Learning from Noisy Labels for Entity-Centric Information Extraction", EMNLP 2021☆55Updated 2 years ago
- Code for our paper "Graph Pre-training for AMR Parsing and Generation" in ACL2022☆95Updated 6 months ago
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"☆63Updated 2 years ago
- Contrastive Fact Verification☆70Updated 2 years ago
- OpenIE6 system☆121Updated 2 years ago
- Code and model checkpoints for the MultiVerS model for scientific claim verification.☆44Updated last year
- TAT-QA (Tabular And Textual dataset for Question Answering) contains 16,552 questions associated with 2,757 hybrid contexts from real-wor…☆97Updated 2 months ago
- Source code for TACL paper "KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation".☆196Updated 6 months ago
- Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)☆104Updated 2 years ago
- Multi^2OIE: Multilingual Open Information Extraction Based on Multi-Head Attention with BERT (Findings of ACL: EMNLP 2020)☆57Updated 2 years ago
- CoCo-Ex extracts meaningful concepts from natural language texts and maps them to conjunct concept nodes in ConceptNet, utilizing the max…☆58Updated last year
- [EMNLP 2022] Generative Knowledge Graph Construction: A Review☆105Updated last year
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"☆96Updated last year
- cRocoDiLe is a dataset extraction tool for Relation Extraction using Wikipedia and Wikidata presented in REBEL (EMNLP 2021).☆64Updated last year
- ACL 2023 (Findings) - BertNet: Harvesting Knowledge Graphs from Pretrained Language Models☆101Updated 4 months ago
- Pytorch implementation of EntQA paper☆62Updated 2 years ago
- Code for the paper "Open Domain Question Answering with A Unified Knowledge Interface" (ACL 2022)☆57Updated last year
- Pytorch implementation of Highly Parallel Autoregressive Entity Linking with Discriminative Correction☆67Updated 2 years ago
- MTab: Entity Search and Table Annotation with Wikidata, Wikipedia, and DBpedia☆30Updated 2 years ago
- Submissions, baselines and evaluations scripts for the 2nd version of the WebNLG+ Challenge 2020☆13Updated 2 years ago