A set of Python scripts for preprocessing the Wikidata JSON dump and running simple queries in an efficient manner.
☆143Oct 17, 2024Updated last year
Alternatives and similar repositories for simple-wikidata-db
Users that are interested in simple-wikidata-db are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python tools for interacting with Wikidata☆161Oct 28, 2023Updated 2 years ago
- Mapping Wikipedia pages to Wikidata IDs and vice versa.☆174May 11, 2023Updated 3 years ago
- [EMNLP 2022] TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models☆75May 15, 2024Updated 2 years ago
- ReFinED is an efficient and accurate entity linking (EL) system.☆242Dec 13, 2024Updated last year
- PyTorch - Albert Large V2, Bert Base Uncased, Bert Large Uncased WWM Finetuned Squad, Distil Roberta Base, Roberta Base Squad2, Roberta l…☆11Jul 10, 2020Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Tool for generating filtered Wikidata RDF exports☆45Apr 9, 2022Updated 4 years ago
- Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs (EMNLP 2024)☆16Nov 17, 2024Updated last year
- ☆19Feb 14, 2023Updated 3 years ago
- ☆15Jul 8, 2024Updated last year
- import a subset or a full Wikidata dump into a CouchDB database☆21Sep 10, 2024Updated last year
- Source code for TACL paper "KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation".☆215May 3, 2024Updated 2 years ago
- Official Repository for paper "Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Mod…☆15Nov 25, 2024Updated last year
- [EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”.☆16Feb 18, 2022Updated 4 years ago
- ☆12Jul 17, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆33Jan 11, 2024Updated 2 years ago
- ☆34Aug 26, 2025Updated 8 months ago
- Brave is a simple visualisation library for NLP information extraction, built on top of embedded BRAT.☆15Dec 25, 2019Updated 6 years ago
- ☆36Feb 21, 2025Updated last year
- Pytorch implementation of EntQA paper☆65May 21, 2022Updated 4 years ago
- Implementation of the paper: "Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning…☆22Nov 2, 2021Updated 4 years ago
- Benchmark API for Multidomain Language Modeling☆25Aug 26, 2022Updated 3 years ago
- Source code for "Revisiting Unsupervised Relation Extraction" in ACL 2020☆36Jun 20, 2023Updated 2 years ago
- ☆15May 26, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- a script to get a JSON file listing wikidata properties ids and their label in a given language☆34Feb 21, 2017Updated 9 years ago
- PromptKG Family: a Gallery of Prompt Learning & KG-related research works, toolkits, and paper-list.☆734Mar 22, 2024Updated 2 years ago
- PyTorch DataLoader for many VQA datasets☆15Jan 10, 2023Updated 3 years ago
- EMNLP2023 - InfoSeek: A New VQA Benchmark focus on Visual Info-Seeking Questions☆26May 30, 2024Updated last year
- Let Models Speak Ciphers: Multiagent Debate through Embeddings☆17Feb 17, 2024Updated 2 years ago
- ☆74Oct 27, 2023Updated 2 years ago
- [ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.☆14Mar 2, 2024Updated 2 years ago
- The software associated with a paper accepted at EMNLP 2021 titled "Open Knowledge Graphs Canonicalization using Variational Autoencoders…☆16Sep 27, 2021Updated 4 years ago
- Bi-encoder entity linking architecture☆52Sep 10, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"☆14Aug 19, 2022Updated 3 years ago
- Autoregressive Entity Retrieval☆797Jul 6, 2023Updated 2 years ago
- This repository contains code for the paper "Uncertainty Estimation and Calibration with Finite-State Probabilistic RNNs" (Wang, Lawrence…☆17Mar 8, 2021Updated 5 years ago
- Wikidata Subsetting☆17Feb 26, 2023Updated 3 years ago
- Repo for the paper: Towards Few-shot Entity Recognition in Document Images:A Label-aware Sequence-to-Sequence Framework☆14May 31, 2023Updated 2 years ago
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.☆12Jun 19, 2024Updated last year
- ☆33Jul 25, 2024Updated last year