andrewtavis / wikirepoLinks
Python based Wikidata framework for easy dataframe extraction
β45Updated 2 years ago
Alternatives and similar repositories for wikirepo
Users that are interested in wikirepo are comparing it to the libraries listed below
Sorting:
- πΈ Train floret vectorsβ18Updated 2 years ago
- A maximum-strength name parser for record linkage.β39Updated 3 months ago
- Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookupβ70Updated 6 months ago
- Language detection using Spacy and Fasttextβ57Updated 2 years ago
- Wikidata authority file mapping toolβ11Updated 7 years ago
- MkDocs plugin to generate semantic reference Markdown pages from a knowledge graphβ40Updated last year
- π₯ Use Hugging Face text and token classification pipelines directly in spaCyβ63Updated last year
- Named entity recognition for the legal domainβ42Updated 4 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidataβ95Updated 2 years ago
- Next-generation Punkt sentence boundary detection with zero dependenciesβ26Updated last month
- β55Updated last year
- β70Updated 3 years ago
- π Dehyphenation of broken text (mainly German), i.e., extracted from a PDFβ39Updated 3 years ago
- β30Updated 3 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidataβ169Updated 3 years ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.β21Updated last year
- 𧬠A VS Code extension for annotating data with Prodigyβ30Updated 4 years ago
- Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.β62Updated last week
- CLI for loading Wikidata subsets (or all of it) into Elasticsearchβ71Updated 3 years ago
- Homebase of the IPTC EXTRA project about rule-based text categorizationβ13Updated 8 years ago
- Trying to generate name synonyms from wikidataβ34Updated 5 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporatedβ¦β26Updated 3 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linkingβ87Updated 3 years ago
- Python tools for interacting with Wikidataβ159Updated 2 years ago
- Generate reports for spaCy models.β29Updated 3 years ago
- Extract networks of entities from journalistic reportingβ49Updated 2 years ago
- A Python module to manipulate data on a Wikibase instance (like Wikidata) through the MediaWiki Wikibase API and the Wikibase SPARQL endpβ¦β86Updated this week
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissionsβ19Updated 2 years ago
- πGUI for training spaCy modelsβ55Updated 4 years ago
- Create a Geonames gazetteer index in Elasticsearchβ78Updated 2 years ago