andrewtavis / wikirepo
Python based Wikidata framework for easy dataframe extraction
☆43 · Updated last year
Alternatives and similar repositories for wikirepo:
Users that are interested in wikirepo are comparing it to the libraries listed below
- 🌸 Train floret vectors ☆18 · Updated last year
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata ☆94 · Updated last year
- Extract city and country mentions from text, like GeoText, but using FlashText (an Aho-Corasick implementation) instead of regex. ☆62 · Updated last week
- A maximum-strength name parser for record linkage. ☆36 · Updated last month
- ☆30 · Updated 2 years ago
- 🧬 A VS Code extension for annotating data with Prodigy ☆30 · Updated 3 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking ☆85 · Updated 2 years ago
- Extract networks of entities from journalistic reporting ☆48 · Updated last year
- Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup ☆69 · Updated 3 years ago
- Metadata Extractor & Loader (MEL) - The NLP-NER Toolkit (TNNT) ☆22 · Updated 2 years ago
- Trying to generate name synonyms from Wikidata ☆32 · Updated 4 years ago
- CLI for loading Wikidata subsets (or all of it) into Elasticsearch ☆70 · Updated 3 years ago
- Dehyphenation of broken text (mainly German), i.e., extracted from a PDF ☆38 · Updated 3 years ago
- ☆54 · Updated last year
- Language detection using spaCy and fastText ☆55 · Updated last year
- Use Hugging Face text and token classification pipelines directly in spaCy ☆63 · Updated last year
- Generate reports for spaCy models. ☆29 · Updated 2 years ago
- Finds linguistic patterns effortlessly ☆35 · Updated last year
- Named entity recognition for the legal domain ☆42 · Updated 3 years ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities. ☆15 · Updated 7 months ago
- Citation Classification using hybrid neural network model for Wikipedia References ☆28 · Updated 2 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata ☆157 · Updated 2 years ago
- NERtwork is a collection of scripts to help you create a network graph of co-occurring named entities using open source tools. This is do… ☆48 · Updated 11 months ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions ☆19 · Updated last year
- A Python library for topic modeling and visualization ☆65 · Updated 4 years ago
- Wikidata authority file mapping tool ☆11 · Updated 6 years ago
- A collection of notebooks for Natural Language Processing ☆25 · Updated 2 months ago
- Scalable String Similarity Joins in Python ☆38 · Updated 8 months ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ☆56 · Updated 3 months ago
- ADEL is a robust and efficient entity linking framework that is adaptive to text genres and language, entity types for the classification… ☆19 · Updated 5 years ago