ndrezn / wikipedia-historiesLinks
A Python tool to pull the complete edit history of a Wikipedia page
☆20Updated 7 months ago
Alternatives and similar repositories for wikipedia-histories
Users that are interested in wikipedia-histories are comparing it to the libraries listed below
Sorting:
- Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.☆62Updated last week
- Resources for the ACL 2018 publication "Which Melbourne? Augmenting Geocoding with Maps", published in July 2018.☆30Updated 6 years ago
- A machine learning tool for fishing entities☆263Updated last month
- geoparsepy is a Python geoparsing library that will extract and disambiguate locations from text. It uses a local OpenStreetMap database …☆63Updated 3 years ago
- Interpretable data visualizations for understanding how texts differ at the word level☆278Updated 5 months ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆181Updated 2 years ago
- Entity linking system for Wikidata updated by your edits in real time☆256Updated 7 months ago
- Collection of tools for building diachronic/historical word vectors☆437Updated last year
- PYthon Automated Term Extraction☆315Updated 2 years ago
- Python based framework to retreive Global Database of Events, Language, and Tone (GDELT) version 1.0 and version 2.0 data.☆226Updated last year
- This is a repo containing all code and steps taken to download, setup the process and convert the whole English Wikipedia history from Wi…☆14Updated 5 years ago
- Wikidata client library for Python☆355Updated last year
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆130Updated last year
- A Python wrapper around the topic modeling functions of MALLET.☆103Updated 8 months ago
- Tools for parsing and querying Wikimedia Foundation pageview data from both static dumps and the online API.☆65Updated 3 years ago
- Another next-generation event coding platform.☆76Updated 6 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆163Updated 2 years ago
- Create a Geonames gazetteer index in Elasticsearch☆77Updated last year
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆140Updated last year
- Google USE (Universal Sentence Encoder) for spaCy☆184Updated 2 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power☆190Updated 2 years ago
- CLI for loading Wikidata subsets (or all of it) into Elasticsearch☆70Updated 3 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆94Updated 2 years ago
- Palmetto is a quality measuring tool for topics☆216Updated last year
- 📂 Additional lookup tables and data resources for spaCy☆107Updated last month
- Fuzzy matching and more functionality for spaCy.☆256Updated last year
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆260Updated 10 months ago
- Implementation of the ClausIE information extraction system for python+spacy☆224Updated 2 years ago
- Full text geoparsing as a Python library☆751Updated 3 years ago
- Next generation event data ontology☆73Updated last year