ndrezn / wikipedia-histories
A Python tool to pull the complete edit history of a Wikipedia page
☆20Updated 2 months ago
Alternatives and similar repositories for wikipedia-histories:
Users who are interested in wikipedia-histories are comparing it to the libraries listed below.
- The GeoCorpora project aims at creating corpora of fully geo-annotated texts (in particular microblog texts) and developing tools to supp…☆18Updated 6 months ago
- Resources for the ACL 2018 publication "Which Melbourne? Augmenting Geocoding with Maps", published in July 2018.☆30Updated 6 years ago
- This is a repo containing all code and steps taken to download, set up the process and convert the whole English Wikipedia history from Wi…☆14Updated 4 years ago
- geoparsepy is a Python geoparsing library that will extract and disambiguate locations from text. It uses a local OpenStreetMap database …☆62Updated 3 years ago
- A Python library for topic modeling and visualization☆65Updated 4 years ago
- Collection of tools for building diachronic/historical word vectors☆423Updated last year
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"☆28Updated 4 years ago
- Extract city and country mentions from text, like GeoText, but using FlashText (an Aho-Corasick implementation) instead of regex.☆60Updated this week
- Full text geoparsing/toponym resolution with event geolocation☆72Updated last week
- Next generation event data ontology☆72Updated last year
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆157Updated 2 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆93Updated last year
- Dynamic Word Embeddings for Evolving Semantic Discovery code.☆73Updated 2 years ago
- Blazing fast topic modelling for short texts.☆31Updated last month
- Entity linking system for Wikidata updated by your edits in real time☆250Updated 2 months ago
- CLI for loading Wikidata subsets (or all of it) into Elasticsearch☆70Updated 3 years ago
- Neural-network-based lemmatizer for the Finnish language☆11Updated 4 years ago
- Another next-generation event coding platform.☆73Updated 5 years ago
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.☆12Updated 5 years ago
- A Python wrapper around the topic modeling functions of MALLET.☆101Updated 3 months ago
- Public client for consuming content from the Media Cloud Online News Archive & Directory.☆72Updated 2 months ago
- Custom French POS and lemmatizer based on Lefff for spaCy☆66Updated last year
- 📂 Additional lookup tables and data resources for spaCy☆101Updated 3 weeks ago
- Project on the history of genre.☆22Updated 5 years ago
- Tools to train and explore diachronic word embeddings from Big Historical Data☆21Updated 3 weeks ago
- linguistics backend☆41Updated last year
- Extract networks of entities from journalistic reporting☆48Updated last year
- Training Temporal Word Embeddings with a Compass☆64Updated 2 years ago