ndrezn / wikipedia-histories
A Python tool to pull the complete edit history of a Wikipedia page
☆20Updated 5 months ago
Alternatives and similar repositories for wikipedia-histories:
Users that are interested in wikipedia-histories are comparing it to the libraries listed below
- This is a repo containing all code and steps taken to download, setup the process and convert the whole English Wikipedia history from Wi…☆14Updated 4 years ago
- The GeoCorpora project aims at creating corpora of fully geo-annotated texts (in particular microblog texts) and developing tools to supp…☆18Updated 8 months ago
- Interpretable data visualizations for understanding how texts differ at the word level☆275Updated 2 months ago
- A Python wrapper around the topic modeling functions of MALLET.☆101Updated 6 months ago
- geoparsepy is a Python geoparsing library that will extract and disambiguate locations from text. It uses a local OpenStreetMap database …☆62Updated 3 years ago
- ☆25Updated 5 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power☆190Updated 2 years ago
- Entity linking system for Wikidata updated by your edits in real time☆254Updated 5 months ago
- Blazing fast topic modelling for short texts.☆31Updated last month
- Code for the paper "Analyzing Polarization in Social Media: Method and Application to Tweets on 21 Mass Shootings"☆68Updated 2 years ago
- Sentence embeddings for unsupervised event detection in the Twitter stream: study on English and French corpora☆31Updated last month
- Next generation event data ontology☆73Updated last year
- A Python library for topic modeling and visualization☆65Updated 4 years ago
- 🧪 Cutting-edge experimental spaCy components and features☆98Updated last year
- Custom French POS and lemmatizer based on Lefff for spacy☆66Updated 2 years ago
- Collection of tools for building diachronic/historical word vectors☆431Updated last year
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆161Updated 2 years ago
- Training Temporal Word Embeddings with a Compass☆64Updated 2 years ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Updated last year
- Resources for the ACL 2018 publication "Which Melbourne? Augmenting Geocoding with Maps", published in July 2018.☆30Updated 6 years ago
- Code for the CUP Elements on text analysis in Python for social scientists☆137Updated 2 years ago
- Another next-generation event coding platform.☆73Updated 6 years ago
- Data and code for the book Enumerations: Data and Literary Study (Chicago 2018)☆25Updated 6 years ago
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"☆28Updated 4 years ago
- ☆53Updated 2 years ago
- Notebooks configured to be run with Binder, usually found on my blog.☆42Updated 2 years ago
- Source code and data for paper "Neutral Bots Probe Political Bias on Social Media" by Chen et al.☆31Updated 3 years ago
- Quick implementation of Monroe et al.'s algorithm for comparing languages☆53Updated 4 years ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆129Updated last year
- Analysis and experiments on the UN General Debate corpus☆36Updated 6 years ago