mediawiki-utilities / python-mwviews
Tools for parsing and querying Wikimedia Foundation pageview data from both static dumps and the online API.
☆65 · Updated 3 years ago
Alternatives and similar repositories for python-mwviews
Users who are interested in python-mwviews are comparing it to the libraries listed below.
- Guess gender from first name in Python 2 and 3 ☆137 · Updated 3 months ago
- Wikimedia Pageview API client ☆28 · Updated 7 years ago
- Python bindings to the Compact Language Detector ☆33 · Updated 5 years ago
- Materials for the workshop Advanced Text Analysis with SpaCy and Scikit-Learn, given at NYU during NYCDH Week 2017, at PyData NYC in Nov.… ☆83 · Updated 2 years ago
- [development moved to termite-data-server] ☆61 · Updated 11 years ago
- Python scripts for retrieving CSV data from the Google Ngram Viewer and plotting it in XKCD style. The Python script for retrieving ngram… ☆254 · Updated 4 years ago
- Data Server for Topic Models ☆121 · Updated 2 years ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city. ☆62 · Updated 8 years ago
- Github mirror - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing) ☆36 · Updated last year
- ☆32 · Updated 10 years ago
- Python port of the Twokenize class of ark-tweet-nlp ☆142 · Updated 7 years ago
- Import tables from any Wikipedia article as a dataset in Python ☆292 · Updated 3 years ago
- A comprehensive and scalable set of string tokenizers and similarity measures in Python ☆141 · Updated last year
- Library for unit extraction - fork of quantulum for python3 ☆142 · Updated last year
- The Python-language successor to the TABARI event-data coding software. ☆45 · Updated 8 years ago
- Refinery - A locally deployable open-source web platform for analysis of large document collections ☆101 · Updated 8 years ago
- Web Service for E-Discovery Analytics ☆75 · Updated 3 years ago
- System for building, visualizing, and working with LDA topic models ☆97 · Updated last month
- Hidden alignment conditional random field for classifying string pairs. ☆36 · Updated 7 years ago
- Force-Atlas 2 graph layout in networkx ☆22 · Updated 10 years ago
- Get list of common stop words in various languages in Python ☆156 · Updated last year
- The OpenRefine Python Client Library provides an interface to communicating with an OpenRefine server. ☆179 · Updated 6 years ago
- A simple fuzzy matching set for python strings ☆229 · Updated last year
- Detect and visualize text reuse ☆118 · Updated 11 months ago
- Wikipedia Data Analysis Toolkit ☆26 · Updated 8 years ago
- Python package for stylometry ☆63 · Updated 4 years ago
- (Mental) maps of texts with kernel density estimation and force-directed networks. ☆109 · Updated 10 years ago
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance. ☆247 · Updated 3 weeks ago
- Extract countries, regions and cities from a URL or text ☆217 · Updated 4 years ago
- Python module for bibliographic network analysis. ☆87 · Updated 4 years ago