mediawiki-utilities / python-mwviews
Tools for parsing and querying Wikimedia Foundation pageview data from both static dumps and the online API.
☆66 Updated 3 years ago
Alternatives and similar repositories for python-mwviews
Users who are interested in python-mwviews are comparing it to the libraries listed below.
- Guess gender from first name in Python 2 and 3 ☆139 Updated 8 months ago
- Wikimedia Pageview API client ☆29 Updated 7 years ago
- The Python-language successor to the TABARI event-data coding software. ☆45 Updated 8 years ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city. ☆62 Updated 9 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing. ☆157 Updated 4 months ago
- Python scripts for retrieving CSV data from the Google Ngram Viewer and plotting it in XKCD style. The Python script for retrieving ngram… ☆254 Updated 5 years ago
- Data Server for Topic Models ☆122 Updated 2 years ago
- Materials for the workshop Advanced Text Analysis with SpaCy and Scikit-Learn, given at NYU during NYCDH Week 2017, at PyData NYC in Nov.… ☆83 Updated 3 years ago
- System for building, visualizing, and working with LDA topic models ☆97 Updated 2 weeks ago
- A simple Python library/tool for pulling location information from unstructured text ☆186 Updated 15 years ago
- Python bindings to the Compact Language Detector ☆33 Updated 5 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power ☆191 Updated 2 years ago
- Multidimensional data explorer and visualization tool. ☆56 Updated 8 years ago
- Refinery - A locally deployable open-source web platform for analysis of large document collections ☆101 Updated 9 years ago
- A Topic Modeling toolbox ☆92 Updated 9 years ago
- A Cython implementation of the affine gap string distance ☆57 Updated 3 years ago
- Python port of the Twokenize class of ark-tweet-nlp ☆142 Updated 7 years ago
- Library for unit extraction - fork of quantulum for python3 ☆145 Updated last year
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance. ☆250 Updated 4 months ago
- Search 'from' and 'to' strings to learn a text cleaning mapping ☆17 Updated 10 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes ☆183 Updated 2 years ago
- An automated ingestion service for blogs to construct a corpus for NLP research. ☆86 Updated 7 years ago
- Wikidata client library for Python ☆364 Updated 3 months ago
- Tool that tries to guess a person's gender based on their name and location ☆94 Updated last year
- Web Service for E-Discovery Analytics ☆78 Updated 3 years ago
- Get list of common stop words in various languages in Python ☆159 Updated 3 months ago
- [development moved to termite-data-server] ☆61 Updated 11 years ago
- A comprehensive and scalable set of string tokenizers and similarity measures in Python ☆142 Updated last year
- Detect and visualize text reuse ☆119 Updated last year
- Another next-generation event coding platform. ☆77 Updated 6 years ago