glimmerphoenix / WikiDATLinks
Wikipedia Data Analysis Toolkit
☆26Updated 8 years ago
Alternatives and similar repositories for WikiDAT
Users that are interested in WikiDAT are comparing it to the libraries listed below
Sorting:
- Data Server for Topic Models☆121Updated 2 years ago
- An analysis of all 1.3 million public Jupyter Notebooks on Github in July 2017☆72Updated 7 years ago
- Github mirror - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing)☆50Updated 2 months ago
- Generating Wikipedia article embeddings using Word2vec and reading sessions☆18Updated 8 years ago
- Tools for text tokenization and encoding☆84Updated 3 years ago
- Quantitative Text Analysis for the digitale Geisteswissenschaften☆47Updated 10 years ago
- Tools for parsing and querying Wikimedia Foundation pageview data from both static dumps and the online API.☆66Updated 3 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- System for building, visualizing, and working with LDA topic models☆97Updated 2 months ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Updated 8 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine☆73Updated 8 years ago
- (Mental) maps of texts with kernel density estimation and force-directed networks.☆109Updated 10 years ago
- 150,000 tweets from 2016's second presdential debate between Hillary Clinton and Donald Trump☆10Updated 8 years ago
- Turning news into events since 2014.☆51Updated 8 years ago
- Literature and Data - Spring 2016 Data Science Connector Course☆24Updated 11 months ago
- Github mirror - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing)☆37Updated last year
- The Python-language successor to the TABARI event-data coding software.☆45Updated 8 years ago
- PLOS Subject Area Thesaurus☆40Updated 11 months ago
- An implementation of latent Dirichlet allocation in javascript☆185Updated 3 years ago
- ☆39Updated 7 years ago
- Citation Classification using hybrid neural network model for Wikipedia References☆30Updated 2 years ago
- Bill cosponsorship networks in European parliaments.☆17Updated 8 years ago
- ☆32Updated 10 years ago
- The RICardo dataset compiles trade statistics sources of international trade bilateral flows of the 19th century.☆19Updated 2 months ago
- CSV inspection☆10Updated 2 years ago
- Adding links to full text in Wikipedia references☆37Updated 3 months ago
- Free-for-all repository of TEI and plain text files for you (to do cool stuff) provided by the Digital Collections Services group at the …☆27Updated 8 years ago
- A generic, machine learning-based revision scoring system for MediaWiki☆91Updated last year
- Parser and standardizer for politician, individual and organization names.☆129Updated 8 years ago
- A python tool for collecting tweets in mongoDB using the search API☆80Updated 2 years ago