lschmelzeisen / nasty
NASTY Advanced Search Tweet Yielder
☆50 Updated 5 years ago
Alternatives and similar repositories for nasty
Users that are interested in nasty are comparing it to the libraries listed below
- Cleans Reddit Text Data ☆84 Updated 5 years ago
- Experiments to help discussion on Wikipedia talk pages ☆68 Updated last week
- A Stylometry Library for Python ☆147 Updated 2 years ago
- A pipeline for detecting novel information about entities from a stream of text, updating a knowledge base about the entities, and genera… ☆32 Updated 6 years ago
- A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine ☆196 Updated last week
- Github mirror - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing) ☆37 Updated last year
- A library to extract a publication date from a web page, along with a measure of the accuracy. ☆41 Updated 6 years ago
- Python tools for interacting with Wikidata ☆160 Updated 2 years ago
- track changes to the news, where news is anything with an RSS feed ☆182 Updated 5 years ago
- Tutorial for using twarc, with steps for installing software. ☆25 Updated 7 years ago
- Analyze and extract Wikipedia article text and attributes and store them into an ElasticSearch index or to json files (multilingual suppo… ☆47 Updated 2 years ago
- Github mirror - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing) ☆50 Updated 6 months ago
- Tag news stories based on models trained on the NYT corpus. ☆42 Updated 2 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system. ☆47 Updated 8 years ago
- A set of utilities for processing MediaWiki XML dump data. ☆61 Updated 11 months ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data. ☆14 Updated 10 months ago
- 📝🔍 A browser extension that displays the GPT-2 Log Probability of selected text ☆112 Updated 2 years ago
- Interpretable data visualizations for understanding how texts differ at the word level ☆286 Updated 11 months ago
- Some tools to help analyze the twitter archive ☆64 Updated 7 months ago
- ☆35 Updated 2 years ago
- Python client for the Guardian API ☆73 Updated last year
- CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems ☆63 Updated last year
- Twitter conversation collection script, which collects all replies to a given tweet ☆68 Updated 10 years ago
- Hate Speech Detection Library for Python. ☆194 Updated 2 months ago
- ☆32 Updated 10 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power ☆191 Updated 2 years ago
- ☆79 Updated 7 years ago
- Estimating the age of web resources ☆97 Updated 7 months ago
- Social Media Mining Toolkit (SMMT) main repository ☆136 Updated 3 years ago
- Backend component for Hoaxy, a tool to visualize the spread of claims and fact checking ☆140 Updated 3 years ago