rliebz / whoswho
Name comparison in python
☆51Updated 5 years ago
Related projects: ⓘ
- Change detection with a simple Python script to email you whenever a website changes.☆57Updated last year
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆42Updated 5 years ago
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.☆142Updated 7 months ago
- Pixel Safe Encryption - Now Cryptographically Secure☆56Updated 8 months ago
- A helper library full of URL-related heuristics.☆56Updated 2 weeks ago
- Tag news stories based on models trained on the NYT corpus.☆39Updated last year
- ARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of diff…☆87Updated 2 years ago
- Predict age and gender from a first name☆60Updated 5 years ago
- Find strings/words in text; convenience and C speed☆125Updated 2 years ago
- A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine☆158Updated last week
- A Python-based sleep tracker and acoustic brain stimulator built on a Raspberry Pi 💤🔊🧠〰️☆30Updated 3 years ago
- Count the number of matches for a regex string in a subreddit☆11Updated 4 years ago
- Tool for real-time scraping of news articles.☆39Updated 4 years ago
- Lightning Fast Language Prediction 🚀☆163Updated 5 years ago
- Emoji data from Emojipedia☆47Updated 4 years ago
- Predicts likes, comment or total interactions of a facebook page post using machine learning☆10Updated 6 years ago
- Create an animated heatmap from a Google location data Takeout export☆27Updated 2 years ago
- Abydos NLP/IR library for Python☆180Updated last year
- Extract networks of entities from journalistic reporting☆46Updated last year
- Scrapers for disaster data - writes to https://github.com/simonw/disaster-data☆49Updated 7 months ago
- A simple command line interface to the datamade/dedupe library.☆42Updated last year
- The tastiest machine learning project. Can we predict who is speaking for how long during an episode of the syntax.fm podcast?☆36Updated 5 years ago
- Binary Python bindings for poppler utils for content extraction☆42Updated 3 years ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.☆13Updated last year
- A spell-checker extending Peter Norvig's with multi-typo correction, hamming distance weighting, and more.☆97Updated 3 years ago
- Scraping Assisted by Learning☆35Updated last week
- Alternative robots parser module for Python☆16Updated this week
- A Server Side Event stream to deliver Reddit comments and submissions in near real-time to a client.☆47Updated 2 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆144Updated 8 months ago
- Extracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence☆62Updated 11 months ago