jlettvin / Similar
A Python canonicalizer to disambiguate and recognize known names from a poor quality data entry list.
☆20 Updated 9 years ago
Alternatives and similar repositories for Similar
Users who are interested in Similar are comparing it to the libraries listed below.
- Apify actor that opens a web page in headless Chrome and analyzes the HTML and JavaScript objects, looks for schema.org microdata and JSO… ☆153 Updated 2 years ago
- 🍻 Uses Google, Yelp, and Foursquare APIs to retrieve and rank bars ☆87 Updated 8 years ago
- E-commerce scraping and analytics platform. ☆52 Updated 9 years ago
- Tooling to extract data from scanned paper forms OCR-ed by Tesseract using the HOCR standard. ☆84 Updated 9 years ago
- Language Lego ☆142 Updated 6 years ago
- Automatic Web Article Summarizer ☆415 Updated 4 years ago
- Auto-transcribe your meetings to Slack in real time ☆156 Updated 6 years ago
- Extract postal addresses from the DOM ☆66 Updated 13 years ago
- displaCy-ent.js: An open-source named entity visualiser for the modern web ☆198 Updated 7 years ago
- A proof of concept using IBM's Speech-to-Text API to do quick-and-dirty transcriptions ☆311 Updated 9 years ago
- Python module to watch Twitter user pages or search-results. ☆64 Updated 11 years ago
- Rewriting web proxy and archival tool. At this point, it just tries to download all the things. ☆203 Updated this week
- remove signature blocks from emails ☆87 Updated 6 years ago
- It finds best synonyms from Google Books when you press a hotkey ☆30 Updated 10 years ago
- Automatically extracts and normalizes an online article or blog post publication date ☆117 Updated 2 years ago
- A repo for a blog post looking at the Edinburgh Fringe Festival jokes ☆17 Updated 4 years ago
- A set of Python Pillow Lambda functions to cover day-to-day use cases ☆13 Updated 9 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing. ☆157 Updated 2 months ago
- A fast python scikit-learn text sentiment API server. ☆89 Updated 10 years ago
- Mechanical Turk on your own machine. ☆208 Updated last year
- online natural language processing with word vectors ☆310 Updated last year
- A python script for summarizing articles using nltk ☆546 Updated 9 years ago
- Automatic Item List Extraction ☆87 Updated 9 years ago
- A library for extracting tables from PDF files ☆92 Updated 5 years ago
- Twitter bot generating invented words and definitions using RNN + genetic algorithm ☆131 Updated 9 years ago
- Python library with common functionality for writing web scrapers ☆102 Updated 10 years ago
- A microservice for archiving the news. ☆165 Updated 9 years ago
- Open Source implementation of Summly ☆47 Updated 8 years ago
- Aviation grade news article metadata extraction ☆36 Updated 2 years ago
- The smart and simple way to automate document assembly ☆408 Updated 7 years ago