jlettvin / SimilarLinks
A Python canonicalizer to disambiguate and recognize known names from a poor quality data entry list.
☆20Updated 9 years ago
Alternatives and similar repositories for Similar
Users that are interested in Similar are comparing it to the libraries listed below
Sorting:
- Apify actor that opens a web page in headless Chrome and analyzes the HTML and JavaScript objects, looks for schema.org microdata and JSO…☆151Updated 2 years ago
- Auto-transcribe your meetings to Slack in real time☆155Updated 5 years ago
- A proof of concept using IBM's Speech-to-Text API to do quick-and-dirty transcriptions☆312Updated 9 years ago
- Extract postal addresses from the DOM☆66Updated 13 years ago
- Searching for the occurrence seconds of words/phrases or arbitrary regex patterns within audio files☆102Updated 4 years ago
- Language Lego☆141Updated 5 years ago
- Automatic Web Article Summarizer☆417Updated 3 years ago
- Wake up to a chorus of birds in this alarm clock developed by Carnegie Museums of Pittsburgh.☆67Updated 7 years ago
- conceptnet 4 bridge☆71Updated 10 years ago
- E-commerce scraping and analytics platform.☆53Updated 9 years ago
- The missing datasets manager. Like hombrew but for datasets. CLI-tool for search and discover datasets!☆41Updated 8 years ago
- A repo for a blog post looking at the Edinburgh Fringe Festival jokes☆17Updated 4 years ago
- Speed up your Localization / Internationalization efforts by automating translation with a single script☆27Updated 8 years ago
- 🍻Uses Google, Yelp, and Foursquare APIs to retrieve and rank bars☆87Updated 8 years ago
- Automatically extracts and normalizes an online article or blog post publication date☆117Updated 2 years ago
- Automatic text summarization☆243Updated 6 years ago
- A python library detect and extract listing data from HTML page.☆108Updated 8 years ago
- A python script for summarizing articles using nltk☆545Updated 9 years ago
- This is a simple application for scraping and parsing food recipe data found on the web in hRecipe format, producing results in json☆111Updated 5 years ago
- Train your own Natural Language Processor from a browser 🤖 (Prototype)☆173Updated 2 years ago
- A trend viewer written in Python/JavaScript☆21Updated 9 months ago
- It finds best synonyms from Google Books when you press a hotkey☆30Updated 10 years ago
- remove signature blocks from emails☆86Updated 6 years ago
- A very naive classifier to figure out if a sentence contains dirty words☆33Updated 10 years ago
- Reworked https://www.readability.com/ parsing library (now https://mercury.postlight.com/ is living alternative)☆204Updated last year
- Tooling to extract data from scanned paper forms OCR-ed by Tesseract using the HOCR standard.☆84Updated 9 years ago
- Nodejs text sumarization☆54Updated 11 years ago
- A fast python scikit-learn text sentiment API server.☆89Updated 9 years ago
- Compares two XML documents by diffing their text.☆42Updated last year
- Repository for PyCon 2016 workshop Natural Language Processing in 10 Lines of Code☆240Updated 8 years ago