jlettvin / SimilarLinks
A Python canonicalizer to disambiguate and recognize known names from a poor quality data entry list.
☆20Updated 9 years ago
Alternatives and similar repositories for Similar
Users that are interested in Similar are comparing it to the libraries listed below
Sorting:
- Apify actor that opens a web page in headless Chrome and analyzes the HTML and JavaScript objects, looks for schema.org microdata and JSO…☆153Updated 2 years ago
- Automatic Web Article Summarizer☆416Updated 4 years ago
- Extract postal addresses from the DOM☆66Updated 13 years ago
- Auto-transcribe your meetings to Slack in real time☆156Updated 6 years ago
- Aviation grade news article metadata extraction☆36Updated 2 years ago
- Automatically extracts and normalizes an online article or blog post publication date☆117Updated 2 years ago
- Skinfer is a tool for inferring and merging JSON schemas☆141Updated last year
- Searching for the occurrence seconds of words/phrases or arbitrary regex patterns within audio files☆102Updated 5 years ago
- A proof of concept using IBM's Speech-to-Text API to do quick-and-dirty transcriptions☆311Updated 9 years ago
- 🍻Uses Google, Yelp, and Foursquare APIs to retrieve and rank bars☆87Updated 8 years ago
- A microservice for archiving the news.☆165Updated 9 years ago
- Automatic text summarization☆243Updated 7 years ago
- A scraping command line tool for the modern web☆259Updated 9 years ago
- It finds best synonyms from Google Books when you press a hotkey☆30Updated 11 years ago
- Algorithmic summarizer for RSS/Atom Feeds, Web Urls and arbitrary text. Codebase for the application deployed at http://tldrzr.herokuapp.…☆53Updated 9 years ago
- Get semantic HTML from PDFs, recover lost text, tables, data... in bulk.☆35Updated last year
- Notetaking Electron app that can answer your questions and makes summaries for you☆90Updated 3 years ago
- Language Lego☆143Updated 6 years ago
- Rewriting web proxy and archival tool. At this point, it just tries to download all the things.☆203Updated last week
- A program to convert USDA nutrient database into various formats☆93Updated 9 years ago
- A company/project name generator for Python. Uses NLTK and diverse techniques derived from existing corporate etymologies and naming agen…☆50Updated 8 years ago
- The smart and simple way to automate document assembly☆408Updated 7 years ago
- A framework for creating semi-automatic web content extractors☆502Updated this week
- Data Pipes for CSV☆115Updated 2 years ago
- Wake up to a chorus of birds in this alarm clock developed by Carnegie Museums of Pittsburgh.☆66Updated 7 years ago
- Python binding to libpoppler with focus on text extraction☆97Updated 4 years ago
- Creates github index for similar repositories discovery☆192Updated 9 years ago
- Train your own Natural Language Processor from a browser 🤖 (Prototype)☆174Updated 2 years ago
- A python library detect and extract listing data from HTML page.☆108Updated 8 years ago
- next generation web crawling using machine intelligence☆332Updated 2 years ago