NickMonzillo / EmailParser
Stores email header and body information in JSON format
☆13Updated 8 years ago
Related projects: ⓘ
- extract difference between two html pages☆32Updated 6 years ago
- Extract Social Profiles using Email Addresses (Python)☆14Updated 6 years ago
- https://mimesniff.spec.whatwg.org/ implementation for Python☆14Updated 8 months ago
- Extract social media links and account names from websites.☆36Updated 4 years ago
- Processes data from images which are tagged with the specified Instagram tag.☆13Updated 10 years ago
- Collect email addresses by crawling search engine results.☆29Updated last year
- Streaming web crawler with WebSocket API☆44Updated last year
- A component that tries to avoid downloading duplicate content☆27Updated 6 years ago
- Show summary of a large number of URLs in a Jupyter Notebook☆17Updated 3 years ago
- A generic crawler☆78Updated 6 years ago
- Paginating the web☆37Updated 10 years ago
- Verify emails with python!☆36Updated 12 years ago
- A python tool to extract data types such as email, URL, domains and phone numbers.☆36Updated 11 years ago
- Simple Python 3 web crawler☆13Updated 4 years ago
- Lightweight library that converts a HTML webpage to JSON data using a template defined in JSON.☆21Updated 3 years ago
- Scrapy middleware which allows to crawl only new content☆79Updated last year
- API client for Aleph, supports bulk entity and document upload.☆27Updated last month
- Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations☆40Updated 4 months ago
- Pluggable DSL that uses pipes to perform a series of linear transformations to extract data☆15Updated 2 months ago
- List of Sanctions and Most wanted☆26Updated 7 years ago
- Tools to easy generate RSS feed that contains each scraped item using Scrapy framework.☆30Updated 4 months ago
- Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.☆54Updated last month
- PST Parser using pypff - Export all email headers and body to csv or json☆9Updated 4 years ago
- Broad crawler for domain discovery☆19Updated 6 years ago
- Scrape various open data directories to create an index of what's available out there☆29Updated this week
- Simple Web UI for Scrapy spider management via Scrapyd☆49Updated 6 years ago
- Python library for modern thread / multiprocessing pooling and task processing via asyncio☆15Updated 3 years ago
- ☆29Updated 3 years ago
- A simple DuckDuckGo URL scraper.☆23Updated 7 months ago
- Python, Tor, Stem, Privoxy: with this tools, allow requests new connections via Tor for obtain new IP addresses.☆24Updated 5 years ago