Trindaz / EFZP
Parse an email to get properties like salutation, body, signature, reply.
☆44Updated last year
Alternatives and similar repositories for EFZP:
Users that are interested in EFZP are comparing it to the libraries listed below
- A text processing tool including tag(HTML, URL, Email) extraction and removing, punctuation normalization, simple segmentation, and so on…☆11Updated 2 months ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆25Updated 4 years ago
- Fast, lightweight Python database toolkit for SQLite, built with Cython.☆42Updated last year
- remove signature blocks from emails☆86Updated 5 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…☆48Updated 3 years ago
- A Python module for extracting relevant tags from text documents.☆15Updated 13 years ago
- JupyterLite as a Datasette plugin☆11Updated 3 years ago
- Scrapy with Headless Selenium, for scraping interactive web pages☆10Updated 2 years ago
- 💬NLP - Library for splitting email content into a human-written body and an automatically appended signature.☆25Updated 6 years ago
- KnowledgeRepo + JupyterLab☆48Updated 3 months ago
- A spell-checker extending Peter Norvig's with multi-typo correction, hamming distance weighting, and more.☆98Updated 4 years ago
- A distributed system for mining common crawl using SQS, AWS-EC2 and S3☆18Updated 10 years ago
- A simple algorithm for clustering web pages, suitable for crawlers☆34Updated 7 years ago
- A component that tries to avoid downloading duplicate content☆27Updated 6 years ago
- Notebooks which will provide a demo of Qgrid functionality☆20Updated 5 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆56Updated last year
- Python interface to IMDb plain-text data files☆41Updated 7 years ago
- Server/Client around Spacy to load spacy only once☆46Updated 7 years ago
- 🤖 Slack slash command that turns a link to an article into a readable post within a channel☆10Updated 2 years ago
- Parsing resumes in a PDF format from linkedIn☆68Updated 8 years ago
- Street address parser and formatter☆91Updated 5 years ago
- Python library to infer date format from examples☆42Updated 3 years ago
- ☆12Updated 8 years ago
- Utilities for dealing with URIs, invented and maintained by Yelp.☆14Updated last year
- This is the HeadQuarters of my digital info. HPI library got me inspired and I'm trying to play with the idea on a smaller scale for myse…☆20Updated last year
- Markdown -> IPython conversion tool☆15Updated 10 years ago
- Path utilities for Python☆48Updated last year
- Restrict crawl and scraping scope using matchers.☆25Updated 8 years ago
- email dataset for email signature parsing☆55Updated 8 years ago
- An implementation of the multi-armed bandit optimization pattern as a Flask extension☆81Updated last week