mailgun / forgeLinks
email dataset for email signature parsing
☆55Updated 9 years ago
Alternatives and similar repositories for forge
Users that are interested in forge are comparing it to the libraries listed below
Sorting:
- remove signature blocks from emails☆86Updated 6 years ago
- Fuzzy Categorical Distances☆14Updated 5 years ago
- Traptor -- A distributed Twitter feed☆26Updated 2 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…☆48Updated 3 years ago
- Python library to infer date format from examples☆43Updated 3 years ago
- S3 Backups provides easy scripts that system administrators can use to backup data from programs likes PostgreSQL, MySQL, Redis, etc.☆67Updated 7 years ago
- A Cython implementation of the affine gap string distance☆57Updated 2 years ago
- Python bindings to the Compact Language Detector☆33Updated 5 years ago
- Detect and classify pagination links☆15Updated 4 years ago
- Web page segmentation and noise removal☆55Updated last year
- A simple algorithm for clustering web pages, suitable for crawlers☆34Updated 8 years ago
- Script to rotate webserver log file to AWS S3☆29Updated 10 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆170Updated 3 years ago
- Python search module for fast approximate string matching☆54Updated 2 years ago
- A python library detect and extract listing data from HTML page.☆108Updated 8 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- Algorithms for URL Classification☆19Updated 10 years ago
- Hidden alignment conditional random field for classifying string pairs.☆24Updated last week
- Server/Client around Spacy to load spacy only once☆46Updated 7 years ago
- A visualisation tool for Spacy using Hierplane.☆65Updated 2 years ago
- Semanticizest: dump parser and client☆20Updated 9 years ago
- Python client library for SeatGeek's Sixpack A/B testing framework☆40Updated 2 years ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe…☆68Updated 2 years ago
- Demo code for learning_text_transformer☆25Updated 10 years ago
- Lightning Fast Language Prediction 🚀☆167Updated 6 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆152Updated 5 months ago
- WordNet Domains, WordNet Affect and SentiWords☆48Updated 9 years ago
- ☆70Updated 2 years ago
- Levenshtein and Hamming distance computation☆116Updated 5 years ago
- A compound word splitter for Python☆48Updated 3 years ago