mailgun / forgeLinks
email dataset for email signature parsing
β55Updated 9 years ago
Alternatives and similar repositories for forge
Users that are interested in forge are comparing it to the libraries listed below
Sorting:
- Lightning Fast Language Prediction πβ167Updated 6 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fiβ¦β48Updated 3 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic feβ¦β170Updated 3 years ago
- remove signature blocks from emailsβ86Updated 6 years ago
- Language detection extension for spaCy 2.0+β113Updated 6 years ago
- Web page segmentation and noise removalβ55Updated last year
- Python search module for fast approximate string matchingβ54Updated 2 years ago
- A python library detect and extract listing data from HTML page.β108Updated 8 years ago
- Email reply parser library for Pythonβ509Updated last year
- Textpipe: clean and extract metadata from textβ302Updated 4 years ago
- Hidden alignment conditional random field for classifying string pairs.β24Updated last week
- Server/Client around Spacy to load spacy only onceβ46Updated 7 years ago
- Natural language generation languageβ56Updated 6 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.β153Updated 2 weeks ago
- Find which links on a web page are pagination linksβ29Updated 8 years ago
- A tool for learning significant phrase/term models, and efficiently labeling with them.β33Updated 3 months ago
- A Python implementation of the Metaphone and Double Metaphone algorithmsβ81Updated last year
- Modularly extensible semantic metadata validatorβ84Updated 9 years ago
- Traptor -- A distributed Twitter feedβ26Updated 2 years ago
- Spam filtering made easy for youβ142Updated 5 years ago
- A spell-checker extending Peter Norvig's with multi-typo correction, hamming distance weighting, and more.β98Updated 4 years ago
- π« REST microservices for various spaCy-related tasksβ240Updated 3 years ago
- WordNet Domains, WordNet Affect and SentiWordsβ48Updated 9 years ago
- A Python library for extracting titles, images, descriptions and canonical urls from HTML.β151Updated 5 years ago
- Skinfer is a tool for inferring and merging JSON schemasβ139Updated last year
- spaCy pipeline component for adding text readability meta data to Doc objects.β56Updated 6 years ago
- Relatively simple text classification powered by spaCyβ41Updated 9 years ago
- NER toolkit for HTML dataβ259Updated last year
- An automated ingestion service for blogs to construct a corpus for NLP research.β86Updated 7 years ago
- With alexafsm, developers can model dialog agents with first-class concepts such as states, attributes, transition, and actions. alexafsmβ¦β111Updated 2 years ago