mailgun / forgeLinks
email dataset for email signature parsing
β55Updated 9 years ago
Alternatives and similar repositories for forge
Users that are interested in forge are comparing it to the libraries listed below
Sorting:
- Lightning Fast Language Prediction πβ167Updated last week
- Web page segmentation and noise removalβ55Updated last year
- remove signature blocks from emailsβ86Updated 6 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic feβ¦β170Updated 3 years ago
- Parse, normalize and render postal addresses.β185Updated last year
- Skinfer is a tool for inferring and merging JSON schemasβ139Updated last year
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fiβ¦β48Updated 3 years ago
- Traptor -- A distributed Twitter feedβ26Updated 2 years ago
- Python library to infer date format from examplesβ45Updated 3 years ago
- Language detection extension for spaCy 2.0+β113Updated 6 years ago
- Hidden alignment conditional random field for classifying string pairs.β24Updated last week
- β70Updated 2 years ago
- Text classification using Naive Bayes and Elasticsearchβ154Updated 9 years ago
- Supervised learning for novelty detection in textβ78Updated 8 years ago
- Server/Client around Spacy to load spacy only onceβ46Updated 7 years ago
- A Cython implementation of the affine gap string distanceβ57Updated 2 years ago
- A simple fuzzy matching set for python stringsβ229Updated last year
- A python library detect and extract listing data from HTML page.β108Updated 8 years ago
- Demo code for learning_text_transformerβ25Updated 10 years ago
- An attempt at creating a silver/gold standard dataset for backtesting yesterday & today's content-extractorsβ35Updated 10 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.β153Updated last month
- Spam filtering made easy for youβ144Updated 5 years ago
- A disk-based key/value store in Python with no dependencies.β21Updated 10 years ago
- Extract, parse and populate templates from stringsβ27Updated 6 years ago
- Frontera backend to guide a crawl using PageRank, HITS or other ranking algorithms based on the link structure of the web graph, even wheβ¦β55Updated last year
- Python search module for fast approximate string matchingβ54Updated 2 years ago
- S3 Backups provides easy scripts that system administrators can use to backup data from programs likes PostgreSQL, MySQL, Redis, etc.β67Updated 7 years ago
- An automated ingestion service for blogs to construct a corpus for NLP research.β86Updated 7 years ago
- Python bindings to the Compact Language Detectorβ33Updated 5 years ago
- utils to use word embedding models like word2vec vectors in a PostgreSQL databaseβ144Updated 3 years ago