harmening / signature_extraction
π¬NLP - Library for splitting email content into a human-written body and an automatically appended signature.
β25Updated 6 years ago
Alternatives and similar repositories for signature_extraction:
Users that are interested in signature_extraction are comparing it to the libraries listed below
- email dataset for email signature parsingβ55Updated 8 years ago
- Compound AI toolchain for fast and accurate entity matching, powered by LLMs.β21Updated this week
- A library to extract a publication date from a web page, along with a measure of the accuracy.β41Updated 5 years ago
- Integrate Watson Studio and Watson Campaign Automation to tailor your target audience for effective campaignsβ12Updated 3 years ago
- Convert english sentences to cypherβ32Updated 4 years ago
- A text processing tool including tag(HTML, URL, Email) extraction and removing, punctuation normalization, simple segmentation, and so onβ¦β11Updated 2 months ago
- Find rss, atom, xml, and rdf feeds on webpagesβ30Updated 4 months ago
- Building a Job Datasetβ21Updated 2 years ago
- Create a music review RAG application with Neo4jβ19Updated 11 months ago
- Scrape various open data directories to create an index of what's available out thereβ36Updated last week
- Pre-built Scrapy spiders for AutoExtractβ19Updated 9 months ago
- Blazing fast fuzzy text search for Python.β42Updated last month
- get structured output from LLM'sβ32Updated last year
- Python tool to turn SQL Database Schemas into ChatGPT Promptsβ14Updated last year
- clustering news, extract trending news storiesβ12Updated 3 years ago
- Train a model, and detect gibberish strings with it.β60Updated 3 years ago
- An analysis of abilities, skills and tech skills data from the O*NET database as well as classification of around 500 random LinkedIn jobβ¦β18Updated 4 years ago
- A distributed system for mining common crawl using SQS, AWS-EC2 and S3β18Updated 10 years ago
- LLM plugin for embeddings using sentence-transformersβ48Updated last week
- Seamless HTML table extraction for Pythonβ20Updated 8 years ago
- This repository contains an implementation of a US address parser built using spaCy NLP library.β37Updated last year
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trendsβ55Updated last year
- Quora Question Scraper - Find & Export relevant Questions 10x fasterβ16Updated 5 years ago
- Text analysis for automatic bookmarking/keyword extractionβ18Updated 8 years ago
- This program categorizes a given query's "search intent" via the kinds of SERP features present for the query.β23Updated 5 years ago
- Crawl sites for RSS, Atom, and JSON feeds.β69Updated 8 months ago
- A Python package to get useful information from documents using TopicRank Algorithm.β16Updated last year
- extract difference between two html pagesβ32Updated 6 years ago
- A News Article Collection Libraryβ22Updated last year
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.β55Updated 2 months ago