harmening / signature_extractionLinks
💬NLP - Library for splitting email content into a human-written body and an automatically appended signature.
☆26Updated 6 years ago
Alternatives and similar repositories for signature_extraction
Users that are interested in signature_extraction are comparing it to the libraries listed below
Sorting:
- email dataset for email signature parsing☆55Updated 9 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆57Updated last year
- A simple machine learning package to cluster keywords in higher-level groups.☆16Updated 2 years ago
- Text summarization using spacy☆22Updated 2 years ago
- ☆16Updated 4 years ago
- [archived]☆18Updated 3 years ago
- Building a Job Dataset☆22Updated 3 years ago
- ☆16Updated 7 years ago
- remove signature blocks from emails☆86Updated 6 years ago
- https://duyet.github.io/related-skills-visualization/index.html☆11Updated 4 years ago
- Now included in rigour☆151Updated last month
- A collection of libraries to simplify interacting with Google's Advanced Services and 3rd party APIs in Google Apps Script.☆16Updated 3 years ago
- ReconNER, Debug annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality of your data.☆35Updated 4 years ago
- This is a proof-of-concept of using an LLM to find and extract meaningful data without parsing the html too much.☆29Updated 2 years ago
- Hidden alignment conditional random field for classifying string pairs.☆24Updated last week
- Grounding LLMs in truth with under 30 lines of code.☆20Updated last year
- Text analysis for automatic bookmarking/keyword extraction☆18Updated 8 years ago
- Manage google drive files made easy☆11Updated 10 months ago
- Apache Tika Server with Tesseract 4 Docker Setup☆23Updated 4 years ago
- Extracts synonyms for various terms, exploiting the redirects between terms in Wikipedia☆12Updated 6 years ago
- ProxyCrawl Python library for scraping and crawling☆59Updated last year
- classify a job description (or noisy job title) into a ONET job title☆19Updated 8 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- LexPredict ContraxSuite document samples☆23Updated 7 years ago
- Phantombuster's SDK☆14Updated 8 months ago
- Parsing resumes in a PDF format from linkedIn☆68Updated 8 years ago
- A component that tries to avoid downloading duplicate content☆27Updated 7 years ago
- Algorithms for "schema matching"☆26Updated 8 years ago
- URL articles text summarizer using Web Crawling and NLP (written in Python)☆48Updated 4 years ago
- An index data structure for approximate string search.☆23Updated 6 years ago