kemingy / Plane
A text processing tool that covers tag (HTML, URL, email) extraction and removal, punctuation normalization, simple segmentation, and so on.
☆11 · Updated last month
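To make the description above concrete, here is a minimal sketch of what tag extraction and removal, punctuation normalization, and simple segmentation can look like using only the Python standard library. It is not Plane's API: the patterns, the function names (`extract`, `remove`, `normalize_punctuation`, `segment`), and the punctuation table are all assumptions chosen for illustration, and the regular expressions are deliberately simplified.

```python
import re

# Hypothetical, simplified patterns for illustration only (not Plane's rules).
HTML_TAG = re.compile(r"<[^>]+>")
URL = re.compile(r"https?://[^\s<>\"]+")
EMAIL = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")

# Assumed mapping of a few full-width punctuation marks to ASCII equivalents.
PUNCT_TABLE = str.maketrans(
    {"，": ",", "。": ".", "！": "!", "？": "?", "：": ":", "；": ";"}
)


def extract(pattern, text):
    """Return every substring of `text` matched by `pattern`."""
    return pattern.findall(text)


def remove(pattern, text, repl=" "):
    """Replace every match of `pattern` in `text` with `repl`."""
    return pattern.sub(repl, text)


def normalize_punctuation(text):
    """Map the full-width punctuation in PUNCT_TABLE to ASCII."""
    return text.translate(PUNCT_TABLE)


def segment(text):
    """Very simple segmentation: split on runs of whitespace."""
    return text.split()


sample = "<p>Mail bob@example.com or see https://example.com/docs now！</p>"

# Extract first, while the tags are still present in the text.
emails = extract(EMAIL, sample)
urls = extract(URL, sample)

# Then strip tags, normalize punctuation, and segment what is left.
clean = sample
for pattern in (HTML_TAG, URL, EMAIL):
    clean = remove(pattern, clean)
tokens = segment(normalize_punctuation(clean))

print(emails, urls, tokens)
```

Extraction runs before removal so that the matched spans can be kept alongside the cleaned text; a real tool would need far more careful patterns than these.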
Alternatives and similar repositories for Plane:
Users who are interested in Plane are comparing it to the libraries listed below.
- Python package for performing deduplication using flexible text matching and cleaning in pandas dataframes ☆25 · Updated 4 years ago
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values … ☆15 · Updated 6 years ago
- An open-source NLP library: fast text cleaning and preprocessing ☆23 · Updated 3 years ago
- spaCy v3 Text Categorizer Tutorial ☆17 · Updated 4 years ago
- Spell correct entire sentences using NLTK FreqDist and SymSpell ☆19 · Updated 7 years ago
- Simple and clean Python implementation of TextRank as per seminal paper by Rada Mihalcea and Paul Tarau. This implementation performs bot… ☆11 · Updated 3 years ago
- An implementation of bidirectional LSTM-CRF for Named Entity Relationship on custom corpus with custom word embeddings ☆13 · Updated 5 years ago
- A set of NLP tools created during my Medium NLP Explanation series. ☆31 · Updated 8 months ago
- Document Image Classification ☆11 · Updated 6 years ago
- Efficiently search the most similar strings against the query in Python. ☆18 · Updated 6 years ago
- ☆14 · Updated last year
- A tool that is built using several open source services and uses OpenAI's GPT-2 as a base model. ☆4 · Updated 2 years ago
- Neural Elastic Inference and Search ☆19 · Updated 5 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossmann Sales data from Kaggle ☆33 · Updated 8 years ago
- Parse an email to get properties like salutation, body, signature, reply. ☆44 · Updated last year
- Two Python classes that facilitate scraping of Instagram posts and graph modelling of hashtag data ☆30 · Updated 4 years ago
- Code examples for Google Natural Language API. ☆13 · Updated 5 years ago
- Rust-based Python wrapper for duckling library in Haskell ☆25 · Updated 4 years ago
- A bidirectional LSTM example for sequence labeling. ☆13 · Updated 6 years ago
- Sequence tagging with spaCy and crfsuite ☆18 · Updated last year
- Classify a job description (or noisy job title) into an ONET job title ☆18 · Updated 8 years ago
- 🦖 Streamlined Recommender Systems with TensorFlow and KubeFlow ☆18 · Updated last year
- Extract dates from text ☆64 · Updated 3 years ago
- Simple dashboard for getting currently trending hashtags and topics on Twitter ☆26 · Updated last year
- ☆69 · Updated 3 years ago
- This repository contains the code of the Rasa workshop at PyData NYC 2018 ☆12 · Updated 6 years ago
- Building an API with the FastAPI framework to serve a scikit-learn model. ☆18 · Updated 6 years ago
- SEMRush SERP Tutorial. Using advertools to Extract and Analyze Search Engine Results Pages Data ☆14 · Updated 6 years ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz… ☆40 · Updated 5 years ago
- Simple scripts to generate and use an Annoy index and lmdb map ☆28 · Updated 7 years ago