kemingy / Plane
A text processing tool including tag(HTML, URL, Email) extraction and removing, punctuation normalization, simple segmentation, and so on.
☆11Updated 3 months ago
Alternatives and similar repositories for Plane:
Users that are interested in Plane are comparing it to the libraries listed below
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆25Updated 4 years ago
- Spell correct entire sentences using nltk freqdist and symspell☆19Updated 7 years ago
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values …☆15Updated 6 years ago
- Set-oriented Operations in Pandas☆24Updated 4 years ago
- classify a job description (or noisy job title) into a ONET job title☆19Updated 8 years ago
- A set of NLP tools created during my medium NLP Explanation series.☆31Updated 10 months ago
- A Scalable Data Cleaning Library for PySpark.☆26Updated 5 years ago
- A tutorial for basic data analysis with Pandas and Python. Designed to help people move from Excel to Pandas. Uses an SEO example.☆17Updated 6 years ago
- ☆14Updated last year
- Document Image Classification☆11Updated 6 years ago
- bamboolib - template for creating your own binder notebook☆21Updated 3 years ago
- Aho-Corasick string replacement utility☆24Updated 5 years ago
- An implementation of bidirectional LSTM-CRF for Named Entity Relationship on custom corpus with custom word embeddings☆13Updated 5 years ago
- SpacyV3 Text Categorizer Tutorial☆17Updated 4 years ago
- Simple and clean Python implementation of TextRank as per seminal paper by Rada Mihalcea and Paul Tarau. This implementation performs bot…☆11Updated 4 years ago
- Python module for Named Entity Recognition (NER) using natural language processing.☆13Updated 3 years ago
- Word2Vec encodings based search engine for Stackoverflow questions☆26Updated last year
- Pre-built Scrapy spiders for AutoExtract☆19Updated 11 months ago
- This is an Object Oriented implementation of a Trie in python. The class contains setter and getter methods, and implements several usefu…☆14Updated 7 years ago
- Simple in memory data cache designed for ML applications. Built using Redis and Apache Arrow's Plasma in-memory store☆10Updated 4 years ago
- Neural Elastic Inference and Search☆19Updated 5 years ago
- [Tutorial] - Applying Word2Vec technique to Recommendation System a.k.a Item2Vec a.k.a Prod2Vec☆9Updated 3 years ago
- ☆16Updated 7 years ago
- Flask-based application using MySQL, MongoDB and Neo4j for storing video data and provides interface to search video and show related vid…☆10Updated 7 years ago
- ☆12Updated last year
- Techniques for Scraping the Web in Python☆26Updated 6 years ago
- This repository for Web Crawling, Information Extraction, and Knowledge Graph build up.☆33Updated 6 years ago
- This repository contains the code of the Rasa workshop at PyData NYC 2018☆12Updated 6 years ago
- A few end to end examples that use data-describe☆16Updated last year