YugantM / textcleaner
text-data pre-processing utility
☆13 · Updated 2 years ago
Alternatives and similar repositories for textcleaner:
Users who are interested in textcleaner compare it to the libraries listed below.
- Simple and clean Python implementation of TextRank as per seminal paper by Rada Mihalcea and Paul Tarau. This implementation performs bot… ☆11 · Updated 4 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod… ☆15 · Updated last year
- Use pretrained BERT model to automatically generate grammar multiple choice questions (MCQ) from any news article or story. ☆13 · Updated 5 years ago
- Text classification automl ☆21 · Updated 3 years ago
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe… ☆18 · Updated 2 years ago
- Neural Elastic Inference and Search ☆19 · Updated 5 years ago
- 🦖 Streamlined Recommender Systems with TensorFlow and KubeFlow ☆18 · Updated last year
- How to do data science with Optimus, Spark and Python. ☆19 · Updated 5 years ago
- Experimental library for sampling and validating scikit-learn parameters ☆10 · Updated 5 years ago
- Cross-lingual Fact-to-Text Alignment and Generation for Low-Resource Languages ☆9 · Updated 2 years ago
- Deep Learning and Natural Language Processing using PyTorch (O'Reilly AI - NYC, 2019) ☆11 · Updated 5 years ago
- Text processing library for sentiment analysis and related tasks ☆27 · Updated 6 years ago
- Interpretable feature construction from taxonomies for text classification ☆18 · Updated 2 years ago
- A Python package to get useful information from documents using TopicRank Algorithm. ☆16 · Updated last year
- Lazy Profiler is a simple utility to collect CPU, GPU, RAM and GPU Memory stats while the program is running. ☆35 · Updated 4 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171. ☆15 · Updated 2 years ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python. ☆21 · Updated 10 months ago
- semantically distinct key phrase extraction using hilbert hashes. ☆48 · Updated 3 years ago
- Generating Training Data Made Easy ☆43 · Updated 4 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆22 · Updated 2 years ago
- An example of how to use spaCy for extremely large files without running into memory issues ☆36 · Updated 2 years ago
- Text preprocessing tools in python. ☆26 · Updated 6 years ago
- This is a custom library for data processing, visualization and machine learning tools. ☆13 · Updated last month
- Tensorflow object detection api in single line ☆12 · Updated 3 years ago
- Source code and data for Like a Good Nearest Neighbor ☆28 · Updated last month
- Datamallet is a python library which contains several helper functions and module for the common tasks in a typical data science workflow… ☆11 · Updated 2 years ago