YugantM / textcleaner
text-data pre-processing utility
☆13 Updated 3 years ago
Alternatives and similar repositories for textcleaner
Users who are interested in textcleaner are comparing it to the libraries listed below
- This is a prototype of a multi-lingual suite for named-entity recognition in Python. ☆21 Updated last year
- TorchServe+Streamlit for easily serving your HuggingFace NER models ☆33 Updated 3 years ago
- ☆30 Updated 3 years ago
- Language detection using spaCy and fastText ☆57 Updated last year
- An example of how to use spaCy for extremely large files without running into memory issues ☆36 Updated 3 years ago
- Text processing library for sentiment analysis and related tasks ☆27 Updated 7 years ago
- A fully customisable language detection pipeline for spaCy ☆93 Updated 6 years ago
- Simple and clean Python implementation of TextRank as per seminal paper by Rada Mihalcea and Paul Tarau. This implementation performs bot… ☆11 Updated 4 years ago
- Developing a Knowledge Graph-based Question and Answering program to extract information from a huge dataset ☆95 Updated 2 years ago
- Python library for advanced text mining ☆69 Updated 5 years ago
- Lazy Profiler is a simple utility to collect CPU, GPU, RAM and GPU Memory stats while the program is running. ☆35 Updated 4 years ago
- Semantically distinct key phrase extraction using Hilbert hashes. ☆50 Updated 3 years ago
- Text classification AutoML ☆21 Updated 4 years ago
- Aho-Corasick string replacement utility ☆25 Updated 5 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod… ☆15 Updated 2 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end. ☆71 Updated 2 years ago
- Applying Snorkel to SuperGLUE ☆26 Updated 5 years ago
- Loan Risk Prediction Neural Network and API ☆17 Updated 5 years ago
- ☆14 Updated last year
- 🧬 A VS Code extension for annotating data with Prodigy ☆30 Updated 3 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆21 Updated 2 years ago
- BERT Probe: A Python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe… ☆18 Updated 3 years ago
- Exploring semantic similarities between contextualized embeddings ☆14 Updated 4 years ago
- ☆33 Updated 6 years ago
- Named entity recognition for the legal domain ☆42 Updated 4 years ago
- Annotation Management for Prodigy that supports multiple users working on many projects ☆15 Updated 6 years ago
- Deploy DL/ML inference pipelines with minimal extra code. ☆100 Updated 11 months ago
- Rust Python bindings for SymSpell ☆21 Updated last year
- NeatText: a simple NLP package for cleaning textual data and text preprocessing ☆72 Updated last year
- Large Scale BERT Distillation ☆33 Updated 2 years ago