YugantM / textcleanerLinks
text-data pre-processing utility
☆13Updated 3 years ago
Alternatives and similar repositories for textcleaner
Users that are interested in textcleaner are comparing it to the libraries listed below
Sorting:
- semantically distinct key phrase extraction using hilbert hashes.☆50Updated 3 years ago
- Text processing library for sentiment analysis and related tasks☆27Updated 7 years ago
- Language detection using Spacy and Fasttext☆57Updated last year
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated 2 years ago
- A fully customisable language detection pipeline for spaCy☆93Updated 6 years ago
- Text classification automl☆21Updated 4 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆33Updated 3 years ago
- A streamlit component to embed Disqus in your applications.☆10Updated 4 years ago
- Library for converting from RGB / GrayScale image to base64 and back.☆19Updated 3 years ago
- ☆30Updated 3 years ago
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f…☆24Updated 4 years ago
- Presents an optimized Apache Beam pipeline for generating sentence embeddings (runnable on Cloud Dataflow).☆20Updated 3 years ago
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe…☆18Updated 3 years ago
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.☆27Updated 4 years ago
- Python library for advanced text mining☆69Updated 5 years ago
- Simple and clean Python implementation of TextRank as per seminal paper by Rada Mihalcea and Paul Tarau. This implementation performs bot…☆11Updated 4 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 3 years ago
- A comprehensive tool for linguistic analysis of communities☆49Updated 4 years ago
- Deploy Pytorch models to production via panini☆10Updated 6 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆21Updated 2 years ago
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborations☆14Updated 3 years ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.☆21Updated last year
- ☆21Updated 3 years ago
- Large Scale BERT Distillation☆33Updated 2 years ago
- Lazy Profiler is a simple utility to collect CPU, GPU, RAM and GPU Memory stats while the program is running.☆35Updated 4 years ago
- Contains Colab Notebooks show cool use-cases of different GCP ML APIs.☆10Updated 5 years ago
- Developing a Knowledge Graph-based Question and Answering program to extract information from huge dataset☆95Updated 2 years ago
- ☆14Updated last year
- Named entity recognition for the legal domain☆42Updated 4 years ago
- ☆26Updated 2 years ago