sybrenjansen / text-scrubberLinks
Python package that offers text scrubbing functionality, providing building blocks for string cleaning as well as normalizing geographical text (countries/states/cities)
☆22Updated last year
Alternatives and similar repositories for text-scrubber
Users that are interested in text-scrubber are comparing it to the libraries listed below
Sorting:
- Custom Python functions for working with SQLite FTS4☆23Updated 3 years ago
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want)☆68Updated 2 years ago
- Minimal State Machine☆24Updated 4 years ago
- Python Simple Object Storage - provides a list and dictionary interface that seamlessly stores data in a file, like a simplified database…☆59Updated 2 years ago
- Declarative layer for your database.☆37Updated 2 years ago
- Elemental makes Selenium automation faster and easier.☆36Updated 2 years ago
- AsyncIO serving for data science models☆24Updated 3 years ago
- Sitemap generation for Python ASGI web apps☆24Updated last year
- Data encryption at rest and IAM for Python☆50Updated this week
- 🏗️ Create APIs from CSV files within seconds, using fastapi☆79Updated 4 years ago
- Versatile Metrics Collection for Python☆20Updated last month
- AlgoTree☆16Updated this week
- PyNLP Lib is an open source Python NLP library that provides functionality for both web and local development☆50Updated 3 years ago
- Declare multi-table rules for SQLAlchemy update logic -- 40X more concise, Python for extensibility.☆48Updated 3 months ago
- Python library to infer date format from examples☆45Updated 4 years ago
- Pydantic-based HTTP forms☆18Updated 7 months ago
- Language detection using Spacy and Fasttext☆57Updated 2 years ago
- A library for performing hyperparameter optimization☆63Updated last week
- The missing Python utility to read and write compressed JSONs.☆42Updated last year
- ☆70Updated 3 years ago
- Kubetools is a tool and processes for developing and deploying microservices to Kubernetes.☆15Updated last month
- A fast native implementation of diff algorithm with a pure Python fallback☆39Updated 3 years ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe…☆68Updated 3 years ago
- A python module that will check for package updates.☆30Updated 4 years ago
- Simple Python code metering library☆31Updated 4 years ago
- Efficient string matching with regular expressions☆146Updated 2 weeks ago
- A Python implementation of Lunr.js 🌖☆203Updated 10 months ago
- ⇔ IterTable is a Pythonic API for iterating through tabular data formats, including CSV, XLSX, XML, and JSON.☆53Updated 2 years ago
- A lightweight, event-driven, pipeline framework.☆62Updated last year
- A task queue library for Python and Redis☆36Updated 3 years ago