Clearbox-AI / nerpii
A Python library to perform NER on structured data and generate PII with Faker
☆28Updated 7 months ago
Alternatives and similar repositories for nerpii:
Users that are interested in nerpii are comparing it to the libraries listed below
- A Python library to check for data quality and automatically generate data tests.☆43Updated last year
- Frouros: an open-source Python library for drift detection in machine learning systems.☆205Updated last week
- Python package for deduplication/entity resolution using active learning☆78Updated 4 months ago
- Knowledge pills on Neural Search☆25Updated last year
- Python Biella Group basic template for a modern generic python application☆12Updated 7 months ago
- A curated list of awesome resources for creating synthetic data☆41Updated 2 years ago
- An abstraction layer for parameter tuning☆36Updated 4 months ago
- Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.☆127Updated last year
- A curated list of awesome open source tools and commercial products for monitoring data quality, monitoring model performance, and profil…☆69Updated 8 months ago
- Workshop "From zero to MLOps: An open source stack to fight spaghetti ML"☆26Updated 6 months ago
- It's a cooler way to store simple linear models.☆28Updated 6 months ago
- A template to kick-start your Python project ✨🚀☆50Updated 3 weeks ago
- Explore and compare 1K+ accurate decision trees in your browser!☆157Updated 10 months ago
- A template for a starter project for ZenML☆12Updated last month
- Set up a Cost-Effective Modern Data Stack for a Charity☆19Updated 10 months ago
- ☆23Updated 2 years ago
- ☆37Updated last year
- ☆32Updated last year
- Example project with a complete MLOps cycle: versioning data, generating reports on pull requests and deploying the model on releases wit…☆47Updated 3 years ago
- Maternal Health Risk prediction MLOps pipeline☆41Updated 2 years ago
- 🦫 MLOps for (online) machine learning☆84Updated 9 months ago
- MLOps Cookiecutter Template: A Base Project Structure for Secure Production ML Engineering☆40Updated 2 months ago
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆68Updated last month
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search☆22Updated last year
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆78Updated 3 weeks ago
- Metrics to evaluate quality and efficacy of synthetic datasets.☆220Updated this week
- A repository that showcases how you can use ZenML with Git☆69Updated 5 months ago
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆21Updated 2 years ago
- Pipeline components that support partial_fit.☆44Updated 6 months ago