Clearbox-AI / nerpii
A Python library to perform NER on structured data and generate PII with Faker
☆29 · Updated 10 months ago
Alternatives and similar repositories for nerpii:
Users who are interested in nerpii are comparing it to the libraries listed below.
- A Python library to check for data quality and automatically generate data tests. ☆42 · Updated last year
- An open-source Python library for the assessment of utility and privacy performance of any tabular synthetic dataset. ☆23 · Updated last month
- Python Biella Group basic template for a modern generic python application ☆12 · Updated this week
- pyCANON is a Python library and CLI to assess the values of the parameters associated with the most common privacy-preserving techniques. ☆36 · Updated this week
- A public repo that contains integrations for Argilla and LlamaIndex. ☆14 · Updated 6 months ago
- A template for a starter project for ZenML ☆13 · Updated 3 weeks ago
- A curated list of awesome resources for creating synthetic data ☆42 · Updated 3 years ago
- Frouros: an open-source Python library for drift detection in machine learning systems. ☆215 · Updated 2 months ago
- Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer. ☆130 · Updated last year
- Explore and compare 1K+ accurate decision trees in your browser! ☆160 · Updated last year
- Python package for deduplication/entity resolution using active learning ☆78 · Updated 7 months ago
- Knowledge pills on Neural Search ☆26 · Updated last year
- MLOps Cookiecutter Template: A Base Project Structure for Secure Production ML Engineering ☆40 · Updated 5 months ago
- ANJANA is a Python library for anonymizing sensitive data ☆30 · Updated 3 weeks ago
- Product analytics for AI Assistants ☆150 · Updated 2 weeks ago
- Cloud-agnostic Python API ☆60 · Updated 10 months ago
- Metrics to evaluate quality and efficacy of synthetic datasets. ☆230 · Updated this week
- An open-source compliance-centered evaluation framework for Generative AI models ☆147 · Updated 4 months ago
- ☆17 · Updated last month
- ⚓ Eurybia monitors model drift over time and secures model deployment with data validation ☆207 · Updated 5 months ago
- A curated list of awesome open source tools and commercial products for monitoring data quality, monitoring model performance, and profil… ☆75 · Updated 11 months ago
- A software package for privacy-preserving generation of a synthetic twin to a given sensitive data set. ☆52 · Updated 7 months ago
- Turn your Python functions into interactive apps! Fast Dash is an innovative way to deploy your Python code as interactive web apps with … ☆124 · Updated this week
- Streamlit EDA Dashboard Powered by AWS Cloud ☆80 · Updated last year
- DQW is an EDA tool for training data. ☆31 · Updated 2 years ago
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search ☆24 · Updated last year
- Synthetic Data Engine 💎 ☆53 · Updated this week
- A place to put random stuff ☆16 · Updated 11 months ago
- Research notes and extra resources for all the work at explodinggradients.com ☆23 · Updated last month
- Workshop "From zero to MLOps: An open source stack to fight spaghetti ML" ☆24 · Updated 9 months ago