Clearbox-AI / nerpii
A Python library to perform NER on structured data and generate PII with Faker
☆29Updated 8 months ago
Alternatives and similar repositories for nerpii:
Users that are interested in nerpii are comparing it to the libraries listed below
- A Python library to check for data quality and automatically generate data tests.☆43Updated last year
- An open-source Python library for the assessment of utility and privacy performance of any tabular synthetic dataset.☆20Updated last week
- Python package for deduplication/entity resolution using active learning☆76Updated 5 months ago
- A curated list of awesome resources for creating synthetic data☆41Updated 3 years ago
- A template for a starter project for ZenML☆13Updated 2 months ago
- Frouros: an open-source Python library for drift detection in machine learning systems.☆208Updated 2 weeks ago
- Python Biella Group basic template for a modern generic python application☆12Updated 8 months ago
- Explore and compare 1K+ accurate decision trees in your browser!☆159Updated 11 months ago
- ☆37Updated last year
- A python package for benchmarking interpretability techniques on Transformers.☆213Updated 4 months ago
- Knowledge pills on Neural Search☆25Updated last year
- pyCANON is a Python library and CLI to assess the values of the parameters associated with the most common privacy-preserving techniques.☆30Updated this week
- A list of awesome open source projects in the machine learning field, who's developers are mainly based in Germany☆42Updated 5 months ago
- Practical examples of "Flawed Machine Learning Security" together with ML Security best practice across the end to end stages of the mach…☆105Updated 2 years ago
- Metrics to evaluate quality and efficacy of synthetic datasets.☆222Updated this week
- A set of utilities to quicky analyze time series.☆22Updated 3 years ago
- An open-source compliance-centered evaluation framework for Generative AI models☆129Updated 2 months ago
- Open-Source Software, Tutorials, and Research on Data-Centric AI 🤖☆330Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘☆100Updated last year
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆21Updated 2 years ago
- An abstraction layer for parameter tuning☆35Updated 5 months ago
- A software package for privacy-preserving generation of a synthetic twin to a given sensitive data set.☆51Updated 5 months ago
- ☆17Updated 7 months ago
- A repository that showcases how you can use ZenML with Git☆69Updated 6 months ago
- Python package for text mining of time-series data☆69Updated 2 months ago
- Product analytics for AI Assistants☆139Updated this week
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search☆22Updated last year
- ☆32Updated last year
- Workshop "From zero to MLOps: An open source stack to fight spaghetti ML"☆24Updated 7 months ago
- A curated list of awesome synthetic data tools (open source and commercial).☆151Updated last year