Clearbox-AI / nerpiiLinks
A Python library to perform NER on structured data and generate PII with Faker
☆30Updated last year
Alternatives and similar repositories for nerpii
Users that are interested in nerpii are comparing it to the libraries listed below
Sorting:
- A Python library to check for data quality and automatically generate data tests.☆42Updated last year
- An open-source Python library for the assessment of utility and privacy performance of any tabular synthetic dataset.☆23Updated 2 weeks ago
- Python Biella Group basic template for a modern generic python application☆12Updated 2 months ago
- Clearbox AI's all-in-one solution for generation and evaluation of synthetic tabular and time-series data.☆43Updated last month
- Use local LLMs for advanced NLP☆25Updated 3 months ago
- Knowledge pills on Neural Search☆26Updated 2 years ago
- Frouros: an open-source Python library for drift detection in machine learning systems.☆221Updated 2 weeks ago
- A public repo that contains integrations for Argilla and LlamaIndex.☆15Updated 8 months ago
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search☆26Updated last year
- ☆68Updated 4 months ago
- Synthetic Data Engine 💎☆63Updated this week
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆79Updated 6 months ago
- NIST Collaborative Research Cycle on Synthetic Data. Learn about Synthetic Data week by week!☆27Updated last year
- Python library that allows you to get structured responses in the form of Pydantic models and Python types from Anthropic, Google Vertex …☆78Updated 11 months ago
- Explore and compare 1K+ accurate decision trees in your browser!☆164Updated last year
- A template for a starter project for ZenML☆14Updated 3 months ago
- ☆38Updated last year
- 📚 A curated list of papers & technical articles on AI Quality & Safety☆184Updated 2 months ago
- Product analytics for AI Assistants☆153Updated last month
- A microframework for creating simple AI agents.☆90Updated 10 months ago
- Python package for deduplication/entity resolution using active learning☆80Updated 10 months ago
- pyCANON is a Python library and CLI to assess the values of the parameters associated with the most common privacy-preserving techniques.☆38Updated last week
- A curated list of awesome resources for creating synthetic data☆42Updated 3 years ago
- a unified framework for leveraging LLMs☆75Updated this week
- Cloud-agnostic Python API☆60Updated last year
- Playing with Python Bluesky SDK☆15Updated 7 months ago
- A template to kick-start your Python project ✨🚀☆51Updated 6 months ago
- A place to put random stuff☆17Updated last year
- A tutorial on how to use kedro-mlflow plugin (https://github.com/Galileo-Galilei/kedro-mlflow) to synchronize training and inference and …☆39Updated 2 years ago
- An open-source package for python to clean raw text data☆70Updated last year