edwardcooper / data-sentryLinks
A project to build a machine learning pipeline to detect personal identifiable information (PII)
☆16Updated 3 years ago
Alternatives and similar repositories for data-sentry
Users that are interested in data-sentry are comparing it to the libraries listed below
Sorting:
- A package to build an end-to-end pipeline for detecting personally identifiable information from text.☆49Updated 6 years ago
- Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets…☆46Updated 3 weeks ago
- Project Matt: Scan your AWS S3 Buckets for PII Data to Guard against GDPR☆14Updated 7 years ago
- A boilerplate solution for processing image and PDF documents for regulated industries, with lineage and pipeline operations metadata ser…☆67Updated 4 years ago
- ☆19Updated 4 years ago
- Docker images that replicate the Amazon SageMaker Notebook instance.☆57Updated 4 years ago
- A CLI for identifying potential Personally Identifiable Information in datasets.☆14Updated 6 years ago
- Natural Language Processing on AWS Workshop☆53Updated 7 years ago
- Source code for the post, 'Getting Started with Data Analysis on AWS, using S3, Glue, Amazon Athena, and QuickSight'☆28Updated 5 years ago
- A step-by-step guide that shows how to do text classification by run training/inference for a custom model in Amazon SageMaker☆109Updated 6 years ago
- Using Jupyter notebook to develop DevOps automated environment to start and stop SageMaker notebook instances out of working hours☆21Updated 7 years ago
- Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub☆336Updated 2 years ago
- ☆49Updated last year
- Using Amazon Comprehend, Amazon Elasticsearch with Kibana, Amazon S3, Amazon Cognito to search over large number of documents.☆24Updated last year
- aws-solutions-library-samples / guidance-for-natural-language-queries-of-relational-databases-on-awsDemonstration of Natural Language Query (NLQ) of an Amazon RDS for PostgreSQL database, using SageMaker JumpStart, Amazon Bedrock, LangCh…☆71Updated last year
- ☆39Updated 2 years ago
- This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A python job will then be submitted to a Apach…☆19Updated 9 years ago
- Repository for hosting models on AWS blog post☆14Updated 6 years ago
- A sample set of notebooks demonstrating Amazon Comprehend capabilities.☆46Updated 2 years ago
- This workshop demonstrates two methods of machine learning inference for global production using AWS Lambda and Amazon SageMaker☆58Updated 5 years ago
- Setup end to end demo architecture for predicting fraud events with Machine Learning using Amazon SageMaker☆330Updated last year
- AWS CloudFormation templates and Python code for AWS blog post on how to automate IAM credential reports at scale across AWS.☆18Updated 3 years ago
- ☆96Updated 4 years ago
- ☆20Updated 3 years ago
- ☆20Updated 2 years ago
- The AI-Driven Social Media Dashboard solutions provides customers with a CloudFormation template that is easy to deploy to use Amazon Tra…☆60Updated 4 years ago
- ☆32Updated last year
- A serverless app that periodically polls the public Twitter Standard Search API and invokes a given lambda function to process new tweets☆103Updated 2 years ago
- Post-process Amazon Textract results with Hugging Face transformer models for document understanding☆102Updated last year
- ☆35Updated 11 months ago