edwardcooper / data-sentryLinks
A project to build a machine learning pipeline to detect personal identifiable information (PII)
☆16Updated 2 years ago
Alternatives and similar repositories for data-sentry
Users that are interested in data-sentry are comparing it to the libraries listed below
Sorting:
- A package to build an end-to-end pipeline for detecting personally identifiable information from text.☆45Updated 6 years ago
- Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets…☆45Updated 3 years ago
- Streamlit example showing Scikit Learn & Pyspark ML over Healthcare data ! Its simple !!☆30Updated 4 years ago
- Streamlit deployment on AWS Fargate☆12Updated 4 years ago
- https://duyet.github.io/related-skills-visualization/index.html☆11Updated 4 years ago
- ☆18Updated 2 years ago
- Using Jupyter notebook to develop DevOps automated environment to start and stop SageMaker notebook instances out of working hours☆22Updated 6 years ago
- Search for PII in Python☆29Updated last year
- ☆22Updated 3 years ago
- Orchestration of data processing tasks to power the Open Skills Project☆16Updated last week
- ☆31Updated 2 years ago
- Project Matt: Scan your AWS S3 Buckets for PII Data to Guard against GDPR☆14Updated 7 years ago
- A simple search engine to search medium stories built with streamlit and elasticsearch.☆40Updated 3 years ago
- ☆16Updated 2 years ago
- ☆20Updated last year
- ☆19Updated 2 years ago
- An analysis of abilities, skills and tech skills data from the O*NET database as well as classification of around 500 random LinkedIn job…☆18Updated 4 years ago
- This application guides you through the development of a language model that classifies clinical documents according to their medical spe…☆12Updated 9 months ago
- Combining the search power of Elasticsearch with the Question Answering power of GPT☆84Updated last year
- Using LangChain's SQL Database Chain and Agent with various LLMs to perform Natural Language Queries (NLQ) of an Amazon RDS for PostgreSQ…☆48Updated last year
- this is a Manual Named-Entities/Part-of-speech Tagger for Spacy, You can use it to create your own training datasets.☆12Updated 6 years ago
- A CLI for identifying potential Personally Identifiable Information in datasets.☆13Updated 6 years ago
- Streamlit application to explore Snowflake Tables☆41Updated last year
- ☆28Updated 4 years ago
- Follow the Lumiata Tech Blog on Medium!☆21Updated 2 years ago
- ETL process which downloads, transforms, and loads Freddie Mac/Fannie Mae mortgage data☆19Updated 7 years ago
- A python package that provides a custom streamlit connection to query data from weaviate, the AI native vector database☆55Updated 10 months ago
- Deep learning and AI projects☆26Updated 6 years ago
- Context is Key: Combining Embedding-based Retrieval with LLMs for Comprehensive Knowledge Enrichment☆32Updated last year
- Create Interactive Dashboards with Streamlit and Python Coursera☆10Updated 4 years ago