Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and reporting solution with Amazon EMR, AWS Glue, and Amazon QuickSight".
☆20May 13, 2020Updated 6 years ago
Alternatives and similar repositories for data-profiler-for-aws-glue-data-catalog
Users that are interested in data-profiler-for-aws-glue-data-catalog are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This GitHub project provides a series of lab exercises which help users get started using the Redshift platform.☆53Mar 31, 2021Updated 5 years ago
- This repository has configuration files to set up an open-source tool named Okta AWS CLI Assume Role Tool (https://github.com/oktadevelop…☆10May 18, 2020Updated 6 years ago
- A CLI to manage and monitor permissions in AWS Lake Formation☆25Feb 8, 2023Updated 3 years ago
- Use Amazon Lex as a conversational interface with Twilio Media Streams☆13Feb 20, 2026Updated 4 months ago
- Sample code that reads Microsoft Excel workbook/CSV File for the details required to create a DMS task CloudFormation template☆14Jan 21, 2021Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This repository contains sample code that is used to demonstrate building, deploying and invoking a SageMaker model for heart disease pre…☆10Oct 14, 2020Updated 5 years ago
- ☆13Aug 5, 2020Updated 5 years ago
- ☆11May 24, 2023Updated 3 years ago
- Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics☆65Oct 17, 2023Updated 2 years ago
- Glue Python Shell Job that adds AWS Organizations account tags to Cost and Usage Reports. You can submit feedback & requests for changes…☆16Mar 14, 2021Updated 5 years ago
- Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark☆14Apr 14, 2023Updated 3 years ago
- [DEPRECATED] GAE python based app which regularly collects information about GCP resources and stores them in BigQuery☆45Aug 31, 2023Updated 2 years ago
- AWS Glue Configurable Test Data Generator for S3 Data Lakes and DynamoDB☆19Jan 19, 2026Updated 5 months ago
- Design best practices for building scalable ETL (Extract-Transform-Load) and ELT (Extract-Load-Transform) data processing pipelines using…☆17Dec 8, 2019Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Sample code with integration between Data Catalog and Hive data source.☆24Jan 29, 2025Updated last year
- ☆24Jul 15, 2022Updated 3 years ago
- ☆17Jul 21, 2025Updated 11 months ago
- Exploring how AWS AppSync can utilize AWS Lambda to integrate with alternative data sources, including Amazon ElastiCache and Amazon Nept…☆14Aug 22, 2019Updated 6 years ago
- Plugin for Intake to read from SQL servers☆15May 29, 2023Updated 3 years ago
- Samples to help you get started with the AWS Data Exchange API.☆22Oct 28, 2024Updated last year
- Replication utility for AWS Glue Data Catalog☆80Aug 8, 2024Updated last year
- ☆16Jan 31, 2022Updated 4 years ago
- AWS Step Function Implementation in JS, so you can run your Node.js lambda handlers in your test environments. Made to support Serverless…☆15Jun 22, 2026Updated last week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Pandas helper functions☆31Feb 19, 2023Updated 3 years ago
- The sample code provides a deploy function and an executable to easily deploy an Amazon Lex bot based on a Lex Schema file.☆23Nov 2, 2023Updated 2 years ago
- ☆14Feb 23, 2021Updated 5 years ago
- Auto-mirror of scoopinstaller/scoop-main bucket☆12Updated this week
- Capistrano tasks to deploy into docker.☆13Mar 4, 2023Updated 3 years ago
- Python API for Deequ☆41Nov 10, 2020Updated 5 years ago
- Events about the open source data stack☆13Apr 16, 2022Updated 4 years ago
- Limit long text output for a single JupyterLab mime render.☆13Jun 10, 2026Updated 3 weeks ago
- Diagrams as code - A simple mobile first editor for UML diagrams built using AWS Lambda, Google Cloud Kubernetes, DynamoDB etc.☆14Mar 26, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- DEPRECATED - An AWS CloudFormation macro to allow the definition of Amazon States Language in YAML within a CloudFormation template☆16Aug 23, 2021Updated 4 years ago
- This repository contains NiFi processors for interacting with Snowflake Cloud Data Platform.☆13May 5, 2026Updated 2 months ago
- Secure and performant OCI-image builder for Kubernetes☆14Updated this week
- ☆22Feb 17, 2020Updated 6 years ago
- Simple command line creation and editing of Evernote notes with Markdown and your favorite text editor☆32Jun 11, 2019Updated 7 years ago
- Command line client for the Fugue API☆14Mar 7, 2023Updated 3 years ago
- Indonesia GitHub stats for fun. Frontend at https://github.com/antonybudianto/gitcard☆14Feb 25, 2023Updated 3 years ago