iagcl / data_pipeline
Data Pipeline is a Python application for replicating data from source to target databases
☆17Updated 7 years ago
Alternatives and similar repositories for data_pipeline
Users who are interested in data_pipeline are comparing it to the libraries listed below
- An AWS Lambda package including two functions to dynamically maintain a security partition around a group of AWS resources which originat…☆12Updated 6 years ago
- CloudFormation and SQL scripts used to replicate a POC environment from the "Data Lake to Data Warehouse: Enhancing Customer 360 with Ama…☆31Updated 5 years ago
- Configure an LDAPS Endpoint for Simple AD☆14Updated 7 years ago
- Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and unde…☆16Updated 2 years ago
- Distributed workflow progress tracker☆12Updated 6 years ago
- A cookiecutter template to create an AWS Lambda function☆23Updated 6 years ago
- Automate migration of AWS Lambda functions across accounts using CloudFormation☆12Updated 5 years ago
- Alexa GuardDuty Sample Skill☆14Updated 7 years ago
- Code samples related to "Harmonize, Search, and Analyze Loosely Coupled Datasets on AWS" (https://aws.amazon.com/blogs/big-data/harmonize…☆22Updated 6 years ago
- A solution enabling customers to quickly deploy an architecture to identify and mask sensitive health data☆26Updated last year
- Terraform module for a PostgreSQL-backed Apache Airflow instance☆24Updated 7 years ago
- Herd-MDL, a turnkey managed data lake in the cloud. See https://finraos.github.io/herd-mdl/ for more information.☆16Updated 11 months ago
- Sandbox for Apache NiFi☆24Updated 3 years ago
- ARCHIVED - see https://aws.amazon.com/about-aws/whats-new/2019/04/Amazon-S3-Introduces-S3-Batch-Operations-for-Object-Management/ Amazo…☆26Updated 7 years ago
- This GitHub project provides a series of lab exercises which help users get started using the Redshift platform.☆53Updated 4 years ago
- Operational Data Processing Framework developed using AWS Glue and Apache Hudi. This framework is suitable for Data Lake and Modern Data …☆22Updated last year
- Terraform script for launching multiple EMR clusters for training purposes.☆16Updated last year
- DEPRECATED - An AWS Lambda powered monitoring framework for security, compliance, and best practices.☆31Updated 6 years ago
- Jupyter notebook that calls Rekognition, displays an image, and calls a local Neo4j DB to display a graph of relationships☆27Updated 5 years ago
- Glue Python Shell Job that adds AWS Organizations account tags to Cost and Usage Reports. You can submit feedback & requests for changes…☆16Updated 4 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆18Updated 8 years ago
- This tool generates an emulated data stream based on the NYC Taxi & Limousine Commission’s open dataset expanded with additional routing inf…☆13Updated 6 years ago
- A collection of recipes for building AWS services, Elasticsearch, etc.☆12Updated 4 years ago
- Contains a collection of serverless apps that wrap common financial functions as AWS Lambda functions☆61Updated 11 months ago
- Examples demonstrating how to use Amazon S3 Inventory to analyze your S3 storage using Spark and EMR.☆19Updated 5 years ago
- Python script to automatically sync new instances via AWS CodeDeploy APIs☆15Updated 7 years ago
- Reference Architectures for Datalakes on AWS☆79Updated 5 years ago
- Utilities for use with the AWS Discovery Service API☆27Updated 4 years ago
- AWS Quick Start Team☆23Updated 8 months ago
- DataHub on AWS demonstration resources☆10Updated 2 years ago