kradecki / infaLinks
Python API for Informatica PowerCenter (pmrep, pmcmd)
☆21Updated 8 years ago
Alternatives and similar repositories for infa
Users that are interested in infa are comparing it to the libraries listed below
Sorting:
- ☆16Updated 6 years ago
- Educational notes,Hands on problems w/ solutions for hadoop ecosystem☆87Updated 6 years ago
- ☆118Updated 5 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 6 years ago
- Data Engineering with Spark and Delta Lake☆106Updated 2 years ago
- ☆26Updated 5 years ago
- Code Repository for GCP: Complete Google Data Engineer and Cloud Architect Guide(v), Published by Packt☆16Updated 2 years ago
- This repository contains code for Spark Streaming☆25Updated 4 years ago
- ☆26Updated 2 years ago
- Resources for video demonstrations and blog posts related to DataOps on AWS☆182Updated 3 years ago
- Learn how to auto-ingest streaming data into Snowflake using Snowpipe.☆23Updated 3 years ago
- A Snowflake Sandbox for Data Science☆36Updated 4 years ago
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow☆34Updated 5 years ago
- This project is mainly for learning and practicing simple HIVE commands in real time scenarios. Here we have taken some sample coffee sho…☆11Updated 7 years ago
- ☆23Updated 3 years ago
- Snowflake Cookbook, published by Packt☆82Updated 2 years ago
- A compact framework for automating a Snowflake analytics pipeline on Amazon ECS.☆18Updated 2 years ago
- Example DAGs using hooks and operators from Airflow Plugins☆347Updated 7 years ago
- This repo contains live examples to build Databricks' Lakehouse and recommended best practices from the field.☆22Updated last year
- Supplementary material for Building a Modern Data Platform with Snowflake, from Pearson.☆20Updated 4 years ago
- ☆54Updated 5 years ago
- Databricks - Apache Spark™ - 2X Certified Developer☆265Updated 5 years ago
- Public source code for the Udemy online course Apache Airflow: Complete Hands-On Beginner to Advanced Class.☆63Updated 5 years ago
- Airflow training for the crunch conf☆104Updated 7 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆55Updated 2 years ago
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered…☆16Updated 6 years ago
- A Pyspark job to handle upserts, conversion to parquet and create partitions on S3☆28Updated 5 years ago
- ☆24Updated 5 years ago
- ETL pipeline using pyspark (Spark - Python)☆116Updated 5 years ago
- In this workshop you will launch an Amazon Redshift cluster in your AWS account and load sample data ~ 100GB using TPCH dataset. You will…☆24Updated 7 years ago