Renien / ETL-Starter-Kit
Extract, Transform, Load (ETL) refers to a process in database usage and especially in data warehousing. This repository contains a starter kit featuring ETL related work.
☆21Updated 8 years ago
Alternatives and similar repositories for ETL-Starter-Kit:
Users that are interested in ETL-Starter-Kit are comparing it to the libraries listed below
- Flink Examples☆39Updated 8 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆41Updated 7 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra☆62Updated 5 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 11 years ago
- Analyzing Twitter real time feed with Spark Streaming☆32Updated 10 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- High performance HBase / Spark SQL engine☆28Updated 2 years ago
- Utilities for writing tests that use Apache Spark.☆24Updated 6 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆61Updated 7 months ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 2 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table☆23Updated 6 years ago
- ☆49Updated 5 years ago
- Cascading on Apache Flink®☆54Updated last year
- ☆48Updated 7 years ago
- Apache Spark ETL Utilities☆40Updated 5 months ago
- A small project to show how to add lineage to Atlas when using Spark as ETL tool☆12Updated 8 years ago
- ☆38Updated 7 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy☆62Updated last year
- This is a simple CEP Engine leveraging the Kafka Streams platform☆17Updated 7 years ago
- ☆16Updated 11 years ago
- MySQL to NoSQL real time dataflow☆18Updated 7 years ago
- Code for Packt Publishing's Scala Data Analysis Cookbook.☆49Updated 9 years ago
- ☆54Updated 10 years ago
- Sample processing code using Spark 2.1+ and Scala☆51Updated 4 years ago
- Simple implementation of a custom parquet reader/writer☆11Updated 8 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 9 years ago
- ☆31Updated 7 years ago
- Simple Spark app that reads and writes Avro data☆31Updated 10 years ago