PastorGL / datacooker-etl
ETL processing toolset with SQL-like language and GIS capabilities, built on core Spark. Extensible and modular. REPL included
☆17 Updated 2 weeks ago
Alternatives and similar repositories for datacooker-etl
Users who are interested in datacooker-etl compare it to the libraries listed below.
- ITSumma Spark Greenplum Connector ☆42 Updated last year
- ☆18 Updated 4 years ago
- JDBCX: Extended JDBC driver for dynamic multi-language queries with optional bridge server for federated datasource connectivity. ☆27 Updated last month
- One ETL tool to rule them all ☆84 Updated 2 weeks ago
- This project is used to capture machine learning pipelines created on top of Spark as OK ☆54 Updated 3 years ago
- Data catalog for everything in your company ☆50 Updated 2 years ago
- Vostok Hercules is an open-source distributed system based on Apache Kafka and used for reliable delivery of telemetry data from microser… ☆46 Updated 2 years ago
- How to build your first Spark application with MLlib, StructuredStreaming, GraphFrames, Datasets and so on? Answer is here! ☆53 Updated 6 years ago
- ☆52 Updated last year
- An implementation of the DatasourceV2 interface of Apache Spark™ for writing Spark Datasets to Apache Druid™. ☆43 Updated last week
- The Proxima platform. ☆22 Updated 2 weeks ago
- A tiny embedded Java-engine for extremely fast partitioned immutable-after-construction databases ☆113 Updated 3 years ago
- DSL for generating Grafana dashboards ☆62 Updated 3 years ago
- All stuff in a single repo (tests, ideas, benchmarks) ☆24 Updated 2 years ago
- Dimension UI is a desktop application designed to collect, store, visualize, and analyze real-time data ☆43 Updated 2 weeks ago
- The most popular ClickHouse plugin for Airflow. 🔝 Top-1% downloads on PyPI: https://pypi.org/project/airflow-clickhouse-plugin! Based on… ☆171 Updated 2 weeks ago
- DE or DIE meetup made by data engineers for data engineers. Currently in Russian only. ☆58 Updated last year
- Kafka Connector for Iceberg tables ☆16 Updated 2 years ago
- Library for generating avro schema files (.avsc) based on DB tables structure ☆52 Updated last year
- Smart Automation Tool for building modern Data Lakes and Data Pipelines ☆123 Updated 3 weeks ago
- Dockerized runner, utilities, and functions for FlinkSQL applications ☆27 Updated last week
- Java library for TRUE database access ☆23 Updated 3 months ago
- Convert XSD -> AVSC and XML -> AVRO ☆37 Updated 4 years ago
- A set of tools to roll out your own hadoop distro. ☆15 Updated 7 years ago
- Monitoring and insights on your data lakehouse tables ☆33 Updated last month
- pg-index-health is an embeddable schema linter for PostgreSQL that detects common anti-patterns and promotes best practices. ☆186 Updated last week
- ☆24 Updated 3 years ago
- Apache iceberg Spark s3 examples ☆20 Updated last year
- Sample processing code using Spark 2.1+ and Scala ☆51 Updated 5 years ago
- Transporter for integrating OpenLineage with OpenMetadata ☆15 Updated 3 months ago