linkedin / Hoptimator
Multi-hop declarative data pipelines
☆86Updated last month
Related projects: ⓘ
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆51Updated this week
- Simple project to expose a catalog over REST using a Java catalog backend☆102Updated this week
- ☆77Updated last year
- In-Memory Analytics for Kafka using DuckDB☆63Updated this week
- ☆197Updated last month
- ☆130Updated last month
- Replicates any database (CDC events) to Apache Iceberg (To Cloud Storage)☆179Updated this week
- Adapter for dbt that executes dbt pipelines on Apache Flink☆80Updated 6 months ago
- Flexible development framework for building streaming data applications in SQL with Kafka, Flink, Postgres, GraphQL, and more.☆90Updated this week
- 📚 Tech blogs & talks by companies that run Apache Flink in production☆151Updated last month
- A dbt adapter for Decodable☆11Updated 8 months ago
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆135Updated 10 months ago
- Generate authentic looking mock data based on a SQL, JSON or Avro schema and produce to Kafka in JSON or Avro format.☆141Updated 2 weeks ago
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆55Updated 11 months ago
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆62Updated 4 months ago
- Storage connector for Trino☆90Updated 2 weeks ago
- Apache Hive Metastore as a Standalone server in Docker☆64Updated 3 weeks ago
- Dashboard for operating Flink jobs and deployments.☆25Updated 5 months ago
- ☆144Updated this week
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆90Updated 2 weeks ago
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Updated last year
- Traffic routing for Trino Clusters☆24Updated last week
- ☆49Updated last week
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆28Updated 2 weeks ago
- ☆39Updated last year
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL☆37Updated last year
- Dione - a Spark and HDFS indexing library☆49Updated 5 months ago
- ☆129Updated this week
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆82Updated 5 months ago
- A Table format agnostic data sharing framework☆36Updated 7 months ago