tspannhw / FLiPStackWeeklyLinks
FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...
☆21Updated this week
Alternatives and similar repositories for FLiPStackWeekly
Users who are interested in FLiPStackWeekly are comparing it to the repositories listed below
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated this week
- ☆13Updated 2 years ago
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino☆21Updated 3 years ago
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆60Updated 3 years ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆168Updated this week
- ☆103Updated 9 months ago
- DataOps Observability is part of DataKitchen's Open Source Data Observability. DataOps Observability monitors every data journey from da…☆48Updated 2 weeks ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, StreamSets, HDFS, ....☆77Updated this week
- This repository contains recipes for Apache Pinot.☆32Updated 7 months ago
- A platform to manage the data product life cycle☆20Updated this week
- A table-format-agnostic data sharing framework☆39Updated last year
- Delta reader for the Ray open-source toolkit for building ML applications☆46Updated last year
- rust-for-data☆46Updated 2 years ago
- A Flink application that demonstrates reading from and writing to Apache Kafka with Apache Flink☆20Updated 2 years ago
- Discover the simplicity and strength of DuckDB, dbt, and Iceberg in this project. Create an efficient, versatile data analytics solution …☆37Updated 2 years ago
- Generative AI in realtime with Confluent Cloud.☆25Updated last year
- ☆60Updated last year
- Python package for querying Iceberg data through DuckDB.☆70Updated last year
- DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data qualit…☆64Updated last week
- Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.☆69Updated last week
- A write-audit-publish implementation on a data lake without the JVM☆46Updated last year
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆64Updated 2 years ago
- Transporter for integrating OpenLineage with OpenMetadata☆15Updated last month
- Yet Another (Spark) ETL Framework☆21Updated last year
- ☆12Updated 3 years ago
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆10Updated last year
- ☆23Updated 2 weeks ago
- Full stack data engineering tools and infrastructure set-up☆56Updated 4 years ago
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and…☆28Updated 2 years ago
- Utility functions for dbt projects running on Spark☆33Updated 8 months ago