Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....
☆80 · Mar 13, 2026 · Updated last week
Alternatives and similar repositories for platys-modern-data-platform
Users who are interested in platys-modern-data-platform are comparing it to the libraries listed below.
- A tool for generating docker-compose environments · ☆27 · Aug 5, 2025 · Updated 7 months ago
- A Data Mesh demo repository · ☆13 · Oct 10, 2024 · Updated last year
- HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checking · ☆12 · Apr 11, 2025 · Updated 11 months ago
- File streaming service designed for Kubernetes to provide ReadWriteMany storage support · ☆14 · Jul 17, 2023 · Updated 2 years ago
- Grok Expression Transform for Kafka Connect. · ☆16 · Feb 8, 2021 · Updated 5 years ago
- A python3 module that converts your bs4 Tag into json object (dict) · ☆16 · Updated this week
- LangChain Expression Language (LCEL) As NiFi Processors · ☆15 · Feb 13, 2024 · Updated 2 years ago
- Spark Standalone & Livy · ☆11 · Jul 13, 2021 · Updated 4 years ago
- a subset of sql dialect for clickhouse db. · ☆13 · Mar 5, 2026 · Updated 2 weeks ago
- Notes that I should one day turn into a blog or something ... · ☆33 · Jan 13, 2026 · Updated 2 months ago
- A library for error handling in Kafka Streams. · ☆20 · Mar 2, 2026 · Updated 2 weeks ago
- An Ansible collection of utilities and other resources for Cloudera Platform deployments · ☆13 · Nov 13, 2025 · Updated 4 months ago
- Courses that Dr. Xu teaches · ☆17 · Jan 23, 2019 · Updated 7 years ago
- ☆10 · Jun 29, 2021 · Updated 4 years ago
- A platform and cloud-based service for data sharing based on the Delta Sharing protocol. · ☆21 · Jun 12, 2024 · Updated last year
- A Kafka Serde that reads and writes records from and to Blob storage (S3, Azure, Google) transparently. · ☆63 · Mar 2, 2026 · Updated 2 weeks ago
- Cal-ITP data infrastructure · ☆69 · Updated this week
- Distributed locks on java based on Redis database and Jedis library. Also have distributed Java collections and scan iterators. · ☆18 · Mar 7, 2026 · Updated last week
- ☕⛵WIP PySpark dependency management · ☆22 · Jul 8, 2018 · Updated 7 years ago
- Source code of webpro.nl · ☆11 · Oct 12, 2025 · Updated 5 months ago
- Kubernetes deployment of PrestoDB, Hive Metastore, and Minio S3-standard object store · ☆17 · Oct 20, 2022 · Updated 3 years ago
- Create Data Mesh. Use interoperable digital twins to create data interactions and build powerful real-time data products. This repository… · ☆16 · Sep 24, 2024 · Updated last year
- repo to push all subgraphs created · ☆21 · Nov 10, 2022 · Updated 3 years ago
- New Generation Opensource Data Stack Demo · ☆456 · Feb 6, 2023 · Updated 3 years ago
- VSCode extension for working with Architecture As A Code in the C4 model. Includes syntax highlighting, diagram preview, and tools for wo… · ☆35 · Mar 6, 2026 · Updated 2 weeks ago
- Fugue collections for Prefect 2.0 · ☆38 · Oct 18, 2023 · Updated 2 years ago
- Apache Ranger Plugin for S3 · ☆20 · Nov 30, 2022 · Updated 3 years ago
- Apache flink · ☆16 · Jul 12, 2025 · Updated 8 months ago
- Firefox extension that shows parquet schema when going over GCP cloud storage. Use DuckDB WASM · ☆12 · Jan 19, 2024 · Updated 2 years ago
- Tutorials for GDSO at Berkeley Data Science Workshop · ☆15 · Jul 13, 2018 · Updated 7 years ago
- Project based learning for Data Engineering fundamentals. · ☆13 · Jan 15, 2021 · Updated 5 years ago
- Testing Boring SL with DuckDB · ☆32 · Aug 18, 2025 · Updated 7 months ago
- ☆67 · May 9, 2025 · Updated 10 months ago
- Pypi package used for type hinting when creating MAGE modules. · ☆17 · Sep 5, 2021 · Updated 4 years ago
- A collection of pipelines for Scrapy · ☆16 · Mar 13, 2026 · Updated last week
- Serverless for data practitioners. The fastest ⚡️ way to run your code in the cloud. Effortlessly run scripts, functions, and Jupyter not… · ☆41 · Feb 19, 2024 · Updated 2 years ago
- Shared toolset for OS apps · ☆15 · Oct 31, 2019 · Updated 6 years ago
- ☆22 · Jul 18, 2024 · Updated last year
- Repository for Data Engineering Zoomcamp 2024 · ☆14 · Mar 25, 2024 · Updated last year