A complete data engineering project demonstrating modern data stack practices with Apache Flink, Iceberg, Trino and Superset
☆20Sep 29, 2025Updated 5 months ago
Alternatives and similar repositories for apache_flink_and_iceberg
Users who are interested in apache_flink_and_iceberg are comparing it to the libraries listed below.
- High throughput streaming of Protobuf data from Kafka into DuckDB☆12Mar 4, 2026Updated 3 weeks ago
- Benchmark tool to test StarRocks using several benchmarks.☆16Feb 15, 2022Updated 4 years ago
- Generate tpch data in parquet format☆15Jan 25, 2023Updated 3 years ago
- Arrow-Powered Data Exchange☆15Feb 7, 2025Updated last year
- This project demonstrates real-time streaming of CDC data from MySQL to Apache Iceberg using Flink SQL Client for faster data analytics a…☆23Jan 16, 2024Updated 2 years ago
- Zap file format compatible with a future version of Bleve☆15Updated this week
- ☆13Jun 10, 2024Updated last year
- grafana-loki-demo☆17May 16, 2021Updated 4 years ago
- A powerful Rust WebSocket client library for Bilibili live room danmaku (DM), supporting real-time danmaku monitoring, text-to-speech (TTS), and automatic browser cookie detection.☆27Mar 14, 2026Updated last week
- TensorFlow is not only a well-designed deep learning toolbox, but also a standard symbolic programming framework. In this repository, we…☆12Oct 15, 2018Updated 7 years ago
- Iceberg Playground in a Box☆67Updated this week
- An Ibis back-end for the GizmoSQL Arrow Flight SQL Server (with the DuckDB engine)☆16Mar 2, 2026Updated 3 weeks ago
- Apache Hive Metastore in Standalone Mode With Docker☆14Jul 22, 2024Updated last year
- Olympia is a storage-only open catalog format for big data analytics, ML & AI.☆16May 5, 2025Updated 10 months ago
- ☆13Jan 31, 2024Updated 2 years ago
- redash docker-compose running☆17Jan 6, 2019Updated 7 years ago
- Protobuf to Arrow, using Rust☆24Mar 20, 2026Updated last week
- Not updated, use☆11Jan 9, 2017Updated 9 years ago
- Benchmark dataset of solar PV EL images and the corresponding ground truth masks☆18Mar 10, 2024Updated 2 years ago
- A stylish, feature-rich, polished local music player based on JavaFX, V1.0☆10May 29, 2020Updated 5 years ago
- A Trino ODBC driver☆14Jan 10, 2024Updated 2 years ago
- Go library for decoding generic map values and native Go structures into Arrow.☆17Jan 30, 2026Updated last month
- Go library to stream Kafka protobuf messages to DuckDB☆25Updated this week
- How to use the Amazon S3 website hosting feature to host a static website☆11Apr 14, 2019Updated 6 years ago
- This project demonstrates how to integrate DuckLake, SQLMesh, and Neon PostgreSQL to create a modern data lakehouse architecture with ver…☆27Jun 3, 2025Updated 9 months ago
- ☆12Jul 11, 2022Updated 3 years ago
- Apache Iceberg Spark S3 examples☆21Mar 1, 2024Updated 2 years ago
- The home of Floecat: A catalog of catalogs for open table formats☆58Updated this week
- version 2 of the Unified Cybersecurity Ontology☆16May 7, 2017Updated 8 years ago
- ☆14Jun 23, 2025Updated 9 months ago
- Code release of "Deep Visual-Semantic Quantization for Efficient Image Retrieval" (CVPR 17)☆11Apr 5, 2017Updated 8 years ago
- A CLI for spinning up and managing Ray clusters for the Daft Query Engine.☆15Feb 15, 2025Updated last year
- A repository of Apache Spark projects, training projects, and tutorials, in both Scala and Python.☆33Sep 15, 2021Updated 4 years ago
- CNN-based identification of defective solar cells in electroluminescence imagery☆14Jun 27, 2022Updated 3 years ago
- End-to-end data platform leveraging the Modern data stack☆52Apr 10, 2024Updated last year
- Insight Data Science Project: Predicting Photovoltaic Solar Panel Generation Using Machine Learning☆18Dec 4, 2022Updated 3 years ago
- A Golang library for interacting with the EPSS (Exploit Prediction Scoring System).☆30Feb 16, 2025Updated last year
- BigQuery Schema Conversion Tool☆23Oct 6, 2020Updated 5 years ago
- Material for a course on applied machine-learning for scientists. Taught at EPFL in spring 2018.☆11May 3, 2018Updated 7 years ago