Capture the logical plan from Spark (SQL)
☆22Mar 6, 2021Updated 4 years ago
Alternatives and similar repositories for SparkDataLineageCapture
Users that are interested in SparkDataLineageCapture are comparing it to the libraries listed below
Sorting:
- Implementation of a Big Data (batch and stream) distributed processing engine in Java using Akka actors.☆12Feb 20, 2023Updated 3 years ago
- Showing the relationship between ImageNet ID and labels and pytorch pre-trained model output ID and labels☆10Oct 11, 2020Updated 5 years ago
- A platform to manage the data product life cycle☆22Feb 11, 2026Updated 2 weeks ago
- Scalable CDC Pattern Implemented using PySpark☆18Oct 8, 2025Updated 4 months ago
- 一个优秀的大数据查询平台,提供hive异步任务查询、LDAP用户、数据权限控制、历史查询任务与结果存储、邮件通知、excel下载等功能。☆24Dec 30, 2017Updated 8 years ago
- Grafana's table plugin for ClickHouse☆26Jul 7, 2022Updated 3 years ago
- An sbt plugin for publishing packages to AWS CodeArtifact.☆26May 29, 2024Updated last year
- Java task scheduler to execute threads which dependency is managed by directed acyclic graph☆25Feb 2, 2017Updated 9 years ago
- Template for running ActivePivot as a Spring Boot application☆13Oct 14, 2024Updated last year
- A big data cluster management tool that creates and manages clusters of different technologies.☆21Apr 20, 2015Updated 10 years ago
- 🥪💾 A sample of data from the `jaffle-shop-generator` that powers the Jaffle Shop spanning one year.☆15Jan 23, 2025Updated last year
- This is a complete suite of spring boot couchbase and kafka☆12Dec 10, 2018Updated 7 years ago
- Code samples, summaries, cheatsheets and other study material for Hadoop MapReduce and Apache Spark☆10Aug 17, 2018Updated 7 years ago
- 支持分库分表jdbc的flink connector☆10Dec 31, 2021Updated 4 years ago
- Bridge to MetaTrader4 over ODBC interface☆18Aug 29, 2011Updated 14 years ago
- My branch of Apache Flume with a generic JDBC sink (not yet licensed to Apache)☆11Feb 12, 2022Updated 4 years ago
- Github action for running python unit tests☆10Jun 16, 2025Updated 8 months ago
- seckill秒杀项目【PRC】☆10Apr 13, 2019Updated 6 years ago
- Scala library for parsing fixed length file format☆13Oct 19, 2021Updated 4 years ago
- A timer module for Redis☆11Oct 16, 2019Updated 6 years ago
- An exploration of Flink and change-data-capture via flink-cdc-connectors☆11Jul 7, 2021Updated 4 years ago
- Demo repository to lambda-fy your dbt runs☆11Sep 7, 2023Updated 2 years ago
- An MCP (Model Context Protocol) server for data transformation and BI charts will allow AI assistants to connect to your data sources, tr…☆13Mar 31, 2025Updated 11 months ago
- Manage Unity Catalog tables with Pydantic Models☆10Mar 5, 2025Updated 11 months ago
- ☆10Aug 13, 2021Updated 4 years ago
- Second generation of the ICGC DCC release ETL built on Spark☆10Apr 8, 2019Updated 6 years ago
- Architecture principles☆13May 23, 2025Updated 9 months ago
- GA Grid (Beta) is a distributive in memory Genetic Algorithm (GA) component for Apache Ignite. A GA is a method of solving complex optimi…☆11Nov 14, 2017Updated 8 years ago
- Integration of Iceberg table management into Spark SQL☆11Jan 21, 2020Updated 6 years ago
- The Data Product Specification☆11Jan 28, 2025Updated last year
- A write-audit-publish implementation on a data lake without the JVM☆45Aug 12, 2024Updated last year
- Sveltekit + Tailwind + DaisyUI☆13Feb 17, 2023Updated 3 years ago
- Rob Pike's examples from the "Go Concurrency Patterns" talk, but in Rust☆13Jul 9, 2022Updated 3 years ago
- next auth adapter for authentication over http☆10Aug 17, 2023Updated 2 years ago
- SQL for Redis☆11Sep 16, 2022Updated 3 years ago
- Scala HTTP/SOCKS proxy library, based on akka-streams☆10Nov 3, 2018Updated 7 years ago
- Official implementation of OpenTab (ICLR2024)☆13Mar 27, 2024Updated last year