Capture the logical plan from Spark (SQL)
☆22Mar 6, 2021Updated 5 years ago
Alternatives and similar repositories for SparkDataLineageCapture
Users that are interested in SparkDataLineageCapture are comparing it to the libraries listed below.
- Implementation of a Big Data (batch and stream) distributed processing engine in Java using Akka actors.☆12Feb 20, 2023Updated 3 years ago
- A facebook for data☆26May 31, 2019Updated 6 years ago
- Spark SQL listener to record lineage information☆28Jan 24, 2021Updated 5 years ago
- Scalable CDC Pattern Implemented using PySpark☆18Oct 8, 2025Updated 7 months ago
- A platform to manage the data product life cycle☆22Mar 25, 2026Updated last month
- An sbt plugin for publishing packages to AWS CodeArtifact.☆26May 29, 2024Updated last year
- Xenon is a WebDriver proxy, for running multiple WebDriver sessions through a single hub☆12May 2, 2026Updated 2 weeks ago
- Showing the relationship between ImageNet ID and labels and pytorch pre-trained model output ID and labels☆10Oct 11, 2020Updated 5 years ago
- Fast, reliable, and scalable channels implementation based on Redis streams.☆11Jun 25, 2024Updated last year
- software transactional memory in rust☆14Jul 20, 2021Updated 4 years ago
- A simple golang job queue☆13Jan 19, 2023Updated 3 years ago
- The Ray Tracer Challenge by Jamis Buck written in Rust. Broken down chapter by chapter.☆11Feb 27, 2026Updated 2 months ago
- Spark field lineage☆32Jun 7, 2022Updated 3 years ago
- Exposes Redis stream through the command line☆12Jun 28, 2022Updated 3 years ago
- An exploration of Flink and change-data-capture via flink-cdc-connectors☆11Jul 7, 2021Updated 4 years ago
- Code for the book - Practical Redis☆18Jan 5, 2019Updated 7 years ago
- Scala HTTP/SOCKS proxy library, based on akka-streams☆10Nov 3, 2018Updated 7 years ago
- Java Alerting Framework for ElasticSearch☆12May 20, 2016Updated 10 years ago
- A Fully HiveServer2-like Multi-tenancy Spark Thrift Server Supporting Impersonation and Multi-SparkContext with Ranger Authorization (GO …☆10Jul 7, 2022Updated 3 years ago
- Hortonworks Data Platform Data Generation Tool☆13Nov 30, 2017Updated 8 years ago
- Open source task scheduler with dependency management☆15Jul 1, 2018Updated 7 years ago
- An implementation of Dijkstra in Clojure☆19Aug 7, 2012Updated 13 years ago
- A timer module for Redis☆11Oct 16, 2019Updated 6 years ago
- Rob Pike's examples from the "Go Concurrency Patterns" talk, but in Rust☆13Jul 9, 2022Updated 3 years ago
- Integration of Iceberg table management into Spark SQL☆11Jan 21, 2020Updated 6 years ago
- A bunch of low-level basic methods for data processing and monitoring with Scala Spark☆10Jun 29, 2018Updated 7 years ago
- Traditionally, engineers were needed to implement business logic via data pipelines before business users can start using it. Using this …☆12May 15, 2026Updated last week
- Cross-platform polyfills.☆19Aug 22, 2023Updated 2 years ago
- Second generation of the ICGC DCC release ETL built on Spark☆10Apr 8, 2019Updated 7 years ago
- A proxy server implemented on Netty☆11Nov 17, 2019Updated 6 years ago
- Pachyderm/MLeap team up to provide versioned datasets + models☆10Jun 7, 2017Updated 8 years ago
- Redis search and indexing in Java☆16Sep 26, 2016Updated 9 years ago
- Grafana dashboards for Elasticsearch datasource☆16Feb 18, 2017Updated 9 years ago
- Pretends to be a Redis Cluster node that accepts RCmb (Redis Cluster message bus) messages and interacts with a Redis cluster☆13Sep 4, 2025Updated 8 months ago
- Lab project to showcase Flink's performance differences between using a SQL query and implementing the same logic via the DataStream API☆14Apr 15, 2020Updated 6 years ago
- ☆10Aug 13, 2021Updated 4 years ago
- Code samples, summaries, cheatsheets and other study material for Hadoop MapReduce and Apache Spark☆10Aug 17, 2018Updated 7 years ago