SimpleDataLabsInc / prophecy-build-toolLinks
Prophecy-built-tool (PBT) allows you to quickly build projects generated by Prophecy (your standard Spark Scala and PySpark pipelines) to integrate them with your own CI / CD (e.g. Github Actions), build system (e.g. Jenkins), and orchestration (e.g. Databricks Workflows).
☆28Updated 2 weeks ago
Alternatives and similar repositories for prophecy-build-tool
Users that are interested in prophecy-build-tool are comparing it to the libraries listed below
Sorting:
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆808Updated 2 weeks ago
- Essential Spark extensions and helper methods ✨😲☆766Updated 4 months ago
- Qubole Sparklens tool for performance tuning Apache Spark☆587Updated last year
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)☆453Updated 2 weeks ago
- Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks☆360Updated 8 years ago
- Avro SerDe for Apache Spark structured APIs.☆240Updated 7 months ago
- Base classes to use when writing tests with Spark☆1,549Updated last month
- ☆314Updated 7 years ago
- The Internals of Apache Spark☆1,538Updated 6 months ago
- The iterative broadcast join example code.☆70Updated 8 years ago
- A simplified, lightweight ETL Framework based on Apache Spark☆588Updated 2 years ago
- A Spark plugin for reading and writing Excel files☆519Updated last month
- ☆248Updated 6 years ago
- Spark style guide☆272Updated last year
- Apache Spark™ and Scala Workshops☆264Updated last year
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆346Updated last year
- A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.☆675Updated 3 years ago
- ☆130Updated 8 years ago
- Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.☆942Updated last week
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark☆1,370Updated 2 years ago
- Kerberos and Hadoop: The Madness beyond the Gate☆281Updated 2 years ago
- The Internals of Spark SQL☆483Updated this week
- ☆23Updated last year
- Netezza Connector for Apache Spark☆13Updated 7 years ago
- Cloudera Hadoop for Developers☆15Updated 10 years ago
- Spark connector for SFTP☆98Updated 2 years ago
- Yet Another SPark Framework☆10Updated 2 years ago
- Spark on Kubernetes infrastructure Helm charts repo☆203Updated 3 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,007Updated 3 years ago
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive☆185Updated 3 months ago