Test suite to document the behavior of Spark
☆21Apr 15, 2021Updated 4 years ago
Alternatives and similar repositories for spark-spec
Users that are interested in spark-spec are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Spark data profiling utilities☆23Nov 24, 2018Updated 7 years ago
- An example PySpark project with pytest☆18Oct 13, 2017Updated 8 years ago
- Parametrize and run scripts as notebooks with jupytext and papermill☆18Sep 29, 2019Updated 6 years ago
- Shed light on your data layout in order to monitor the health of your Lakehouse tables and identify when data maintenance operations shou…☆10Jul 31, 2023Updated 2 years ago
- Example project demonstrating easy, concise and typechecked JDBC access☆10Feb 9, 2018Updated 8 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Spark functions to run popular phonetic and string matching algorithms☆59Feb 22, 2022Updated 4 years ago
- Material for the Jupytext+Papermill blog post☆31Jun 30, 2020Updated 5 years ago
- Rake tasks to add Bootstrap, Font Awesome, and Start Bootstrap Landing Pages to a Rails app☆94Feb 28, 2018Updated 8 years ago
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included.☆10May 12, 2023Updated 2 years ago
- Optics for Spark DataFrames☆47Mar 5, 2021Updated 5 years ago
- Utilities for writing tests that use Apache Spark.☆24Dec 29, 2018Updated 7 years ago
- PySpark implementation of the Open Privacy Preserving Record Linkage (OPPRL) specification.☆23Nov 7, 2025Updated 4 months ago
- An example of SparkConnect extension.☆15Mar 5, 2024Updated 2 years ago
- Backup mongodb on Heroku and push it to S3 or FTP with cron task.☆55Oct 13, 2015Updated 10 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- eep.erl - Embedded Event Processing☆37Jul 3, 2015Updated 10 years ago
- scala driver for launching Amazon EMR jobs☆39Feb 10, 2016Updated 10 years ago
- Speak Slack notifications and process Slack slash commands☆15Dec 20, 2018Updated 7 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆73Mar 14, 2021Updated 5 years ago
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)☆455Feb 8, 2026Updated last month
- A PDM plugin to sync the exported files with the project file☆15Sep 6, 2025Updated 6 months ago
- Dyna Blaster in Java☆15Jun 15, 2011Updated 14 years ago
- A giter8 template for Spark SBT projects☆72Mar 20, 2021Updated 5 years ago
- Prototype of xml string interpolator for Scala.☆14Mar 28, 2019Updated 6 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Write property based tests easily on spark dataframes☆20Jan 19, 2024Updated 2 years ago
- A command-line tool that summarizes the size of a codebase by language, showing lines of code with and without comments and blank lines.☆51Mar 6, 2026Updated 2 weeks ago
- Fulfills a GitHub workflow_job webhooks into a Pub/Sub queue.☆12Mar 13, 2025Updated last year
- ☆37Aug 29, 2018Updated 7 years ago
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive☆187Oct 15, 2025Updated 5 months ago
- outcasts no longer allowed in the ivory tower☆28Aug 25, 2015Updated 10 years ago
- Minimal examples of data structures and algorithms in Scala☆24May 3, 2019Updated 6 years ago
- App from Udemy Ionic course that is similar to Airbnb. Firebase 'ionic-maps-api' project deleted. Would need a new Firebase project cre…☆15Feb 4, 2023Updated 3 years ago
- Particle Syntax Website☆16Sep 16, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A library you can include in your Spark job to validate the counters and perform operations on success. Goal is scala/java/python support…☆109Feb 1, 2018Updated 8 years ago
- Reactive Apps with Spring 5☆14Dec 13, 2017Updated 8 years ago
- Mesos on Mesos☆15Mar 11, 2015Updated 11 years ago
- ☆10Nov 27, 2016Updated 9 years ago
- [R package]: Datasets from "Applied Logistic Regression" by Hosmer D.W., Lemeshow S. and Sturdivant X.☆12Nov 3, 2025Updated 4 months ago
- The DAMN (Data Assets Metric Navigation) tool extracts and reports metrics about your data assets☆11Dec 27, 2024Updated last year
- Open data for mobility in the Greater Oslo area☆10Oct 1, 2019Updated 6 years ago