Test suite to document the behavior of Spark
☆21 · Apr 15, 2021 · Updated 5 years ago
Alternatives and similar repositories for spark-spec
Users that are interested in spark-spec are comparing it to the libraries listed below.
- Spark data profiling utilities ☆23 · Nov 24, 2018 · Updated 7 years ago
- An example PySpark project with pytest ☆18 · Oct 13, 2017 · Updated 8 years ago
- Parametrize and run scripts as notebooks with jupytext and papermill ☆18 · Sep 29, 2019 · Updated 6 years ago
- Example project demonstrating easy, concise and typechecked JDBC access ☆10 · Feb 9, 2018 · Updated 8 years ago
- Documentation on using the built-in Python debugger, PDB. ☆23 · Dec 8, 2022 · Updated 3 years ago
- Spark functions to run popular phonetic and string matching algorithms ☆59 · Feb 22, 2022 · Updated 4 years ago
- Rake tasks to add Bootstrap, Font Awesome, and Start Bootstrap Landing Pages to a Rails app ☆94 · Feb 28, 2018 · Updated 8 years ago
- JumpSpark - A modern cookiecutter template for pyspark projects with batteries included. ☆10 · May 12, 2023 · Updated 2 years ago
- Slides and code for "Validating Models in R" Strata 2016 RDay http://conferences.oreilly.com/strata/hadoop-big-data-ca/public/schedule/de… ☆10 · Jun 22, 2020 · Updated 5 years ago
- A toolset to streamline running spark python on EMR ☆20 · Nov 16, 2016 · Updated 9 years ago
- Coq BPF interpreter ☆19 · Jan 18, 2018 · Updated 8 years ago
- Customizable graph algorithms in Scala ☆19 · Jun 21, 2024 · Updated last year
- JUnit integration for testing the Apache Hive Metastore and HiveServer2 Thrift APIs ☆26 · Jul 22, 2025 · Updated 9 months ago
- Redux RxJava Observable middleware for Kotlin ☆11 · Sep 18, 2017 · Updated 8 years ago
- Utilities for writing tests that use Apache Spark. ☆24 · Dec 29, 2018 · Updated 7 years ago
- PySpark implementation of the Open Privacy Preserving Record Linkage (OPPRL) specification. ☆26 · Updated this week
- Modeling directed acyclic graphs (DAG) for topological sorting, shortest path, longest path, etc. ☆14 · Sep 1, 2017 · Updated 8 years ago
- Specs2 bindings for Scalaz ☆34 · Dec 28, 2017 · Updated 8 years ago
- A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster. ☆47 · Aug 1, 2016 · Updated 9 years ago
- An example of SparkConnect extension. ☆15 · Mar 5, 2024 · Updated 2 years ago
- Backup mongodb on Heroku and push it to S3 or FTP with cron task. ☆54 · Oct 13, 2015 · Updated 10 years ago
- scala driver for launching Amazon EMR jobs ☆39 · Feb 10, 2016 · Updated 10 years ago
- Speak Slack notifications and process Slack slash commands ☆15 · Dec 20, 2018 · Updated 7 years ago
- ☆26 · Feb 22, 2026 · Updated 2 months ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures. ☆73 · Mar 14, 2021 · Updated 5 years ago
- A PDM plugin to sync the exported files with the project file ☆15 · Sep 6, 2025 · Updated 7 months ago
- Prototype of xml string interpolator for Scala. ☆14 · Mar 28, 2019 · Updated 7 years ago
- Write property based tests easily on spark dataframes ☆21 · Jan 19, 2024 · Updated 2 years ago
- A command-line tool that summarizes the size of a codebase by language, showing lines of code with and without comments and blank lines. ☆58 · Mar 6, 2026 · Updated last month
- Fulfills a GitHub workflow_job webhooks into a Pub/Sub queue. ☆12 · Mar 13, 2025 · Updated last year
- ☆37 · Aug 29, 2018 · Updated 7 years ago
- Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive ☆191 · Oct 15, 2025 · Updated 6 months ago
- ☆18 · Feb 11, 2023 · Updated 3 years ago
- Particle Syntax Website ☆16 · Apr 12, 2026 · Updated 3 weeks ago
- HackerNews reader ☆10 · Nov 13, 2015 · Updated 10 years ago
- Tutorial session from PyData London, Fri 6 May 2016 ☆11 · May 6, 2016 · Updated 9 years ago
- ☆25 · Feb 18, 2026 · Updated 2 months ago
- A stream search engine for the Internet of Things (back-end) ☆30 · Sep 1, 2014 · Updated 11 years ago
- [R package]: Datasets from "Applied Logistic Regression" by Hosmer D.W., Lemeshow S. and Sturdivant X. ☆12 · Nov 3, 2025 · Updated 6 months ago