Utilities for writing tests that use Apache Spark.
☆24Dec 29, 2018Updated 7 years ago
Alternatives and similar repositories for spark-tests
Users that are interested in spark-tests are comparing it to the libraries listed below
Sorting:
- low-level helpers for Apache Spark libraries and tests☆16Dec 29, 2018Updated 7 years ago
- Miscellaneous functionality for manipulating Apache Spark RDDs.☆22Dec 29, 2018Updated 7 years ago
- Atomic Scala Book Solutions - for Beginners and first time Functional Programmers☆12Mar 10, 2020Updated 5 years ago
- Bulletproof Apache Spark jobs with fast root cause analysis of failures.☆73Mar 14, 2021Updated 4 years ago
- Azure Synapse Analytics Samples☆14Feb 15, 2023Updated 3 years ago
- Scala API for Apache Spark SQL high-order functions☆14Aug 4, 2023Updated 2 years ago
- Cloud Spanner Connector for Apache Spark☆17Feb 23, 2026Updated last week
- All Certification and preparation, examples & others☆12Oct 18, 2018Updated 7 years ago
- Amazon Kinesis Source for Structured Streaming☆12Nov 6, 2017Updated 8 years ago
- A Spark metrics sink that pushes to InfluxDb☆51Jan 14, 2021Updated 5 years ago
- Spark with Scala example projects☆34Apr 17, 2019Updated 6 years ago
- Apache Parquet reader in Scala without Apache Spark - developed at Purdue University☆12Feb 17, 2017Updated 9 years ago
- Cloudera Manager datasource for Grafana 3.x☆19Jun 28, 2023Updated 2 years ago
- ScalaCheck for Spark☆63Apr 2, 2018Updated 7 years ago
- TileDB integrations for machine learning data and model i/o (PyTorch, TensorFlow, Scikit-Learn)☆25Dec 4, 2025Updated 2 months ago
- https://aka.ms/lakehouselab☆23Feb 14, 2023Updated 3 years ago
- Realistic sample value generators for Scala.☆16Jul 4, 2024Updated last year
- A pure python mock of pyspark's RDD☆27Jun 22, 2018Updated 7 years ago
- Scripts for parsing / making sense of yarn logs☆52Aug 22, 2016Updated 9 years ago
- A colorful ls command, with awesome icons.☆30Updated this week
- Run spark calculations from Ammonite☆117Feb 20, 2026Updated last week
- It consists of all code examples discussed as part of deep learning course taken at algorithmica☆11Oct 1, 2020Updated 5 years ago
- This is a http metrics reporter for kafka using Jetty with the Codahale metrics servlets (http://metrics.codahale.com/manual/servlets/kaf…☆37Jul 25, 2017Updated 8 years ago
- explore kafka, spark, fs2 and pure functional programming in scala☆34Updated this week
- Presentations and other resources.☆37Jul 13, 2020Updated 5 years ago
- Scala wrapper around the Google Sheets API☆33Jul 4, 2024Updated last year
- ☆13Nov 10, 2025Updated 3 months ago
- Create your landing page with all the common features: A release date counter, email subscribers, social integration, segmentation, etc.☆43Nov 5, 2025Updated 3 months ago
- Example for experimenting with how JupyterHub can be configured to work with Kerberos☆33Oct 17, 2017Updated 8 years ago
- Schema Registry integration for Apache Spark☆40Nov 16, 2022Updated 3 years ago
- Code snippets used in demos recorded for the blog.☆38Feb 17, 2026Updated 2 weeks ago
- Easy note taking in Vim☆19Nov 6, 2015Updated 10 years ago
- Everything which has to do with Data Integration. Templates for Azure Data Factory and Azure Synapse Analytics☆10Jan 29, 2022Updated 4 years ago
- This is a list of YAML file examples for Docker, Kubernetes, Ansible. Also includes a Python script.☆10Jan 12, 2021Updated 5 years ago
- Package provides java implementation of the latent dirichlet allocation (LDA) for topic modelling☆10May 18, 2017Updated 8 years ago
- phData Pulse application log aggregation and monitoring☆13Apr 13, 2020Updated 5 years ago
- web crawler☆14Sep 27, 2022Updated 3 years ago
- Apache Spark ETL Utilities☆39Oct 23, 2024Updated last year
- HBase tailored but otherwise generic JMXToolkit.☆28Jul 6, 2016Updated 9 years ago