Utilities for writing tests that use Apache Spark.
☆24Dec 29, 2018Updated 7 years ago
Alternatives and similar repositories for spark-tests
Users that are interested in spark-tests are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- low-level helpers for Apache Spark libraries and tests☆16Dec 29, 2018Updated 7 years ago
- Scala API for Apache Spark SQL high-order functions☆14Aug 4, 2023Updated 2 years ago
- Atomic Scala Book Solutions - for Beginners and first time Functional Programmers☆12Mar 10, 2020Updated 6 years ago
- Amazon Kinesis Source for Structured Streaming☆12Nov 6, 2017Updated 8 years ago
- Apache Parquet reader in Scala without Apache Spark - developed at Purdue University☆12Feb 17, 2017Updated 9 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Cloud Spanner Connector for Apache Spark☆18May 9, 2026Updated last week
- Azure Synapse Analytics Samples☆14Feb 15, 2023Updated 3 years ago
- All Certification and preparation, examples & others☆11Oct 18, 2018Updated 7 years ago
- Utilities for converting to and from JSON from Avro records via Hadoop streaming or Hive.☆30Oct 13, 2020Updated 5 years ago
- A Spark metrics sink that pushes to InfluxDb☆51Jan 14, 2021Updated 5 years ago
- Realistic sample value generators for Scala.☆16Jul 4, 2024Updated last year
- ScalaCheck for Spark☆63Apr 2, 2018Updated 8 years ago
- Example Porter bundles☆14Oct 13, 2025Updated 7 months ago
- Cloudera Manager datasource for Grafana 3.x☆19Jun 28, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Spark with Scala example projects☆34Apr 17, 2019Updated 7 years ago
- TileDB integrations for machine learning data and model i/o (PyTorch, TensorFlow, Scikit-Learn)☆26Dec 4, 2025Updated 5 months ago
- Example project demonstrating easy, concise and typechecked JDBC access☆10Feb 9, 2018Updated 8 years ago
- Dynamically loads bundled JNI libraries based on the runtime platform.☆10Dec 19, 2014Updated 11 years ago
- A framework to allow MapReduce applications to use Akka actors☆12Jan 15, 2022Updated 4 years ago
- A colorful ls command, with awesome icons.☆30Updated this week
- An Ansible role to provision CentOS 7 LXC containers on Proxmox integrated with FreeIPA☆12Oct 12, 2023Updated 2 years ago
- Deriving Spark DataFrame schemas from case classes☆44Jun 24, 2024Updated last year
- Simple implementation of a custom parquet reader/writer☆11Aug 12, 2016Updated 9 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- PDF to JSON, JSON to PDF and etc.☆12Apr 18, 2018Updated 8 years ago
- Every language system in production at SAiaPS must conform to this API.☆11Apr 26, 2018Updated 8 years ago
- Example for experimenting with how JupyterHub can be configured to work with Kerberos☆33Oct 17, 2017Updated 8 years ago
- Scripts for parsing / making sense of yarn logs☆52Aug 22, 2016Updated 9 years ago
- https://aka.ms/lakehouselab☆23Feb 14, 2023Updated 3 years ago
- Spark/Cassandra/Akka combo to visualize a cloud of words using d3.js☆11Dec 6, 2015Updated 10 years ago
- Trigram tokenizer module for SQLite FTS5☆14Feb 22, 2021Updated 5 years ago
- Make Raspberry Pi up and running in a few command☆19Apr 22, 2018Updated 8 years ago
- An http server written in Node.js to send apple push notifications☆54Mar 25, 2024Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- automatic visual data explorer for datasette☆14Apr 20, 2023Updated 3 years ago
- Run spark calculations from Ammonite☆117Apr 22, 2026Updated last month
- An OpenCalais API Interface for Python.☆21Mar 13, 2012Updated 14 years ago
- JSON processing command line tool based on JSONSelect (CSS-like selectors for JSON)☆43Sep 28, 2015Updated 10 years ago
- Ready-to-go Parquet-formatted public 'omics datasets☆30Nov 2, 2015Updated 10 years ago
- This is a http metrics reporter for kafka using Jetty with the Codahale metrics servlets (http://metrics.codahale.com/manual/servlets/kaf…☆37Jul 25, 2017Updated 8 years ago
- Test suite to document the behavior of Spark☆21Apr 15, 2021Updated 5 years ago