Utilities for writing tests that use Apache Spark.
☆24 · Dec 29, 2018 · Updated 7 years ago
Alternatives and similar repositories for spark-tests
Users who are interested in spark-tests are comparing it to the libraries listed below.
- low-level helpers for Apache Spark libraries and tests ☆16 · Dec 29, 2018 · Updated 7 years ago
- Amazon Kinesis Source for Structured Streaming ☆12 · Nov 6, 2017 · Updated 8 years ago
- Apache Parquet reader in Scala without Apache Spark - developed at Purdue University ☆12 · Feb 17, 2017 · Updated 9 years ago
- Cloud Spanner Connector for Apache Spark ☆17 · Updated this week
- Utilities for converting to and from JSON from Avro records via Hadoop streaming or Hive. ☆29 · Oct 13, 2020 · Updated 5 years ago
- Realistic sample value generators for Scala. ☆16 · Jul 4, 2024 · Updated last year
- ScalaCheck for Spark ☆63 · Apr 2, 2018 · Updated 7 years ago
- Example Porter bundles ☆14 · Oct 13, 2025 · Updated 5 months ago
- Cloudera Manager datasource for Grafana 3.x ☆19 · Jun 28, 2023 · Updated 2 years ago
- TileDB integrations for machine learning data and model i/o (PyTorch, TensorFlow, Scikit-Learn) ☆25 · Dec 4, 2025 · Updated 3 months ago
- A sample monorepo of several Python libraries and commands, using Bazel as build system ☆13 · Oct 11, 2017 · Updated 8 years ago
- A framework to allow MapReduce applications to use Akka actors ☆12 · Jan 15, 2022 · Updated 4 years ago
- ☆23 · Apr 13, 2019 · Updated 6 years ago
- Deploy microk8s on OpenStack with MetalLB ☆12 · Sep 28, 2022 · Updated 3 years ago
- An Ansible role to provision CentOS 7 LXC containers on Proxmox integrated with FreeIPA ☆12 · Oct 12, 2023 · Updated 2 years ago
- Deriving Spark DataFrame schemas from case classes ☆44 · Jun 24, 2024 · Updated last year
- We Are Wizards Blog ☆19 · Oct 31, 2016 · Updated 9 years ago
- Simple implementation of a custom parquet reader/writer ☆11 · Aug 12, 2016 · Updated 9 years ago
- PDF to JSON, JSON to PDF and etc. ☆12 · Apr 18, 2018 · Updated 7 years ago
- Example for experimenting with how JupyterHub can be configured to work with Kerberos ☆33 · Oct 17, 2017 · Updated 8 years ago
- Example of an async enabled spring mvc web application using rxjava ☆14 · Feb 27, 2016 · Updated 10 years ago
- Scripts for parsing / making sense of yarn logs ☆52 · Aug 22, 2016 · Updated 9 years ago
- https://aka.ms/lakehouselab ☆23 · Feb 14, 2023 · Updated 3 years ago
- ☆10 · Aug 20, 2018 · Updated 7 years ago
- Spark/Cassandra/Akka combo to visualize a cloud of words using d3.js ☆11 · Dec 6, 2015 · Updated 10 years ago
- Trigram tokenizer module for SQLite FTS5 ☆14 · Feb 22, 2021 · Updated 5 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark ☆40 · Jun 29, 2017 · Updated 8 years ago
- Make Raspberry Pi up and running in a few command ☆19 · Apr 22, 2018 · Updated 7 years ago
- web crawler ☆14 · Sep 27, 2022 · Updated 3 years ago
- An http server written in Node.js to send apple push notifications ☆55 · Mar 25, 2024 · Updated last year
- Docker image for Spark history server on Kubernetes ☆15 · Mar 13, 2020 · Updated 6 years ago
- An OpenCalais API Interface for Python. ☆21 · Mar 13, 2012 · Updated 14 years ago
- JSON processing command line tool based on JSONSelect (CSS-like selectors for JSON) ☆43 · Sep 28, 2015 · Updated 10 years ago
- This is a http metrics reporter for kafka using Jetty with the Codahale metrics servlets (http://metrics.codahale.com/manual/servlets/kaf… ☆37 · Jul 25, 2017 · Updated 8 years ago
- It consists of all code examples discussed as part of deep learning course taken at algorithmica ☆11 · Oct 1, 2020 · Updated 5 years ago
- Test suite to document the behavior of Spark ☆21 · Apr 15, 2021 · Updated 4 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup. ☆47 · Feb 13, 2020 · Updated 6 years ago
- My terminal setup and config files ☆14 · Oct 30, 2018 · Updated 7 years ago
- ☆11 · Jun 29, 2018 · Updated 7 years ago