Data-Driven Spark allows quick data exploration based on Apache Spark.
☆29Jan 6, 2017Updated 9 years ago
Alternatives and similar repositories for spawncamping-dds
Users who are interested in spawncamping-dds are comparing it to the libraries listed below.
- Spark package for checking data quality☆223Feb 28, 2020Updated 6 years ago
- Sandbox of Spark-notebook☆10Mar 19, 2016Updated 9 years ago
- Complete Pipeline Training at Big Data Scala By the Bay☆71Oct 27, 2015Updated 10 years ago
- Spark Streaming with Scala and Akka Activator template☆44Jan 31, 2016Updated 10 years ago
- ☆12May 16, 2017Updated 8 years ago
- Some random how-to examples relating to Databricks.☆15Nov 3, 2021Updated 4 years ago
- A Hivemall wrapper for Spark☆31Apr 21, 2016Updated 9 years ago
- Sample custom Nifi processor to process tcpdump☆18Nov 19, 2015Updated 10 years ago
- Playing with forms in Scala.js☆16May 13, 2016Updated 9 years ago
- A web service for discovery of destinations matching your expected weather conditions (and hints on how to get there).☆32Apr 23, 2016Updated 9 years ago
- ☆16Aug 30, 2025Updated 6 months ago
- Office status board for Pivotal Labs offices using Dashing☆22Mar 24, 2022Updated 3 years ago
- A primal-dual framework for distributed L1-regularized optimization☆37Apr 18, 2016Updated 9 years ago
- Slides for our 2015 event☆13May 14, 2015Updated 10 years ago
- Write SQL in Scala☆30Nov 25, 2025Updated 3 months ago
- Distributed solver library for large-scale structured output prediction, based on Spark. Project website:☆17Mar 3, 2016Updated 9 years ago
- Some Spark implementations of clustering algorithms.☆19Nov 13, 2018Updated 7 years ago
- Ansible playbook for automated HDP 2.x deployment install with Kerberos☆19Sep 8, 2016Updated 9 years ago
- analytics tool kit☆42Jan 23, 2017Updated 9 years ago
- Code base for DSLs In Action (http://www.manning.com/ghosh)☆44Dec 5, 2010Updated 15 years ago
- A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster.☆47Aug 1, 2016Updated 9 years ago
- Pig on Apache Spark☆82Mar 23, 2015Updated 10 years ago
- An SBT plugin for automatically calling Avro code generation and a thin scala wrapper for reading and writing Avro files☆22Mar 8, 2018Updated 7 years ago
- The first Spark firmware library. To exemplify naming conventions and required files.☆29Jan 3, 2018Updated 8 years ago
- Use Material Design Lite components from React in Scala.js!☆24Jan 10, 2017Updated 9 years ago
- Big Data Science Swiss Army Knife - http://www.tuktu.io☆60Feb 15, 2018Updated 8 years ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆62Sep 6, 2024Updated last year
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆146Jan 26, 2016Updated 10 years ago
- Data quality control tool built on spark and deequ☆25Jan 22, 2026Updated last month
- Fluent Scala DSL for Google's Cloud Dataflow SDK☆56Aug 2, 2015Updated 10 years ago
- Machine Learning over Twitter's stream. Using Apache Spark, Web Server and Lightning Graph server.☆27Jun 19, 2016Updated 9 years ago
- Anonymizing Library for Apache Spark☆31Nov 9, 2023Updated 2 years ago
- Point-in-Time optimizations for Apache Spark☆30Jan 18, 2024Updated 2 years ago
- Stocator is high performing connector to object storage for Apache Spark, achieving performance by leveraging object storage semantics.☆114May 17, 2024Updated last year
- dllib is a distributed deep learning library running on Apache Spark☆32Oct 26, 2017Updated 8 years ago
- ScalaCheck for Spark☆63Apr 2, 2018Updated 7 years ago
- Apache Spark OpenCPU Executor (ROSE)☆26Jun 16, 2018Updated 7 years ago
- Boilerplate project for MOTW Workshop 2015☆10Mar 3, 2016Updated 9 years ago
- Scala framework for collecting performance metrics and conducting sound experimental benchmarking.☆13Nov 19, 2025Updated 3 months ago