dhutchis / d4mBB
Introducing D4M with Baseball analytics
☆17Updated 11 years ago
Alternatives and similar repositories for d4mBB:
Users that are interested in d4mBB are comparing it to the libraries listed below
- This project describes the D4M 2.0 Schema used in many Accumulo systems.☆21Updated 4 years ago
- Dynamic Distributed Dimensional Data Model☆42Updated last year
- Apache Spark OpenCPU Executor (ROSE)☆26Updated 6 years ago
- Avro Schema Shredder is a REST API that enables storage of Avro Schemas in Apache Atlas. This API enables an organization to use Apache A…☆13Updated 8 years ago
- ☆61Updated 7 months ago
- Simplify getting Zeppelin up and running☆56Updated 8 years ago
- Example program that writes Parquet formatted data to plain files (i.e., not Hadoop hdfs); Parquet is a columnar storage format.☆38Updated 2 years ago
- Data Science with Apache Spark and Spark Notebook☆30Updated 7 years ago
- ☆53Updated last year
- Routines and data structures for using isarn-sketches idiomatically in Apache Spark☆29Updated 11 months ago
- ☆41Updated 7 years ago
- ☆12Updated 9 years ago
- ☆15Updated 7 years ago
- Hadoop Data Pipeline using Falcon☆15Updated 9 years ago
- Complete Pipeline Training at Big Data Scala By the Bay☆71Updated 9 years ago
- Bucketing and partitioning system for Parquet☆30Updated 6 years ago
- Presto Accumulo Integration☆24Updated last year
- JDBC driver for data.world☆18Updated 7 months ago
- Generate Avro schema and Avro binary from XSD schema and XML☆68Updated 8 years ago
- Big Data Science Swiss Army Knife - http://www.tuktu.io --☆60Updated 7 years ago
- control spark-shell from vim☆11Updated 8 years ago
- Examples of spark-lucenerdd☆15Updated last year
- A NiFi client library for JVM languages☆13Updated 9 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆16Updated 5 years ago
- Additional useful algorithms that can be used with spark.☆24Updated 10 years ago
- Demonstrates NiFi template deployment and configuration via a REST API☆70Updated 8 years ago
- SparkAtScale☆11Updated 8 years ago
- Apache Fluo Muchos☆26Updated 5 months ago
- A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster.☆47Updated 8 years ago
- docker image with spark and zeppelin☆12Updated 5 years ago