snowplow-archive / spark-example-projectView external linksLinks
A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR
☆120Mar 28, 2016Updated 9 years ago
Alternatives and similar repositories for spark-example-project
Users that are interested in spark-example-project are comparing it to the libraries listed below
Sorting:
- A Spark Streaming job reading events from Amazon Kinesis and writing event counts to DynamoDB☆93Oct 1, 2020Updated 5 years ago
- ☆11Aug 14, 2014Updated 11 years ago
- HashCats Auto Clicker is a versatile tool that enhances your gaming experience by automating various actions within the HashCats game☆18Updated this week
- ☆33Jan 9, 2016Updated 10 years ago
- ☆11Dec 10, 2015Updated 10 years ago
- This repository hold the Amazon Elastic MapReduce sample bootstrap actions☆613Jun 5, 2023Updated 2 years ago
- ☆92Apr 17, 2017Updated 8 years ago
- Prescriptive Applications over Kite and Hadoop☆12Oct 14, 2015Updated 10 years ago
- Practical examples of using Apache Spark in several different use cases☆102Jun 29, 2016Updated 9 years ago
- Spark UDFs to deserialize Avro messages with schemas stored in Schema Registry.☆20Jan 11, 2018Updated 8 years ago
- Lagom eye for the Akka guy☆13Mar 14, 2016Updated 9 years ago
- Helper for consuming Divolte events from Kafka queues and deserializing Avro records into Java objects using Avro's generated code.☆15Nov 6, 2014Updated 11 years ago
- Apache Spark applications☆70Dec 17, 2017Updated 8 years ago
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase☆14Mar 23, 2016Updated 9 years ago
- Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover…☆13May 3, 2019Updated 6 years ago
- Some AWS EMR examples☆16Jan 18, 2018Updated 8 years ago
- ☆16Sep 17, 2017Updated 8 years ago
- Apache Spark jobs such as Principal Coordinate Analysis.☆75Jan 30, 2017Updated 9 years ago
- A set of tools for copying and streaming data from MongoDB into HBase☆28Jan 27, 2014Updated 12 years ago
- A Locality-Sensitive Hashing Library for Scala with optional Redis storage.☆16Jan 5, 2022Updated 4 years ago
- Scripts and code to import the GDELT dataset into Spark SQL for analysis☆17Aug 29, 2014Updated 11 years ago
- Ambari Service definition for deploying R & RHadoop libraries☆18Aug 3, 2015Updated 10 years ago
- Learning to write Spark examples☆161Aug 20, 2014Updated 11 years ago
- A bash wrapper to help you connect to your instances☆15May 20, 2016Updated 9 years ago
- Low level integration of Spark and Kafka☆130Mar 15, 2018Updated 7 years ago
- scala driver for launching Amazon EMR jobs☆39Feb 10, 2016Updated 10 years ago
- Run templatable playbooks of Hadoop/Spark/et al jobs on Amazon EMR☆19Jan 20, 2026Updated 3 weeks ago
- A CKAN extension for US-DCAT and /data pages in Project Open Data implementation☆24Mar 12, 2025Updated 11 months ago
- Building blocks and patterns for building data prep transformations and feature engineering in Spark.☆16Mar 16, 2016Updated 9 years ago
- A Ruby toolkit for cloud-friendly ETL☆38Jul 29, 2016Updated 9 years ago
- Scala examples for learning to use Spark☆445Sep 17, 2020Updated 5 years ago
- A client/server chat in Java written using Akka remote actors☆24Jan 15, 2011Updated 15 years ago
- Examples for Fast Data Processing with Spark☆59Sep 10, 2013Updated 12 years ago
- Spark Terasort☆121Apr 21, 2023Updated 2 years ago
- This repository implements converters and tools for working with NGS data in HPC or Hadoop cluster☆17Apr 13, 2018Updated 7 years ago
- Apache Spark based ETL Engine☆71Oct 18, 2016Updated 9 years ago
- ☆56Aug 21, 2014Updated 11 years ago
- Repositório público do Professor George Mendes Marra☆101Updated this week
- CMPE352/451 Group 5 repository☆10Dec 21, 2025Updated last month