snowplow-archive/spark-streaming-example-project

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/snowplow-archive/spark-streaming-example-project)

snowplow-archive / spark-streaming-example-project

A Spark Streaming job reading events from Amazon Kinesis and writing event counts to DynamoDB

☆94

Alternatives and similar repositories for spark-streaming-example-project

Users that are interested in spark-streaming-example-project are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

snowplow-archive / spark-example-project
View on GitHub
A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR
☆119Mar 28, 2016Updated 10 years ago
snowplow-archive / icebucket
View on GitHub
UNRELEASED. An opinionated framework for analytics-on-write on event streams using key-value storage
☆14Sep 10, 2015Updated 10 years ago
dleung / spark-streaming-kafka-example
View on GitHub
An example project using Spark Streaming with Kafka message and Avro serialization
☆12Aug 21, 2015Updated 10 years ago
thunderain-project / thunderain
View on GitHub
A Real-Time Analytical Processing (RTAP) example using Spark/Shark
☆51Feb 21, 2014Updated 12 years ago
gwenshap / SparkStreamingExample
View on GitHub
☆55Aug 21, 2014Updated 11 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
CodeRayZhang / Spark-Example
View on GitHub
Spark1.6和spark2.2的示例，包含kafka,flume,structuredstreaming,jedis,elasticsearch,mysql,dataframe
☆15Jan 28, 2018Updated 8 years ago
implydata / druid-hadoop-inputformat
View on GitHub
Hadoop InputFormat for http://druid.io/
☆10Oct 26, 2016Updated 9 years ago
Gschiavon / Kafka-SparkStreaming-HDFS
View on GitHub
☆14Nov 3, 2016Updated 9 years ago
aseigneurin / kafka-sandbox
View on GitHub
☆48Feb 4, 2018Updated 8 years ago
caroljmcdonald / mapr-streams-sparkstreaming-hbase
View on GitHub
☆24Apr 29, 2016Updated 10 years ago
snowplow-archive / kinesis-tee
View on GitHub
Unix tee, but for Kinesis streams
☆12Oct 19, 2021Updated 4 years ago
marceloemanoel / play-sb-admin2
View on GitHub
Play framework template based on SB-Admin-2
☆13Mar 13, 2015Updated 11 years ago
adriaanm / scala-virtualized-tutorial
View on GitHub
Tutorial programs that illustrate how to use scala-virtualized
☆23Aug 28, 2019Updated 6 years ago
aamend / spark-gdelt
View on GitHub
Binding the GDELT universe in a Spark environment
☆26Apr 21, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
snowplow-incubator / snowplow-google-analytics-plugin
View on GitHub
Google Analytics plugin for sending events to Snowplow
☆17Sep 30, 2020Updated 5 years ago
BenFradet / spark-kafka-writer
View on GitHub
Write your Spark data to Kafka seamlessly
☆172Jul 10, 2024Updated 2 years ago
tresata / spark-kafka
View on GitHub
Low level integration of Spark and Kafka
☆129Mar 15, 2018Updated 8 years ago
krasserm / akka-analytics
View on GitHub
Large-scale event processing with Akka Persistence and Apache Spark
☆271Jun 18, 2016Updated 10 years ago
mkuthan / example-spark-kafka
View on GitHub
Apache Spark and Apache Kafka integration example
☆122Dec 21, 2017Updated 8 years ago
opencore / kafka-spark-avro-example
View on GitHub
Example project to show how to use Kafka from Spark Streaming with the Confluent schema registry
☆11Aug 17, 2016Updated 9 years ago
tmalaska / SparkStreaming.Sessionization
View on GitHub
NRT Sessionization with Spark Streaming landing on HDFS and putting live stats in HBase
☆50Oct 31, 2014Updated 11 years ago
sergey-melnychuk / distributed-algorithms
View on GitHub
Implementation of classic distributed algorithms: membership, failure detection, quorum, replication etc.
☆12Jan 12, 2020Updated 6 years ago
velvia / spark-sql-gdelt
View on GitHub
Scripts and code to import the GDELT dataset into Spark SQL for analysis
☆17Aug 29, 2014Updated 11 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
awslabs / amazon-kinesis-aggregators
View on GitHub
Amazon Kinesis Aggregators provides a simple way to create real time aggregations of data on Amazon Kinesis.
☆151Apr 30, 2021Updated 5 years ago
chimpler / blog-spark-streaming-log-aggregation
View on GitHub
Example of use of Spark Streaming with Kafka
☆89Jul 11, 2014Updated 12 years ago
memsql / streamliner-starter
View on GitHub
Starter project for building MemSQL Streamliner Pipelines
☆32Apr 18, 2017Updated 9 years ago
dibbhatt / kafka-spark-consumer
View on GitHub
High Performance Kafka Connector for Spark Streaming.Supports Multi Topic Fetch, Kafka Security. Reliable offset management in Zookeeper.…
☆632Apr 24, 2026Updated 3 months ago
tptodorov / sbt-cloudformation
View on GitHub
SBT plugin for creating and managing AWS CloudFormation stacks
☆11Jan 8, 2018Updated 8 years ago
justinrmiller / spark-kafka-parquet-example
View on GitHub
An example project that combines Spark Streaming, Kafka, and Parquet to transform JSON objects streamed over Kafka into Parquet files in …
☆19Jun 22, 2021Updated 5 years ago
phensley / protobuf-vs-jackson
View on GitHub
Quick benchmark comparing Protocol Buffers 3 vs Jackson JSON
☆14Jul 3, 2015Updated 11 years ago
ahanwadi / paxos
View on GitHub
Implementation of Paxos
☆21Apr 4, 2015Updated 11 years ago
tuplejump / embedded-kafka
View on GitHub
Embedded Kafka for testing and quick prototyping.
☆14Apr 19, 2016Updated 10 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
koeninger / kafka-exactly-once
View on GitHub
☆241Jun 14, 2018Updated 8 years ago
ianoc / SparkEMRBootstrap
View on GitHub
Files to help make new spark EMR Bootstraps
☆15Aug 4, 2013Updated 12 years ago
databricks / als-benchmark-scripts
View on GitHub
Scripts to benchmark distributed Alternative Least Squares (ALS)
☆22Jul 19, 2014Updated 12 years ago
daithiocrualaoich / spark-emr
View on GitHub
Spark Elastic MapReduce bootstrap and runnable examples.
☆17Jun 26, 2013Updated 13 years ago
aws-solutions-library-samples / real-time-analytics-spark-streaming
View on GitHub
A solution describing data-processing design pattern for streaming data through Kinesis and Spark Streaming at real-time.
☆39Jun 11, 2024Updated 2 years ago
ezhulenev / akka-var-calculation
View on GitHub
Akka Cluster for Value-at-Risk calculation
☆13May 2, 2014Updated 12 years ago
vpon / protobuf-to-avro
View on GitHub
dynamically parse protobuf message then convert to avro
☆25May 27, 2015Updated 11 years ago