sujee81/SparkApps

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sujee81/SparkApps)

sujee81 / SparkApps

Apache Spark applications

☆70

Alternatives and similar repositories for SparkApps

Users that are interested in SparkApps are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

looker / spark_log_data
View on GitHub
Flume-to-Spark-Streaming Log Parser
☆23Jun 3, 2016Updated 10 years ago
dkuppitz / openflights
View on GitHub
Sample migration from Titan 0.5.4 to Titan 1.0.0
☆17Feb 4, 2016Updated 10 years ago
sequenceiq / yarn-monitoring
View on GitHub
Hadoop YARN monitoring with R
☆19Sep 16, 2014Updated 11 years ago
EasyPost / syslog-rfc5424-parser
View on GitHub
A small Python module to parse RFC5424-formatted Syslog messages
☆37Oct 17, 2025Updated 9 months ago
yamrcraft / etl-light
View on GitHub
A light Kafka to HDFS/S3 ETL library based on Apache Spark
☆40Jun 29, 2017Updated 9 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
werneckpaiva / spark-to-tableau
View on GitHub
Spark to Tableau Extractor library
☆19Oct 23, 2017Updated 8 years ago
melphi / spark-examples
View on GitHub
Spark examples
☆40May 7, 2024Updated 2 years ago
SparkMonitor / varOne
View on GitHub
Apache Spark Web Monitor Tool, varOne
☆36Aug 26, 2016Updated 9 years ago
OopsOutOfMemory / spark-sql-hbase
View on GitHub
A Spark SQL HBase connector
☆29May 4, 2015Updated 11 years ago
thunderain-project / thunderain
View on GitHub
A Real-Time Analytical Processing (RTAP) example using Spark/Shark
☆51Feb 21, 2014Updated 12 years ago
avensolutions / cdc-at-scale-using-spark
View on GitHub
Scalable CDC Pattern Implemented using PySpark
☆18Oct 8, 2025Updated 9 months ago
ExpediaGroup / hiveberg
View on GitHub
Demonstration of a Hive Input Format for Iceberg
☆26Mar 12, 2021Updated 5 years ago
Huawei-Spark / Spark-SQL-on-HBase
View on GitHub
Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces
☆316Apr 12, 2022Updated 4 years ago
TechnocratSid / spring-spark-word-count
View on GitHub
Apache Spark’s classic Word Count example with Spring Boot.
☆11Apr 21, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
koeninger / kafka-exactly-once
View on GitHub
☆242Jun 14, 2018Updated 8 years ago
DataStax-Examples / SparkBuildExamples
View on GitHub
Example projects for using Spark and Cassandra With DSE Analytics
☆59Oct 10, 2025Updated 9 months ago
jinoos / flume-ng-extends
View on GitHub
Source of Flume NG for tailing files in multiple directories
☆24Jun 16, 2015Updated 11 years ago
NikhilSuthar / Scala-Spark-Mail
View on GitHub
Scala utility to send mail
☆14May 4, 2020Updated 6 years ago
sryza / spark-ts-examples
View on GitHub
Spark TS Examples
☆122Dec 17, 2023Updated 2 years ago
rathboma / hive-extension-examples
View on GitHub
Examples for extending hive
☆90Jan 25, 2018Updated 8 years ago
dstreev / hdp-data-gen
View on GitHub
Hortonworks Data Platform Data Generation Tool
☆13Nov 30, 2017Updated 8 years ago
deepsense-ai / seahorse-workflow-executor
View on GitHub
☆41Jul 19, 2017Updated 9 years ago
databricks / simr
View on GitHub
Spark In MapReduce (SIMR) - launching Spark applications on existing Hadoop MapReduce infrastructure
☆44Mar 9, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
nivdul / spark-in-practice
View on GitHub
Getting started with Spark, Spark Streaming, Spark SQL, DataFrame
☆35Apr 24, 2016Updated 10 years ago
potix2 / spark-google-spreadsheets
View on GitHub
Google Spreadsheets datasource for SparkSQL and DataFrames
☆58Jul 24, 2023Updated 2 years ago
aaalgo / yarn-memory-tracker
View on GitHub
Track app memory usage.
☆11Jan 13, 2015Updated 11 years ago
boozallen / cognition
View on GitHub
Cognition is an open-source platform for data ingest, data fusion and search
☆22Feb 8, 2016Updated 10 years ago
skrusche63 / spark-piwik
View on GitHub
Beyond Piwik Analytics with Scala and Apache Spark
☆46Nov 30, 2014Updated 11 years ago
RetailRocket / SparkMultiTool
View on GitHub
Tools for spark which we use on the daily basis
☆65Jul 2, 2020Updated 6 years ago
hortonworks-spark / cloud-integration
View on GitHub
Spark cloud integration: tests, cloud committers and more
☆20Jan 30, 2025Updated last year
cloudera-labs / SparkOnHBase
View on GitHub
SparkOnHBase
☆278Mar 30, 2021Updated 5 years ago
yohanliyanage / jenkins-spark-deploy
View on GitHub
A Jenkins plugin that allows to deploy / stop Apache Spark applications in Spark standalone clusters.
☆10Oct 25, 2015Updated 10 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
databricks / spark-csv
View on GitHub
CSV Data Source for Apache Spark 1.x
☆1,057Dec 13, 2018Updated 7 years ago
arrahtech / osdq-desktop
View on GitHub
The classic desktop version of osDQ
☆10Jun 30, 2022Updated 4 years ago
dmatrix / examples
View on GitHub
These are some code examples
☆56Jan 12, 2020Updated 6 years ago
rucek / akka-streams-in-practice-java8
View on GitHub
Import data from CSV files to Cassandra using Akka Streams with Java 8
☆21May 19, 2017Updated 9 years ago
masayuki038 / calcite-arrow-sample
View on GitHub
calcite-arrow-sample(WIP)
☆13Dec 17, 2017Updated 8 years ago
dhinojosa / scala_core_programming_1
View on GitHub
O'Reilly Scala Programming Fundamentals: Methods, Classes, Traits
☆13Jul 16, 2018Updated 8 years ago
emizell / HBase-Code-Samples
View on GitHub
A series of demos using HBase Standalone and Phoenix/HBase
☆19Apr 10, 2015Updated 11 years ago