Apache Spark applications
☆70Dec 17, 2017Updated 8 years ago
Alternatives and similar repositories for SparkApps
Users that are interested in SparkApps are comparing it to the libraries listed below
Sorting:
- Spark examples☆41May 7, 2024Updated last year
- Demonstration of a Hive Input Format for Iceberg☆26Mar 12, 2021Updated 4 years ago
- Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream …☆22Feb 6, 2017Updated 9 years ago
- Spark to Tableau Extractor library☆19Oct 23, 2017Updated 8 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Feb 21, 2014Updated 12 years ago
- ☆11Dec 10, 2015Updated 10 years ago
- ☆51Aug 13, 2015Updated 10 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆40Jun 29, 2017Updated 8 years ago
- Example project to show how to use Kafka from Spark Streaming with the Confluent schema registry☆11Aug 17, 2016Updated 9 years ago
- Hortonworks Data Platform Data Generation Tool☆13Nov 30, 2017Updated 8 years ago
- Spark TS Examples☆123Dec 17, 2023Updated 2 years ago
- A Jenkins plugin that allows to deploy / stop Apache Spark applications in Spark standalone clusters.☆10Oct 25, 2015Updated 10 years ago
- Real-time Monitoring☆29May 14, 2012Updated 13 years ago
- Workshop for Hadoop Operations Best Practices☆10Feb 24, 2015Updated 11 years ago
- Email Analysis Tool based on Hadoop☆20Apr 26, 2021Updated 4 years ago
- Beyond Piwik Analytics with Scala and Apache Spark☆46Nov 30, 2014Updated 11 years ago
- A Spark SQL HBase connector☆29May 4, 2015Updated 10 years ago
- Spark1.6和spark2.2的示例,包含kafka,flume,structuredstreaming,jedis,elasticsearch,mysql,dataframe☆15Jan 28, 2018Updated 8 years ago
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase☆14Mar 23, 2016Updated 9 years ago
- Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces☆315Apr 12, 2022Updated 3 years ago
- Make the Guice EDSL more Scala friendly☆45Oct 26, 2017Updated 8 years ago
- Examples of using SparklingPandas and Pandas with PySpark☆16Aug 6, 2015Updated 10 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Aug 21, 2013Updated 12 years ago
- A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR☆120Mar 28, 2016Updated 9 years ago
- Presto SQL query formatter☆15Jan 1, 2024Updated 2 years ago
- 基于 spark 混合查询平台,支持不同源数据库的联合查询,mysql hive presto ...☆14Aug 3, 2017Updated 8 years ago
- Define and schedule workflow, support Flink Jar/SQL, ClickHouse/Hive/Mysql SQL, Shell, etc.☆20Feb 20, 2026Updated last week
- Recipes and examples for Apache Spark☆13Jan 21, 2015Updated 11 years ago
- ☆33Jan 9, 2016Updated 10 years ago
- ☆243Jun 14, 2018Updated 7 years ago
- Tools for spark which we use on the daily basis☆65Jul 2, 2020Updated 5 years ago
- Profiling Spark Applications for Performance Comparison and Diagnosis☆17Nov 11, 2018Updated 7 years ago
- Spark Structured Streaming Kafka 0.8 Source Implementation☆35Apr 27, 2017Updated 8 years ago
- Db2 JDBC connector for Trino☆19Jan 6, 2023Updated 3 years ago
- Collection of HDP Tuning Tricks & Tips (unofficial guide)☆17Sep 26, 2017Updated 8 years ago
- Apache Spark Web Monitor Tool, varOne☆36Aug 26, 2016Updated 9 years ago
- Akka cluster + Docker + CoreOS☆25Sep 13, 2014Updated 11 years ago
- An Apache Flume Sink implementation to publish data to Apache pulsar☆21Oct 5, 2022Updated 3 years ago
- Scalable CDC Pattern Implemented using PySpark☆18Oct 8, 2025Updated 4 months ago