Apache Spark is a fast, in-memory data processing engine with elegant and expressive development API's to allow data workers to efficiently execute streaming, machine learning or SQL workloads that require fast iterative access to datasets.This project will have sample programs for Spark in Scala language .
☆54Nov 16, 2022Updated 3 years ago
Alternatives and similar repositories for Spark
Users that are interested in Spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Scala examples for learning to use Spark☆445Sep 17, 2020Updated 5 years ago
- Source code for 'Pro Spark Streaming' by Zubair Nabi☆11Mar 27, 2017Updated 9 years ago
- Learning to write Spark examples☆160Aug 20, 2014Updated 11 years ago
- Experiments made with Spark☆15Dec 9, 2014Updated 11 years ago
- ScalaIO 2014 Workshop☆25Oct 23, 2014Updated 11 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Spark, Cassandra, Tessellation and ArcGIS☆10Jan 18, 2015Updated 11 years ago
- ☆20Aug 17, 2019Updated 6 years ago
- Facebook makes it even easier to interact with Facebook's Graph API☆21Oct 10, 2015Updated 10 years ago
- ☆30Jun 18, 2017Updated 8 years ago
- 大数据下的移动 APP 数据分析指南☆13Jun 27, 2019Updated 6 years ago
- ☆195Jun 21, 2022Updated 3 years ago
- Projects from my Hadoop training sessions☆16Feb 22, 2018Updated 8 years ago
- Spark Streaming HBase Example☆94Apr 4, 2016Updated 10 years ago
- Tools for spark which we use on the daily basis☆65Jul 2, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆40Jun 29, 2017Updated 8 years ago
- Scripts and code to import the GDELT dataset into Spark SQL for analysis☆17Aug 29, 2014Updated 11 years ago
- Machine Learning based model to predict Insurance Pure Premium☆13Jan 24, 2017Updated 9 years ago
- ☆12May 11, 2016Updated 9 years ago
- QA dashboard for DV360 advertisers☆13Jan 20, 2021Updated 5 years ago
- Spark and Python (PySpark) Examples☆39Jul 7, 2021Updated 4 years ago
- Source code for 'PySpark Recipes' by Raju Kumar Mishra☆26Nov 30, 2019Updated 6 years ago
- Mirror of Apache Horn (Incubating) ** This project has been retired **☆28Apr 28, 2017Updated 9 years ago
- Spark(multi versions) + Streaming/Hive/SQL/UDF Demos☆15May 17, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Real-world Spark pipelines examples☆82Feb 27, 2018Updated 8 years ago
- Examples for Spark Training in chinahadoop.cn☆139Feb 18, 2018Updated 8 years ago
- Utilities for writing tests that use Apache Spark.☆24Dec 29, 2018Updated 7 years ago
- Automate claim approval in personal insurance sector.☆20Apr 21, 2016Updated 10 years ago
- Examples for High Performance Spark☆530Updated this week
- ☆26Mar 18, 2016Updated 10 years ago
- ☆14Sep 16, 2013Updated 12 years ago
- A few, straightforward examples which shows how to use Typesafe's Config library and HOCON.☆10Oct 9, 2013Updated 12 years ago
- A reusable workflow to show how to orchestrate many iterations of an action concurrently, in a single pane of glass. See medium write-up …☆12Nov 8, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Examples of Spark 2.0☆214Aug 11, 2021Updated 4 years ago
- Use maven-assembly-plugin to package a spring boot project into a non-fat jar☆10Jul 24, 2017Updated 8 years ago
- Code to support Databases blog post - How to offload data from your transactional NoSQL database to Amazon S3, perform advanced analytics…☆15Mar 26, 2020Updated 6 years ago
- ☆11Aug 14, 2014Updated 11 years ago
- Pandas Helper Library for reading and writing DataFrames from and to HBase.☆10Mar 8, 2018Updated 8 years ago
- Source code of Blog at☆51Sep 17, 2025Updated 7 months ago
- Predict why are our best and most experienced employees leaving prematurely? using machine learning.☆20Jan 21, 2017Updated 9 years ago