Apache Spark is a fast, in-memory data processing engine with elegant and expressive development APIs that let data workers efficiently execute streaming, machine learning, or SQL workloads requiring fast iterative access to datasets. This project contains sample programs for Spark written in Scala.
☆54 Nov 16, 2022 Updated 3 years ago
Alternatives and similar repositories for Spark
Users who are interested in Spark are comparing it to the libraries listed below.
- ScalaIO 2014 Workshop ☆25 Oct 23, 2014 Updated 11 years ago
- Source code for 'Pro Spark Streaming' by Zubair Nabi ☆10 Mar 27, 2017 Updated 8 years ago
- Spark implementation of Slowly Changing Dimension type 2 ☆11 Jan 8, 2019 Updated 7 years ago
- Experiments made with Spark ☆15 Dec 9, 2014 Updated 11 years ago
- Scala examples for learning to use Spark ☆445 Sep 17, 2020 Updated 5 years ago
- Predict why are our best and most experienced employees leaving prematurely? using machine learning. ☆21 Jan 21, 2017 Updated 9 years ago
- Automate claim approval in personal insurance sector. ☆20 Apr 21, 2016 Updated 9 years ago
- An example stand alone program to import CSV files into Apache Cassandra using Apache Spark ☆19 May 28, 2015 Updated 10 years ago
- An example of using sub-projects in a Scala/SBT project ☆35 Sep 28, 2017 Updated 8 years ago
- ☆20 Aug 17, 2019 Updated 6 years ago
- Scripts and code to import the GDELT dataset into Spark SQL for analysis ☆17 Aug 29, 2014 Updated 11 years ago
- Learning to write Spark examples ☆161 Aug 20, 2014 Updated 11 years ago
- Spark and Python (PySpark) Examples ☆39 Jul 7, 2021 Updated 4 years ago
- A machine learning algorithm written to predict severity of insurance claim ☆19 Nov 14, 2016 Updated 9 years ago
- Utilities for writing tests that use Apache Spark. ☆24 Dec 29, 2018 Updated 7 years ago
- Real-world Spark pipelines examples ☆83 Feb 27, 2018 Updated 8 years ago
- A Spark WordCount example as a standalone SBT project ☆17 Dec 16, 2015 Updated 10 years ago
- Source code for 'PySpark Recipes' by Raju Kumar Mishra ☆26 Nov 30, 2019 Updated 6 years ago
- ☆195 Jun 21, 2022 Updated 3 years ago
- Project for James' Apache Spark with Scala course ☆125 Jul 6, 2020 Updated 5 years ago
- Because its never late to start taking notes and 'public' it... ☆63 Jun 3, 2025 Updated 9 months ago
- ☆25 Oct 12, 2016 Updated 9 years ago
- Tools for spark which we use on the daily basis ☆65 Jul 2, 2020 Updated 5 years ago
- This repository has the code from the text and the videos for "Introduction to Programming and Problem Solving using Scala". ☆30 Feb 11, 2018 Updated 8 years ago
- Example of use of Spark Streaming with Kafka ☆90 Jul 11, 2014 Updated 11 years ago
- Gedcom file Viewer ☆13 Feb 15, 2016 Updated 10 years ago
- IntegratedML samples to be used as a template ☆11 Dec 27, 2025 Updated 2 months ago
- Example project to show how to use Spark to read and write Avro/Parquet files ☆50 Aug 21, 2013 Updated 12 years ago
- Examples for High Performance Spark ☆527 Updated this week
- Embedded control system (ECS) software controls the overall behavior of ScanBot3D, an autonomous 3D reconstruction robot ☆11 Nov 1, 2018 Updated 7 years ago
- Examples of Spark 2.0 ☆212 Aug 11, 2021 Updated 4 years ago
- Examples for Spark Training in chinahadoop.cn ☆139 Feb 18, 2018 Updated 8 years ago
- ☆30 Jun 18, 2017 Updated 8 years ago
- Documentation sources for syslog-ng Open Source Edition (https://github.com/syslog-ng/syslog-ng) ☆10 May 6, 2024 Updated last year
- A clean online résumé (CV) ☆13 Jun 6, 2024 Updated last year
- data sanitation services ☆12 Dec 18, 2024 Updated last year
- A simple repository showcasing a few LLM Evaluation strategies and leverages W&B Sweeps to optimize the LLM system. ☆12 Jul 11, 2023 Updated 2 years ago
- Developed a recommendation system in Python using Netflix prize dataset and MovieLens data set using collaborative filtering technique to… ☆11 Aug 16, 2018 Updated 7 years ago
- Image-Based Mesh Generation ☆13 Apr 7, 2024 Updated last year