Repository used for Spark Trainings
☆54Apr 21, 2023Updated 2 years ago
Alternatives and similar repositories for spark-training
Users that are interested in spark-training are comparing it to the libraries listed below
Sorting:
- ☆20Jun 23, 2019Updated 6 years ago
- Miscellaneous Jupyter notebooks and slides for public talks☆11Jan 7, 2019Updated 7 years ago
- Get Twitter trends with twitter4j, stream it to a Kafka topic, save it to MongoDB and visualize in Google Maps☆13Sep 30, 2021Updated 4 years ago
- Spark with Scala example projects☆34Apr 17, 2019Updated 6 years ago
- What makes convnets so powerful at image classification?☆46Nov 21, 2017Updated 8 years ago
- SBT plugins for publishing to Maven Central, shading and managing dependencies, reporting to Coveralls from TravisCI, and more☆14Nov 13, 2020Updated 5 years ago
- Extracting LinkedIn comments from any post and export it to Excel file☆23Oct 17, 2018Updated 7 years ago
- Import Salesforce data into Hadoop HDFS in Avro format☆23Jan 8, 2020Updated 6 years ago
- A chess program written in Scala☆19Jul 21, 2020Updated 5 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Oct 20, 2017Updated 8 years ago
- Utilities for writing tests that use Apache Spark.☆24Dec 29, 2018Updated 7 years ago
- Updated repository☆157Nov 25, 2021Updated 4 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming☆55Dec 12, 2018Updated 7 years ago
- A Scalable Data Cleaning Library for PySpark.☆29Apr 4, 2019Updated 6 years ago
- Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References☆69Jan 21, 2019Updated 7 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆56May 6, 2023Updated 2 years ago
- Memory consumption estimator for Scala/Java☆26Nov 24, 2014Updated 11 years ago
- Vagrant project to spin up a single node VM running current versions of Hadoop, Hive and Spark☆66Feb 15, 2022Updated 4 years ago
- It consists of all code examples discussed as part of deep learning course taken at algorithmica☆11Oct 1, 2020Updated 5 years ago
- Final Project for IoT: Big Data Processing and Analytics class. Analyzing U.S nationwide temperature from IoT sensors in real-time☆71Nov 21, 2016Updated 9 years ago
- My presentation at ODSC India 2018 about Deep Learning with Apache Spark☆27Sep 1, 2018Updated 7 years ago
- Docker image for Jupyter notebooks with PySpark☆27Aug 3, 2018Updated 7 years ago
- Scala utility to send mail☆14May 4, 2020Updated 5 years ago
- Databricks - Apache Spark™ - 2X Certified Developer☆265Jul 24, 2020Updated 5 years ago
- PySpark Code for Hands-on Learners☆117Nov 3, 2019Updated 6 years ago
- Movie recommender system with Collaborative Filtering using PySpark☆28Apr 17, 2017Updated 8 years ago
- PredictorFinc is a scalable supervised machine learning model the predicts stock price change through Decision Tree Regressor using data …☆12Sep 5, 2023Updated 2 years ago
- Quick and simple data visualization tool.☆11Aug 10, 2018Updated 7 years ago
- NSI power site project☆17May 5, 2012Updated 13 years ago
- PDF to JSON, JSON to PDF and etc.☆12Apr 18, 2018Updated 7 years ago
- Code snippets and tutorials for working with social science data in PySpark☆418Aug 11, 2017Updated 8 years ago
- BERT, AWS RDS, AWS Forecast, EMR Spark Cluster, Hive, Serverless, Google Assistant + Raspberry Pi, Infrared, Google Cloud Platform Natura…☆139Feb 17, 2026Updated 2 weeks ago
- Just a boilerplate for PySpark and Flask☆36Aug 2, 2018Updated 7 years ago
- Power Plant ML Pipeline Application - Apache Spark☆12Dec 12, 2016Updated 9 years ago
- Export Tweets from Twitter into JSON file then publish as a Graph objects in Neo4j DB☆10Dec 7, 2018Updated 7 years ago
- 10gen M101J courseware☆15Apr 15, 2013Updated 12 years ago
- RFM (recency, frequency, monetary) analysis☆13Aug 11, 2018Updated 7 years ago
- Computer Science, Data Science and ML Fundamentals☆11May 30, 2025Updated 9 months ago
- Python client for Radarly API☆10Aug 3, 2023Updated 2 years ago