Repository used for Spark Trainings
☆54 · Apr 21, 2023 · Updated 3 years ago
Alternatives and similar repositories for spark-training
Users who are interested in spark-training are comparing it to the libraries listed below.
- SBT plugins for publishing to Maven Central, shading and managing dependencies, reporting to Coveralls from TravisCI, and more ☆14 · Nov 13, 2020 · Updated 5 years ago
- Import Salesforce data into Hadoop HDFS in Avro format ☆23 · Jan 8, 2020 · Updated 6 years ago
- Spark with Scala example projects ☆34 · Apr 17, 2019 · Updated 7 years ago
- Apache Spark (PySpark) Practice on Real Data ☆272 · Jan 31, 2020 · Updated 6 years ago
- Memory consumption estimator for Scala/Java ☆27 · Nov 24, 2014 · Updated 11 years ago
- My submissions for the Coursera MOOC "Big Data Analysis with Scala and Spark" given by EPFL. ☆52 · Mar 24, 2017 · Updated 9 years ago
- Code snippets and tutorials for working with social science data in PySpark ☆418 · Aug 11, 2017 · Updated 8 years ago
- Databricks - Apache Spark™ - 2X Certified Developer ☆265 · Jul 24, 2020 · Updated 5 years ago
- Project for James' Apache Spark with Scala course ☆124 · Jul 6, 2020 · Updated 5 years ago
- Extracting LinkedIn comments from any post and export it to Excel file ☆23 · Oct 17, 2018 · Updated 7 years ago
- A demonstration of Jupyter Book functionality using QuantEcon Python programming source material. ☆14 · Oct 30, 2020 · Updated 5 years ago
- Go Production Deployments [Video], published by Packt ☆11 · Jan 14, 2021 · Updated 5 years ago
- Chef Fundamentals: A Recipe for Automating Infrastructure Udemy course resources including PDF's and code examples ☆11 · Apr 15, 2020 · Updated 6 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data ☆20 · Oct 20, 2017 · Updated 8 years ago
- Code Repository for Hyperledger for Blockchain Applications, by Packt Publishing ☆13 · Jan 12, 2023 · Updated 3 years ago
- Examples of diagrams using Mermaid: https://mermaid.js.org/intro/ ☆12 · Mar 25, 2023 · Updated 3 years ago
- ETL with Azure Cookbook, published by Packt ☆12 · Jan 18, 2023 · Updated 3 years ago
- HMM Tutorial ☆12 · Apr 15, 2018 · Updated 8 years ago
- Get Twitter trends with twitter4j, stream it to a Kafka topic, save it to MongoDB and visualize in Google Maps ☆13 · Sep 30, 2021 · Updated 4 years ago
- Updated repository ☆156 · Nov 25, 2021 · Updated 4 years ago
- Miscellaneous Jupyter notebooks and slides for public talks ☆11 · Jan 7, 2019 · Updated 7 years ago
- Spark app to merge different schemas ☆23 · Dec 21, 2020 · Updated 5 years ago
- ☆11 · Apr 15, 2019 · Updated 7 years ago
- A chess program written in Scala ☆19 · Jul 21, 2020 · Updated 5 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio… ☆56 · May 6, 2023 · Updated 2 years ago
- A Scalable Data Cleaning Library for PySpark. ☆29 · Apr 4, 2019 · Updated 7 years ago
- Build a Docker container to build, train and deploy fast.ai based Deep Learning models with Amazon SageMaker ☆13 · Dec 15, 2018 · Updated 7 years ago
- A collection of data analysis projects done using PySpark via Jupyter notebooks. ☆10 · Oct 8, 2022 · Updated 3 years ago
- ☆13 · Aug 5, 2024 · Updated last year
- This repository will soon contain all scripts and links to the annotated corpora of Tibetan. ☆14 · Feb 4, 2025 · Updated last year
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming ☆55 · Dec 12, 2018 · Updated 7 years ago
- Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References ☆69 · Jan 21, 2019 · Updated 7 years ago
- Rasa Chatbot using Django backend and Sockets for communication ☆12 · Dec 8, 2022 · Updated 3 years ago
- A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs. ☆673 · Jul 9, 2022 · Updated 3 years ago
- Sample Code for Thoughtful Data Science book ☆15 · Dec 9, 2018 · Updated 7 years ago
- ☆14 · Sep 14, 2021 · Updated 4 years ago
- This project provides Apache Spark SQL, RDD, DataFrame and Dataset examples in Scala language ☆566 · Mar 20, 2024 · Updated 2 years ago
- PDF to JSON, JSON to PDF and etc. ☆12 · Apr 18, 2018 · Updated 8 years ago
- Fundamentals of Spark with Python (using PySpark), code examples ☆363 · Oct 29, 2022 · Updated 3 years ago