tmcgrath / spark-with-python-course
Contains source files used in the Spark with Python course
☆18 · Updated 5 years ago
Alternatives and similar repositories for spark-with-python-course:
Users who are interested in spark-with-python-course are comparing it to the repositories listed below
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S… ☆66 · Updated 9 years ago
- Spark and Python (PySpark) Examples ☆39 · Updated 3 years ago
- Apache Spark in 7 Days [Video], by Packt Publishing ☆17 · Updated 2 years ago
- Business Data Analysis by HiPIC of CalStateLA ☆20 · Updated 6 years ago
- Repository used for Spark Trainings ☆53 · Updated last year
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag ☆29 · Updated 8 years ago
- AWS Big Data Certification ☆25 · Updated 2 months ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming ☆55 · Updated 6 years ago
- ☆26 · Updated last year
- ☆16 · Updated last year
- A code-based tutorial for production-level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u… ☆26 · Updated 5 years ago
- A repository for a PySpark Cookbook by Tomasz Drabas and Denny Lee ☆60 · Updated 6 years ago
- Data Science and Machine Learning with Python - Hands On from Udemy ☆14 · Updated 7 years ago
- Batch processing, orchestration using Apache Airflow and Google Workflows, Spark Structured Streaming, and a lot more ☆19 · Updated 2 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data ☆20 · Updated 7 years ago
- ☆23 · Updated 2 years ago
- Learning PySpark video series ☆11 · Updated 7 years ago
- Live Twitter sentiment analysis using Python, Apache Spark Streaming, Kafka, NLTK, SocketIO ☆20 · Updated 7 years ago
- Workshop for Spark and Databricks ☆54 · Updated 5 years ago
- A simple introduction to using Spark ML pipelines ☆26 · Updated 6 years ago
- Blog post on ETL pipelines with Airflow ☆23 · Updated 4 years ago
- Frank Kane's Taming Big Data with Apache Spark and Python, published by Packt ☆122 · Updated 2 years ago
- Code repository for Large Scale Machine Learning with Spark by Packt ☆20 · Updated 2 years ago
- Processing tweets using Spark Streaming and identifying top trending hashtags using a simple real-time dashboard ☆41 · Updated 2 years ago
- Springboard - Data Science Intensive course ☆13 · Updated 8 years ago
- A simple Spark TDD example ☆26 · Updated 7 years ago
- Mastering Spark for Data Science, published by Packt ☆47 · Updated 2 years ago
- Create data models, build data warehouses and data lakes, automate data pipelines, and work with massive datasets. ☆13 · Updated 5 years ago
- Real-time social media data analytics with Apache Spark, Python, Kafka, Pandas, etc. ☆51 · Updated 8 years ago
- Code to build a simple analytics data pipeline with Python ☆102 · Updated 8 years ago