☆202Apr 25, 2023Updated 3 years ago
Alternatives and similar repositories for python-spark-tutorial
Users that are interested in python-spark-tutorial are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆152Apr 4, 2018Updated 8 years ago
- Source code for James Lee's Aparch Spark with Java course☆123Jul 30, 2021Updated 4 years ago
- Apache Spark in 7 Days [Video], by Packt Publishing☆18Jan 30, 2023Updated 3 years ago
- Python-Application-Development-Tips-Tricks-and-Techniques [Video]☆13Jan 14, 2021Updated 5 years ago
- Little time-series forecasting app for fun! More models/methods will be included after the june 15! Link: jasonliushiny.shinyapps.io/Forc…☆14Nov 8, 2016Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Let's learn Beam, processing Movie Lens 20m datas. Get top three genres for each user☆14Aug 26, 2018Updated 7 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming☆55Dec 12, 2018Updated 7 years ago
- Hackney Data Platform Infrastructure and Code☆16Updated this week
- ☆25Apr 6, 2019Updated 7 years ago
- AWS Big Data Certification☆25Mar 26, 2026Updated 2 months ago
- Implementations of Machine Learning algorithms using only numpy with visualizations☆17Aug 14, 2018Updated 7 years ago
- NRT Sessionization with Spark Streaming landing on HDFS and putting live stats in HBase☆16Oct 31, 2014Updated 11 years ago
- Some experiments and demos of my sessions at live/online events like Data Saturday and Microsoft Conference☆10Updated this week
- Solutions of LeetCode interview questions☆15Feb 7, 2019Updated 7 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Apache Beam example☆26Jan 27, 2021Updated 5 years ago
- Apache Spark docker container image (Standalone mode)☆35Oct 16, 2020Updated 5 years ago
- A boilerplate for writing PySpark Jobs☆393Jan 21, 2024Updated 2 years ago
- This repo outlines a method for differentiating between anomalies and expected outliers using the Microsoft Anomaly Detection API and Bin…☆10Jun 11, 2017Updated 9 years ago
- A collection of utilities for writing labeling functions, transformation functions, and slicing functions.☆22Apr 22, 2020Updated 6 years ago
- Example unit tests for Apache Spark Python scripts using the py.test framework☆84Mar 22, 2016Updated 10 years ago
- ☆10Aug 17, 2022Updated 3 years ago
- Road to Azure Data Engineer Part-II: DP-201 - Designing an Azure Data Solution☆19Aug 16, 2020Updated 5 years ago
- Optionally add multiple selection filters for FK and m2m fields in Django admin changeview☆14Feb 22, 2017Updated 9 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Example of an Oozie workflow with a PySpark action using Python eggs☆14Nov 13, 2016Updated 9 years ago
- Notes on Apache Spark (pyspark)☆299Mar 3, 2019Updated 7 years ago
- Data for BoofCV project☆15Jul 15, 2023Updated 2 years ago
- ☆12Sep 9, 2023Updated 2 years ago
- Because its never late to start taking notes and 'public' it...☆64Jun 3, 2025Updated last year
- Build machine learning models with scikit-learn power tools☆11Oct 28, 2022Updated 3 years ago
- NASA Project in Python [ Tracking the International Space Station ]☆11Sep 18, 2025Updated 8 months ago
- ☆20Aug 20, 2019Updated 6 years ago
- Feature Engineering with Pipeline Talk at ODSC West 2016, Santa Clara☆17Nov 6, 2016Updated 9 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆12May 26, 2021Updated 5 years ago
- This Repo contain details related to Data Engineering tech stacks in GCP☆58Apr 18, 2026Updated last month
- ☆13May 23, 2018Updated 8 years ago
- Course content for Practical AI on the Google Cloud Platform☆11Aug 4, 2020Updated 5 years ago
- ☆11Jan 20, 2021Updated 5 years ago
- Amazon EKS example manifests for different workloads that can be deployed to Amazon EKS cluster.☆14Aug 5, 2025Updated 10 months ago
- Covid19 Dashboard India☆12Feb 27, 2021Updated 5 years ago