ksbg / sparklanes
A lightweight data processing framework for Apache Spark
☆16 Updated 2 years ago
Alternatives and similar repositories for sparklanes
Users that are interested in sparklanes are comparing it to the libraries listed below
- Simple demonstration of how to build a complex real time machine learning visualization tool. ☆16 Updated 9 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines. ☆47 Updated last year
- Repository used for Spark Trainings ☆54 Updated 2 years ago
- Repo for all my code on the articles I post on medium ☆107 Updated 2 years ago
- Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops ☆118 Updated 2 years ago
- Learn the pyspark API through pictures and simple examples ☆170 Updated 4 years ago
- scaffold of Apache Airflow executing Docker containers ☆86 Updated 2 years ago
- Various data stream/batch process demo with Apache Scala Spark 🚀 ☆11 Updated 5 years ago
- PySpark phonetic and string matching algorithms ☆39 Updated last year
- Asynchronous actions for PySpark ☆47 Updated 3 years ago
- Airflow training for the crunch conf ☆105 Updated 6 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data ☆20 Updated 7 years ago
- A simple introduction to using spark ml pipelines ☆26 Updated 7 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle ☆33 Updated 9 years ago
- code, labs and lectures for the course ☆47 Updated 2 years ago
- ☆19 Updated 4 years ago
- Workshop for Spark and Databricks ☆54 Updated 5 years ago
- PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2 ☆86 Updated 5 years ago
- ☆59 Updated 3 years ago
- Data validation library for PySpark 3.0.0 ☆33 Updated 2 years ago
- Updated repository ☆157 Updated 3 years ago
- ☆54 Updated 2 years ago
- Code to build a simple analytics data pipeline with Python ☆102 Updated 8 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming ☆55 Updated 6 years ago
- A four-day course on Python, the Scientific Python stack and PySpark, adapted from a training course I gave to one of our clients in Dece… ☆10 Updated 9 years ago
- ☆16 Updated 7 years ago
- Spark 2.0 Python Machine Learning examples ☆97 Updated 5 years ago
- Just a boilerplate for PySpark and Flask ☆35 Updated 7 years ago
- Projects developed by Domino's R&D team ☆77 Updated 3 years ago
- Repository with examples and smoke tests for the GCP Airflow operators and hooks ☆149 Updated 8 years ago