ksbg / sparklanes
A lightweight data processing framework for Apache Spark
☆16Updated 2 years ago
Alternatives and similar repositories for sparklanes:
Users that are interested in sparklanes are comparing it to the libraries listed below
- Simple demonstration of how to build a complex real time machine learning visualization tool.☆16Updated 9 years ago
- A four-day course on Python, the Scientific Python stack and PySpark, adapted from a training course I gave to one of our clients in Dece…☆10Updated 9 years ago
- Repository used for Spark Trainings☆53Updated last year
- A simple introduction to using spark ml pipelines☆26Updated 6 years ago
- ☆25Updated 6 years ago
- Deep Learning with Apache Spark and Deep Cognition☆59Updated 6 years ago
- notebooks for nlp-on-spark☆13Updated 8 years ago
- Updated repository☆157Updated 3 years ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆70Updated 4 years ago
- PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2☆84Updated 5 years ago
- Airflow training for the crunch conf☆105Updated 6 years ago
- PySpark Cookbook, published by Packt☆91Updated 2 years ago
- Real-time report dashboard with Apache Kafka, Apache Spark Streaming and Node.js☆50Updated last year
- Real-world Spark pipelines examples☆83Updated 7 years ago
- Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops☆117Updated last year
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u…☆26Updated 5 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Updated 7 years ago
- An example PySpark project with pytest☆17Updated 7 years ago
- ☆37Updated 5 years ago
- Mastering Spark for Data Science, published by Packt☆47Updated 2 years ago
- ☆19Updated 4 years ago
- Use Airflow to move data from multiple MySQL databases to BigQuery☆100Updated 4 years ago
- Learn the pyspark API through pictures and simple examples☆170Updated 4 years ago
- ☆59Updated 3 years ago
- ☆16Updated 7 years ago
- PySpark phonetic and string matching algorithms☆39Updated last year
- Code examples and docker environment for Spark☆27Updated 9 years ago
- Python API for Deequ☆41Updated 4 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- Code for Packt Publishing's Spark for Data Science Cookbook.☆22Updated 7 years ago