datamindedbe / python-and-spark-for-data-analysis
A four-day course on Python, the Scientific Python stack and PySpark, adapted from a training course given by Patrick Varilly to one of our clients in December 2015
☆11 Updated 9 years ago
Alternatives and similar repositories for python-and-spark-for-data-analysis
Users that are interested in python-and-spark-for-data-analysis are comparing it to the libraries listed below
- A four-day course on Python, the Scientific Python stack and PySpark, adapted from a training course I gave to one of our clients in Dece… ☆10 Updated 9 years ago
- Spark NLP for Streamlit ☆15 Updated 3 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines. ☆47 Updated last year
- A simple introduction to using spark ml pipelines ☆26 Updated 7 years ago
- Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this four p… ☆35 Updated 5 years ago
- ☆18 Updated 3 years ago
- Guide for applying Unit Testing in data-driven projects ☆19 Updated 5 years ago
- ☆19 Updated 4 years ago
- 📚 Learn ML with clean code, simplified math and illustrative visuals. As you learn, work on interesting projects and share them on https… ☆12 Updated 5 years ago
- Apache Spark Interview Question and Answers ☆21 Updated 4 years ago
- Contains source files used in the Spark with Python course ☆18 Updated 6 years ago
- Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets. ☆13 Updated 5 years ago
- Projects developed by Domino's R&D team ☆76 Updated 3 years ago
- Best practices for engineering ML pipelines. ☆35 Updated 3 years ago
- A curated list of references for MLOps ☆13 Updated 4 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data ☆20 Updated 7 years ago
- Code repository supporting the medium blog ☆12 Updated 5 years ago
- An example PySpark project with pytest ☆16 Updated 7 years ago
- Repository used for Spark Trainings ☆53 Updated 2 years ago
- ELT Code for your Data Warehouse ☆26 Updated last year
- Useful scripts and notebooks for Data Science. The project was made by Miquido. https://www.miquido.com/ ☆9 Updated last year
- Build machine learning models with scikit-learn power tools ☆11 Updated 2 years ago
- PySpark phonetic and string matching algorithms ☆39 Updated last year
- MLinProduction SageMaker workshop hosted in April 2020 ☆15 Updated 5 years ago
- Hands-On Data Analysis with Scala, published by Packt ☆20 Updated 2 years ago
- ☆16 Updated 2 years ago
- ☆16 Updated 4 years ago
- ☆11 Updated 6 years ago
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟 ☆53 Updated 3 years ago
- ☆25 Updated 7 years ago