A repository for a PySpark Cookbook by Tomasz Drabas and Denny Lee
☆61 · Jul 2, 2018 · Updated 7 years ago
Alternatives and similar repositories for PySparkCookbook
Users interested in PySparkCookbook are comparing it to the repositories listed below
- Learning PySpark video series ☆11 · Mar 5, 2018 · Updated 8 years ago
- Code base for the Learning PySpark book (in preparation) ☆630 · Apr 16, 2019 · Updated 6 years ago
- ☆16 · Jun 27, 2020 · Updated 5 years ago
- Material for the article: How to Build a Product Recommendation System Using Machine Learning ☆11 · Feb 1, 2017 · Updated 9 years ago
- ☆38 · May 7, 2025 · Updated 10 months ago
- A collection of data and code to supplement the practicalDataAnalysisCookbook (in preparation) ☆22 · Mar 30, 2016 · Updated 9 years ago
- Apache-kafka-spark-streaming-poc ☆10 · Mar 19, 2017 · Updated 9 years ago
- ☆16 · Apr 1, 2025 · Updated 11 months ago
- My Docker dev tools image, usable anywhere Docker is available ☆10 · Feb 23, 2015 · Updated 11 years ago
- Demo for Qt Quick graphical effects ☆11 · Jan 13, 2015 · Updated 11 years ago
- ☆10 · Nov 11, 2024 · Updated last year
- An e-commerce chatbot based on contextual NLP processes, developed with Rasa Open Source, written in Python ☆11 · Aug 4, 2022 · Updated 3 years ago
- Hands-On Big Data Analytics with PySpark, published by Packt ☆37 · Jan 30, 2023 · Updated 3 years ago
- Build a real-time website analytics dashboard on GCP using Dataflow, Cloud Memorystore (Redis) and Spring Boot ☆30 · Mar 1, 2026 · Updated 3 weeks ago
- PySpark RDD, DataFrame and Dataset examples in Python ☆1,345 · Dec 7, 2025 · Updated 3 months ago
- Hands-On Serverless Deep Learning with TensorFlow and AWS Lambda, published by Packt ☆14 · Oct 31, 2022 · Updated 3 years ago
- ☕⛵ WIP PySpark dependency management ☆22 · Jul 8, 2018 · Updated 7 years ago
- ☆11 · Oct 5, 2022 · Updated 3 years ago
- ☆12 · Apr 20, 2021 · Updated 4 years ago
- Listing my favorite research papers 📝 from different fields as I read them. ☆10 · Oct 17, 2019 · Updated 6 years ago
- Source code for the post, 'Getting Started with Data Analysis on AWS, using S3, Glue, Amazon Athena, and QuickSight' ☆29 · Dec 22, 2020 · Updated 5 years ago
- Chapter 7 of the AWS Cookbook ☆12 · Mar 23, 2022 · Updated 4 years ago
- ☆12 · Jul 6, 2021 · Updated 4 years ago
- Thai translation of the book "Interpretable Machine Learning" by Christoph Molnar… ☆14 · Oct 15, 2021 · Updated 4 years ago
- A public repository for corrupt0 datathon's court data ☆11 · Jul 6, 2019 · Updated 6 years ago
- Notebooks for nlp-on-spark ☆13 · Jan 27, 2017 · Updated 9 years ago
- Akka Java cluster singleton example ☆10 · Dec 5, 2023 · Updated 2 years ago
- ☆20 · Aug 21, 2025 · Updated 7 months ago
- Data Engineering Projects using Mage.ai as orchestrator ☆18 · Jan 20, 2026 · Updated 2 months ago
- Lab project to showcase Flink's performance differences between using a SQL query and implementing the same logic via the DataStream API ☆14 · Apr 15, 2020 · Updated 5 years ago
- gRPC API definitions for µONOS ☆17 · Sep 9, 2024 · Updated last year
- Auto-reply suggestions for chat messages and emails (like Gmail and LinkedIn), built using the rasa_nlu framework. ☆15 · Jun 20, 2018 · Updated 7 years ago
- ☆31 · Oct 17, 2018 · Updated 7 years ago
- Materials from Data Science Dojo ☆15 · Aug 15, 2017 · Updated 8 years ago
- Datasets and code snippets of the book Pro Machine Learning ☆11 · Dec 1, 2018 · Updated 7 years ago
- ☆23 · Jan 31, 2026 · Updated last month
- An interactive quiz application built on Kafka Streams, Spring MVC handler, and Vue ☆12 · Aug 23, 2020 · Updated 5 years ago
- Azure Databricks workshops with content on connectivity to Azure services, data engineering workflows and data science notebooks. ☆11 · Feb 20, 2019 · Updated 7 years ago
- Botoflow is an asynchronous framework for Amazon SWF that helps you build SWF applications using Python ☆13 · Dec 26, 2022 · Updated 3 years ago