A repository for a PySpark Cookbook by Tomasz Drabas and Denny Lee
☆61Jul 2, 2018Updated 7 years ago
Alternatives and similar repositories for PySparkCookbook
Users that are interested in PySparkCookbook are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Learning PySpark video series☆11Mar 5, 2018Updated 8 years ago
- Code base for the Learning PySpark book (in preparation)☆631Apr 16, 2019Updated 7 years ago
- ☆16Jun 27, 2020Updated 5 years ago
- Code repository for Learning PySpark by Packt☆344Jan 30, 2023Updated 3 years ago
- Material do artigo: Como Criar um Sistema de Recomendação de Produtos Usando Machine Learning☆11Feb 1, 2017Updated 9 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A collection of data and codes to supplement the practicalDataAnalysisCookbook (in preparation)☆22Mar 30, 2016Updated 10 years ago
- Source code for 'Practical Hive' by Scott Shaw, Andreas François Vermeulen, Ankur Gupta, and David Kjerrumgaard☆34Oct 16, 2017Updated 8 years ago
- Badminton coach ai: A badminton match data analysis platform based on deep learning (Physical Education Journal'20)☆19Nov 23, 2022Updated 3 years ago
- Python - Complete Python, Django, Data Science and ML Guide, published by Packt☆15Dec 15, 2025Updated 5 months ago
- My data science curriculum☆14May 14, 2023Updated 3 years ago
- Pyspark RDD, DataFrame and Dataset Examples in Python language☆1,352Dec 7, 2025Updated 5 months ago
- Source code for 'PySpark Recipes' by Raju Kumar Mishra☆26Nov 30, 2019Updated 6 years ago
- A small library for accessing UK postcode data using UK Postcodes (and thus Ordnance Survey Open Data)☆18Jun 15, 2016Updated 9 years ago
- Source code for the post, 'Getting Started with Data Analysis on AWS, using S3, Glue, Amazon Athena, and QuickSight'☆29Dec 22, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Fundamentals of Apache Flink [video], published by Packt☆12Jan 30, 2023Updated 3 years ago
- ☆12Jul 6, 2021Updated 4 years ago
- ☆12Apr 1, 2026Updated last month
- Sample how to use Camunda DMN decisions in a Zeebe Workflow☆11Apr 13, 2022Updated 4 years ago
- Spark pipelines that correspond to a series of Dataflow examples.☆27May 5, 2019Updated 7 years ago
- ☆18Jun 6, 2022Updated 3 years ago
- ☆14Jun 22, 2020Updated 5 years ago
- Manifests list for a multi-arch Docker image☆11Jan 23, 2019Updated 7 years ago
- notebooks for nlp-on-spark☆13Jan 27, 2017Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Lab project to showcase Flink's performance differences between using a SQL query and implementing the same logic via the DataStream API☆14Apr 15, 2020Updated 6 years ago
- gRPC API definitions for µONOS☆17Sep 9, 2024Updated last year
- A Proteus example using RSocket RPC, and Kafka☆19Feb 8, 2019Updated 7 years ago
- ☆31Oct 17, 2018Updated 7 years ago
- Inference and deployment toolkit for Svara-TTS, an open-source multilingual text-to-speech model for Indic languages☆22Apr 1, 2026Updated last month
- Datasets and code snippets of the book Pro Machine Learning☆11Dec 1, 2018Updated 7 years ago
- Azure Databricks workshops with content on connectivity to Azure services, data engineering workflows and data sciences notebooks.☆11Feb 20, 2019Updated 7 years ago
- A Python script to swoop and decrypt passwords from Chrome's local storage.☆11Dec 10, 2018Updated 7 years ago
- DuckDB Explain Visualizer (DEV) based on pev2☆39Aug 22, 2025Updated 9 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Movie recommender system with Collaborative Filtering using PySpark☆28Apr 17, 2017Updated 9 years ago
- ☆21May 14, 2026Updated last week
- Official Implementation for ShuttleNet: Position-aware Fusion of Rally Progress and Player Styles for Stroke Forecasting in Badminton (AA…☆31Nov 23, 2022Updated 3 years ago
- The project involved developing a credit risk default model on Indian companies using the performance data of several companies to predic…☆10Nov 9, 2021Updated 4 years ago
- This project is to integration HP ALM and other test automation frameworks.☆10May 25, 2020Updated 5 years ago
- AWS ECR Docker projects☆21Jul 4, 2024Updated last year
- ☆23Dec 21, 2021Updated 4 years ago