Some exercises to learn Spark. Solved in Python.
☆21Oct 15, 2024Updated last year
Alternatives and similar repositories for spark-exercises
Users that are interested in spark-exercises are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Run an open-source data LakeHouse locally using Docker Compose☆12May 31, 2024Updated last year
- Iot,Big Data Analytics using Apache-kafka,spark and other aws services☆16Sep 11, 2020Updated 5 years ago
- A Firebase Cloud Function and a Firebase hosted web app to treat weather data collected by Cloud IoT Core☆18Mar 10, 2019Updated 7 years ago
- preparation guide for aws big data / data analytics – specialty exam☆18Nov 23, 2020Updated 5 years ago
- Short Range Ultrasonic Radar - A simple radar using the ultrasonic sensor, this radar works by measuring a range from 3cm to 40 cm as non…☆19Nov 11, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- In this project I have built etl pipline which scraps the trending repository based on month,week and day LIVE extract other related info…☆12Sep 9, 2023Updated 2 years ago
- End-to-End deployment of E-commerce customers segmentation using Clustering Machine learning algorithms in Google Cloud Platform and MLOp…☆19Jun 5, 2024Updated last year
- pytest support for airflow☆12Apr 20, 2021Updated 5 years ago
- Spark and Hive docker containers sharing a common MySQL metastore☆26Apr 17, 2020Updated 6 years ago
- A tiny wiki engine. (Fossil Export)☆13Jul 29, 2023Updated 2 years ago
- Study materials for the AWS Big Data / Data Analytics Specialty Exam☆27Apr 7, 2022Updated 4 years ago
- Crawling the data from lazada, websosanh, compare.vn, cdiscount and cungmua with flexible configs☆30Jul 7, 2016Updated 9 years ago
- A set of protocols for remote connection, for two people to connect while apart.☆10Sep 20, 2022Updated 3 years ago
- blog together with your cyber buds. spatial live pseudonymous multiplayer journaling☆12Jul 28, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Website/code for daily code sketches☆12Aug 4, 2025Updated 9 months ago
- ☆14May 6, 2022Updated 4 years ago
- Tools for debugging memory leaks in R☆13Dec 11, 2023Updated 2 years ago
- Cloudera_Material: Study Material to help people preparing for Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collab…☆41Apr 21, 2020Updated 6 years ago
- ☆196Feb 13, 2021Updated 5 years ago
- Zero-dependency Java client for HashiCorp's Vault☆40Dec 14, 2025Updated 5 months ago
- Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize a…☆40Dec 15, 2025Updated 5 months ago
- A tool for detecting anomalies in time series data☆11Dec 1, 2022Updated 3 years ago
- Demonstration project for building out a data news rig.☆10Mar 15, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This is BetaNYC. Here, you can comment on who we are.☆11Dec 1, 2020Updated 5 years ago
- visualize an AST serialized as YAML☆13Mar 13, 2023Updated 3 years ago
- ⏰ Fetch and clean data on a schedule, using GitHub Actions + R☆10Aug 30, 2022Updated 3 years ago
- chrome extension that automatically saves liked / bookmarked tweets to Are.na☆16May 27, 2023Updated 3 years ago
- Generate diff comment between two directories in GitHub Actions☆21Updated this week
- ☆12Apr 2, 2024Updated 2 years ago
- ☆11Nov 22, 2020Updated 5 years ago
- A Jekyll blog about the history of computing☆13Dec 8, 2024Updated last year
- Store parameterized queries☆15Nov 28, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Daily e-mail from notion database☆15Apr 6, 2023Updated 3 years ago
- Chess + Rust <3. Chess engine written in Rust.☆20Nov 1, 2023Updated 2 years ago
- A productivity app for Mac OS X☆12Oct 30, 2014Updated 11 years ago
- D3.js resources for SND/NYC 2018☆13Mar 22, 2018Updated 8 years ago
- CSS workshop on word embeddings for the social sciences, 3/19/21☆12Mar 19, 2021Updated 5 years ago
- ☆14Jun 21, 2021Updated 4 years ago
- Some TikZ and PGFPlots examples☆12Jun 15, 2021Updated 4 years ago