Some exercises to learn Spark. Solved in Python.
☆21Oct 15, 2024Updated last year
Alternatives and similar repositories for spark-exercises
Users that are interested in spark-exercises are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data Pipeline that utilizes GCP, Python 3.10, Prefect, and more.☆10Jan 23, 2023Updated 3 years ago
- Files for the Docker and Kubernetes on Google Cloud Hands-On labs☆11Mar 14, 2023Updated 3 years ago
- Short Range Ultrasonic Radar - A simple radar using the ultrasonic sensor, this radar works by measuring a range from 3cm to 40 cm as non…☆19Nov 11, 2024Updated last year
- Spark-based pipeline to extract and parse monthly games from the Lichess database.☆21Sep 22, 2025Updated 6 months ago
- Demo showcasing Spark Streaming, Kafka, Kudu - all in Python☆27Jun 12, 2017Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- End-to-End deployment of E-commerce customers segmentation using Clustering Machine learning algorithms in Google Cloud Platform and MLOp…☆19Jun 5, 2024Updated last year
- A forwarding mail server inspired by @alum.mit.edu☆20Mar 22, 2016Updated 10 years ago
- A basic golang app with a travis pipeline that deploys into a k8s cluster using Argo-CD☆14Aug 24, 2018Updated 7 years ago
- The Data Pipeline and Analytics Stack is a comprehensive solution designed for processing, storing, and visualizing data. Explore a compl…☆17Dec 26, 2023Updated 2 years ago
- Cloud based Data Platform based on Apache Spark☆27Feb 17, 2026Updated 2 months ago
- Spark and Hive docker containers sharing a common MySQL metastore☆26Apr 17, 2020Updated 6 years ago
- A tiny wiki engine. (Fossil Export)☆13Jul 29, 2023Updated 2 years ago
- Crawling the data from lazada, websosanh, compare.vn, cdiscount and cungmua with flexible configs☆30Jul 7, 2016Updated 9 years ago
- A set of protocols for remote connection, for two people to connect while apart.☆10Sep 20, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Extension giúp ta tự động điền form Quy chế + Pháp Luật - HUST☆11Apr 9, 2026Updated last week
- blog together with your cyber buds. spatial live pseudonymous multiplayer journaling☆12Jul 28, 2021Updated 4 years ago
- Windows Agent Installer module for Jenkins☆15Apr 5, 2022Updated 4 years ago
- MuckRock User Service☆11Updated this week
- Tools for debugging memory leaks in R☆13Dec 11, 2023Updated 2 years ago
- Cloudera_Material: Study Material to help people preparing for Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collab…☆41Apr 21, 2020Updated 5 years ago
- Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize a…☆40Dec 15, 2025Updated 4 months ago
- A tool for detecting anomalies in time series data☆11Dec 1, 2022Updated 3 years ago
- Demonstration project for building out a data news rig.☆10Mar 15, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This repository is a voice search demo using OpenAI Whisper, DuckDB, and the Metaphone algorithm. The associate blog post is here: https:…☆13May 15, 2024Updated last year
- This is BetaNYC. Here, you can comment on who we are.☆11Dec 1, 2020Updated 5 years ago
- visualize an AST serialized as YAML☆13Mar 13, 2023Updated 3 years ago
- ⏰ Fetch and clean data on a schedule, using GitHub Actions + R☆10Aug 30, 2022Updated 3 years ago
- chrome extension that automatically saves liked / bookmarked tweets to Are.na☆15May 27, 2023Updated 2 years ago
- ☆50Feb 11, 2020Updated 6 years ago
- ☆10Nov 22, 2020Updated 5 years ago
- ☆12Apr 2, 2024Updated 2 years ago
- A Jekyll blog about the history of computing☆14Dec 8, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Flashcard app for the terminal☆10Nov 29, 2016Updated 9 years ago
- Store parameterized queries☆15Nov 28, 2022Updated 3 years ago
- Daily e-mail from notion database☆15Apr 6, 2023Updated 3 years ago
- 将你的B站最新投稿显示在 pinned gist。☆19Aug 4, 2020Updated 5 years ago
- Chess + Rust <3. Chess engine written in Rust.☆20Nov 1, 2023Updated 2 years ago
- A productivity app for Mac OS X☆12Oct 30, 2014Updated 11 years ago
- An online, interactive coding tutorial☆11Apr 11, 2016Updated 10 years ago