This project involves an ETL (Extract, Transform, Load) process to analyze sleep data exported from Apple Health
☆29Apr 29, 2023Updated 3 years ago
Alternatives and similar repositories for ETL-Apple-Health
Users that are interested in ETL-Apple-Health are comparing it to the libraries listed below.
- I am using a Confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti…☆29May 2, 2023Updated 3 years ago
- In this project, we will build an ETL (Extract, Transform, Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆25May 6, 2023Updated 3 years ago
- A data engineering project with Airflow, dbt, Terraform, GCP and much more!☆27Nov 8, 2022Updated 3 years ago
- Collection of my favorite Python packages from 2020☆11Jan 12, 2021Updated 5 years ago
- ☆12Jan 14, 2023Updated 3 years ago
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆51Aug 23, 2019Updated 6 years ago
- A demo instance of mage for pulling sample data from a public Google pub/sub topic and transforming with dbt.☆12Jan 5, 2024Updated 2 years ago
- ☆16May 29, 2023Updated 3 years ago
- RedditR for Content Engagement and Recommendation☆18Dec 21, 2017Updated 8 years ago
- ☆22Sep 26, 2021Updated 4 years ago
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆22Feb 3, 2026Updated 4 months ago
- ☆13May 11, 2025Updated last year
- ☆19May 27, 2023Updated 3 years ago
- Deployed a Kafka instance on an AWS EC2 instance to stream data into Databricks☆10Aug 15, 2023Updated 2 years ago
- Learn the entire ETL process based on Spotify API data☆268Feb 1, 2021Updated 5 years ago
- Stream/batch system with Hadoop, Spark on NYC taxi data | #DE☆26Apr 10, 2026Updated 2 months ago
- ☆15Nov 16, 2023Updated 2 years ago
- Work for Mastering Large Datasets with Python☆20Dec 8, 2022Updated 3 years ago
- ☆16Jun 5, 2023Updated 3 years ago
- Repository containing example solutions for the Data Engineering Career Path Portfolio Projects☆18Sep 16, 2022Updated 3 years ago
- Data Engineering Project: Extracting music video metrics of Twice using YouTube API, AWS, and Tableau☆32Nov 21, 2023Updated 2 years ago
- PyTorch library for breast cancer metastasis detection in whole-slide images of sentinel lymph node tissue from the Camelyon dataset☆15Nov 25, 2019Updated 6 years ago
- ☆18Mar 24, 2020Updated 6 years ago
- This project aims to build a traveling recommendation application using Google Places API and OpenAI LLM.☆11Mar 19, 2024Updated 2 years ago
- ☆174May 20, 2022Updated 4 years ago
- Demonstration of using Apache Spark to build robust ETL pipelines while taking advantage of open source, general purpose cluster computin…☆24Aug 11, 2023Updated 2 years ago
- Project on belief embedding☆23Jun 4, 2025Updated last year
- End to end data engineering project☆59Oct 27, 2022Updated 3 years ago
- A short demo to introduce the polars dataframe library through a marimo notebook.☆24Jan 29, 2025Updated last year
- A real-time reddit data streaming pipeline for sentiment analysis of various subreddits☆149Aug 23, 2023Updated 2 years ago
- [Python / Streamlit] Pokemon Sleep helper (Pokémon potential calculation, recipe filtering, Pokémon information)☆14May 4, 2024Updated 2 years ago
- Local single kafka instance + schema registry + zookeeper to be used as local development.☆22Mar 3, 2022Updated 4 years ago
- A real-time streaming ETL pipeline for streaming and performing sentiment analysis on Twitter data using Apache Kafka, Apache Spark and D…☆29Aug 8, 2020Updated 5 years ago
- ☆16Feb 20, 2026Updated 4 months ago
- Discover the perfect harmony of tunes and movies!☆10Aug 17, 2023Updated 2 years ago
- Data-Scenario is a repository designed to help professionals and students master data science by solving real-world problems. Each projec…☆16Oct 16, 2025Updated 8 months ago
- This project provides end-to-end data processing and visualization of visa numbers in Japan using PySpark and Plotly. The Spark cluste…☆11Oct 11, 2023Updated 2 years ago
- The project focuses on the drowsiness of IT employees, drivers, pilots, crane operators, students, etc. These people need a system which ca…☆14Sep 13, 2018Updated 7 years ago
- The repository includes detailed steps to get data from GES DISC, convert HDF5 files to CSV and plotting geographic data.☆11Aug 17, 2020Updated 5 years ago