Challenge Data Engineer
☆25Jun 13, 2022Updated 3 years ago
Alternatives and similar repositories for data-engineer-challenge
Users that are interested in data-engineer-challenge are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Scheduling Big Data Workloads and Data Pipelines in the Cloud with pyDag☆23Sep 19, 2022Updated 3 years ago
- Dockerizing a Python Script for Web Scraping and consume the scraped data using FastApi (www.metroscubicos.com)☆15Dec 16, 2021Updated 4 years ago
- A parallel implementation of the bzip2 data compressor in python, this data compression pipeline is using algorithms like Burrows–Wheeler…☆13Jun 29, 2022Updated 3 years ago
- The goal of this project is to identify students at risk of dropping out the school☆22May 7, 2021Updated 4 years ago
- The goal of this project is to track the expenses of Uber Rides and Uber Eats through data Engineering processes using technologies such …☆123Jun 29, 2022Updated 3 years ago
- Build a Content-Based Movie Recommender System (TF-IDF, BM25, BERT)☆13Jun 13, 2022Updated 3 years ago
- Term Frequency-Inverse Document Frequency from Scratch☆14Sep 19, 2021Updated 4 years ago
- Sample Project to Learn Data Engineering☆10Aug 1, 2021Updated 4 years ago
- SCIM 2.0 JAVA development kit☆18May 2, 2025Updated 10 months ago
- Repository for Apache Spark course at Team Data Science☆17Oct 23, 2020Updated 5 years ago
- A data pipeline moving data from a Relational database system (RDBMS) to a Hadoop file system (HDFS).☆15Jun 3, 2021Updated 4 years ago
- ☆11Oct 8, 2021Updated 4 years ago
- Source code related of the articles posted in medium.com☆12Nov 2, 2020Updated 5 years ago
- Command line client for the Fugue API☆14Mar 7, 2023Updated 3 years ago
- ☆23Jun 2, 2021Updated 4 years ago
- Tool which summarizes daily and total gas consumption of all transactions sent from a specified Ethereum address.☆15Jun 28, 2023Updated 2 years ago
- Resources backing the Feast fraud tutorial on GCP☆14May 31, 2022Updated 3 years ago
- Solutions & Code Related to Blog Posts☆11Nov 6, 2024Updated last year
- Testing Boring SL with DuckDB☆32Aug 18, 2025Updated 7 months ago
- Project based learning for Data Engineering fundamentals.☆13Jan 15, 2021Updated 5 years ago
- dbt project for the domestic heating agent-based model at Centre for Net Zero.☆12Nov 15, 2022Updated 3 years ago
- Open source petroleum engineering projects, useful scripts, functions, and jupyter notebooks☆20Nov 17, 2020Updated 5 years ago
- A living resource to help developers get started with assemblyscript.☆15Feb 26, 2022Updated 4 years ago
- ☆10Apr 13, 2022Updated 3 years ago
- Data Engineering Hours With Experts Coding Challenge☆13Mar 16, 2026Updated last week
- GraphRAG: Knowledge in Graphs not Documents☆17Jul 5, 2025Updated 8 months ago
- Powershell Scripts for Power BI☆13Sep 20, 2023Updated 2 years ago
- Basic Udacity project using pandas for bikeshare data exploration☆10Nov 29, 2021Updated 4 years ago
- Repositorio utilizado para el Curso de Apache Spark en Platzi☆20Feb 20, 2021Updated 5 years ago
- This repo contains my projects from the Udacity Data Engineering Nano degree☆13Apr 26, 2023Updated 2 years ago
- ImageTester is a Cli tool to perform visual tests on images or PDF files.☆10Feb 18, 2026Updated last month
- Apache NiFi deployment on OpenShift☆13Jul 18, 2023Updated 2 years ago
- ☆20Updated this week
- A Python tool to solve logic games with AI, Deep Learning and Computer Vision☆17Jan 30, 2021Updated 5 years ago
- Pre-processing time series data from Open Power Systems Data☆13Jul 27, 2021Updated 4 years ago
- Wrapper for Spotify API that generates user-specific playlists☆14Feb 15, 2023Updated 3 years ago
- ☆19Aug 13, 2022Updated 3 years ago
- Course Material Data Engineering on AWS Course☆31Sep 9, 2024Updated last year
- Set of scripts to terminate various GCP resources to save cash and cats 🐈☆14Jan 25, 2021Updated 5 years ago