Here I will be exploring various tools and methods that are used in data engineering process with Python.
☆21Jan 4, 2021Updated 5 years ago
Alternatives and similar repositories for data-engineering-with-python
Users that are interested in data-engineering-with-python are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- eBPF container escape detector prototype | Kernel 6.8+ | Early dev phase | Expect kernel panics ⚠️☆16Apr 27, 2026Updated 2 months ago
- Analyzing the most strategic words to guess on Wordle, based on letter frequency distributions☆11Feb 20, 2022Updated 4 years ago
- A Discord Bot for distilling papers, GitHub repos, Blogposts, and much more using the power of LLMs and vector search.☆13May 3, 2023Updated 3 years ago
- A Pyspark job to handle upserts, conversion to parquet and create partitions on S3☆27Jul 23, 2020Updated 5 years ago
- Demo Repository for eBPF XDP Unit Test☆12Oct 24, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- How Will Your Tweet Be Received? Predicting theSentiment Polarity of Tweet Replies☆12Aug 29, 2021Updated 4 years ago
- ☆13Jan 24, 2023Updated 3 years ago
- Implementation of Quantum Perceptron: An Artificial Neuron Implemented on an Actual Quantum Processor☆10Mar 30, 2025Updated last year
- Variant for Visual GPT with better UI using h2o Wave & Single GPU support☆15Mar 12, 2023Updated 3 years ago
- Book Website: Dynamic System Modelling & Analysis with MATLAB & Pythobn☆24Jul 5, 2022Updated 3 years ago
- ☆14Dec 5, 2021Updated 4 years ago
- This is the final project that after participated the Data Engineering Zoomcamp☆11Apr 4, 2022Updated 4 years ago
- From data gathering to model deployment. Complete ML pipeline using Docker, Airflow and Python.☆13Oct 10, 2023Updated 2 years ago
- Real World Project on Formula1 Racing using Azure Databricks, Delta Lake and Azure Data Factory☆13Jul 24, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆10Oct 8, 2022Updated 3 years ago
- Udacity Data Analyst Degree - Project II☆11Sep 25, 2018Updated 7 years ago
- ☆11Feb 29, 2024Updated 2 years ago
- ☆22Mar 25, 2024Updated 2 years ago
- My first attempt at a rough ETL pipeline; technologies include spark, GCS, prefect orchestration, and terraform☆14Oct 12, 2022Updated 3 years ago
- Data Engineering Capstone☆17Oct 10, 2019Updated 6 years ago
- MoodCat😼 classifies the mood of English sentences.☆14Jun 19, 2022Updated 4 years ago
- Comprehensive Python Plotly tutorial & cheat sheet. Covers plotly.express, graph_objects & figure_factory for Data Science, 3D plotting, …☆23Dec 3, 2025Updated 7 months ago
- Easily import a module and mock its dependencies in an isolated way.☆13May 19, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Feature Store with Spark Streaming, Kafka, Redis☆14Aug 13, 2021Updated 4 years ago
- Trilha de com conceitos de Estatística para Data Science. Inseri links para materiais gratuitos e sites oficiais para coletar dados.☆27Jul 14, 2021Updated 4 years ago
- Este repositório armazena notas de aula de Estatística elaboradas para o curso preparatório de mestrado e doutorado CPAnpec. Adicionalmen…☆17Aug 17, 2021Updated 4 years ago
- Tutorial for understanding CNN basics, video at http://online.codingblocks.com Machine Learning/Deep Learning course.☆12Mar 22, 2019Updated 7 years ago
- This repository contains examples of Kyverno policies for controlling the creation of Cilium Network policies☆22Nov 2, 2023Updated 2 years ago
- Keep learning something new☆21Jan 21, 2022Updated 4 years ago
- Random Forest-based "Correlation" measures☆15May 3, 2022Updated 4 years ago
- An end-to-end data engineering pipeline that fetches real-time YouTube analytics and streams them through Kafka for processing with ksqlD…☆16Sep 19, 2023Updated 2 years ago
- Physics Informed Neural Networks☆20Sep 2, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Detailed notes and code to learn the basics of machine learning with scikit-learn.☆36Oct 11, 2016Updated 9 years ago
- A GitHub repo with materials for preparing for DP-420: Microsoft Certified: Azure Cosmos DB Developer Specialty certification Exam.☆17Jul 16, 2024Updated last year
- 【Python / Streamlit】Pokemon Sleep 小幫手(寶可夢潛力計算、食譜篩選、寶可夢資訊)☆14May 4, 2024Updated 2 years ago
- ☆16Aug 13, 2020Updated 5 years ago
- Discover the perfect harmony of tunes and movies!☆10Aug 17, 2023Updated 2 years ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆45Jan 4, 2024Updated 2 years ago
- GlyphsApp Scripts☆11Aug 15, 2023Updated 2 years ago