Here I will be exploring various tools and methods that are used in data engineering process with Python.
☆21Jan 4, 2021Updated 5 years ago
Alternatives and similar repositories for data-engineering-with-python
Users that are interested in data-engineering-with-python are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- (Python, PySpark)☆11Nov 15, 2020Updated 5 years ago
- Aqui você encontra os materiais de um curso COMPLETO de Graduação em Ciência da Computação de uma das melhores universidades do Brasil.☆10Aug 7, 2020Updated 5 years ago
- Stanford CS234, 2018☆10Mar 23, 2018Updated 8 years ago
- Desafio de Full-Stack do processo de recrutamento da Estudar com Você☆12Aug 5, 2019Updated 6 years ago
- Notebooks para prática de conceitos de ML☆10Apr 4, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- eBPF container escape detector prototype | Kernel 6.8+ | Early dev phase | Expect kernel panics ⚠️☆15Apr 27, 2026Updated last month
- Analyzing the most strategic words to guess on Wordle, based on letter frequency distributions☆11Feb 20, 2022Updated 4 years ago
- A Discord Bot for distilling papers, GitHub repos, Blogposts, and much more using the power of LLMs and vector search.☆13May 3, 2023Updated 3 years ago
- ☆11Apr 9, 2022Updated 4 years ago
- An end-to-end project on customer segmentation☆22Mar 10, 2022Updated 4 years ago
- A Pyspark job to handle upserts, conversion to parquet and create partitions on S3☆27Jul 23, 2020Updated 5 years ago
- Simple demonstration of interactions between a streamlit app and the mlflow tracking api☆24Apr 30, 2021Updated 5 years ago
- ☆13Jan 24, 2023Updated 3 years ago
- Implementation of Quantum Perceptron: An Artificial Neuron Implemented on an Actual Quantum Processor☆10Mar 30, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Variant for Visual GPT with better UI using h2o Wave & Single GPU support☆15Mar 12, 2023Updated 3 years ago
- Teste de rover para back-end☆18Dec 10, 2019Updated 6 years ago
- Real World Project on Formula1 Racing using Azure Databricks, Delta Lake and Azure Data Factory☆13Jul 24, 2023Updated 2 years ago
- Udacity Data Analyst Degree - Project II☆11Sep 25, 2018Updated 7 years ago
- ☆11Feb 29, 2024Updated 2 years ago
- Data Engineering Capstone☆17Oct 10, 2019Updated 6 years ago
- MoodCat😼 classifies the mood of English sentences.☆14Jun 19, 2022Updated 3 years ago
- Comprehensive Python Plotly tutorial & cheat sheet. Covers plotly.express, graph_objects & figure_factory for Data Science, 3D plotting, …☆23Dec 3, 2025Updated 6 months ago
- Easily import a module and mock its dependencies in an isolated way.☆13May 19, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This repository consists of the IPython Notebook for the work related to audio processing and implementing convolution neural networks fo…☆13Feb 13, 2019Updated 7 years ago
- ☆21Jul 17, 2023Updated 2 years ago
- Este repositório armazena notas de aula de Estatística elaboradas para o curso preparatório de mestrado e doutorado CPAnpec. Adicionalmen…☆17Aug 17, 2021Updated 4 years ago
- StyleGAN Encoder - converts real images to latent space☆24May 26, 2019Updated 7 years ago
- E-commerce intelligent search platform. Pinecone/Devpost Hackathon 2023.☆16Aug 30, 2025Updated 9 months ago
- ☆20Apr 3, 2024Updated 2 years ago
- Tutorial for understanding CNN basics, video at http://online.codingblocks.com Machine Learning/Deep Learning course.☆12Mar 22, 2019Updated 7 years ago
- Keep learning something new☆21Jan 21, 2022Updated 4 years ago
- ☆17Feb 15, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Random Forest-based "Correlation" measures☆15May 3, 2022Updated 4 years ago
- An end-to-end data engineering pipeline that fetches real-time YouTube analytics and streams them through Kafka for processing with ksqlD…☆16Sep 19, 2023Updated 2 years ago
- Detailed notes and code to learn the basics of machine learning with scikit-learn.☆36Oct 11, 2016Updated 9 years ago
- A GitHub repo with materials for preparing for DP-420: Microsoft Certified: Azure Cosmos DB Developer Specialty certification Exam.☆17Jul 16, 2024Updated last year
- Discover the perfect harmony of tunes and movies!☆10Aug 17, 2023Updated 2 years ago
- 50 Kubernetes Concepts Every DevOps Engineer Should Know, Published by Packt☆42Apr 22, 2026Updated last month
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆45Jan 4, 2024Updated 2 years ago