Here I will be exploring various tools and methods that are used in data engineering process with Python.
☆21Jan 4, 2021Updated 5 years ago
Alternatives and similar repositories for data-engineering-with-python
Users that are interested in data-engineering-with-python are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Aqui você encontra os materiais de um curso COMPLETO de Graduação em Ciência da Computação de uma das melhores universidades do Brasil.☆10Aug 7, 2020Updated 5 years ago
- Kubeflow MLOps pipeline using GitHub Actions☆13Feb 7, 2023Updated 3 years ago
- Desafio de Full-Stack do processo de recrutamento da Estudar com Você☆11Aug 5, 2019Updated 6 years ago
- Programming solution for Hackerrank certification questions. Language: Python3☆12Aug 24, 2023Updated 2 years ago
- Analyzing the most strategic words to guess on Wordle, based on letter frequency distributions☆11Feb 20, 2022Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A Discord Bot for distilling papers, GitHub repos, Blogposts, and much more using the power of LLMs and vector search.☆13May 3, 2023Updated 3 years ago
- ☆11Apr 9, 2022Updated 4 years ago
- A Pyspark job to handle upserts, conversion to parquet and create partitions on S3☆27Jul 23, 2020Updated 5 years ago
- How Will Your Tweet Be Received? Predicting theSentiment Polarity of Tweet Replies☆11Aug 29, 2021Updated 4 years ago
- Implementation of Quantum Perceptron: An Artificial Neuron Implemented on an Actual Quantum Processor☆10Mar 30, 2025Updated last year
- Variant for Visual GPT with better UI using h2o Wave & Single GPU support☆15Mar 12, 2023Updated 3 years ago
- Deployed an kafka instance in AWS EC2 Instance to streamline the data into Databricks☆10Aug 15, 2023Updated 2 years ago
- This is the final project that after participated the Data Engineering Zoomcamp☆11Apr 4, 2022Updated 4 years ago
- Teste de rover para back-end☆18Dec 10, 2019Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆11Feb 29, 2024Updated 2 years ago
- My first attempt at a rough ETL pipeline; technologies include spark, GCS, prefect orchestration, and terraform☆14Oct 12, 2022Updated 3 years ago
- Data Engineering Capstone☆17Oct 10, 2019Updated 6 years ago
- MoodCat😼 classifies the mood of English sentences.☆14Jun 19, 2022Updated 3 years ago
- Comprehensive Python Plotly tutorial & cheat sheet. Covers plotly.express, graph_objects & figure_factory for Data Science, 3D plotting, …☆22Dec 3, 2025Updated 5 months ago
- Easily import a module and mock its dependencies in an isolated way.☆13May 19, 2022Updated 4 years ago
- This repository consists of the IPython Notebook for the work related to audio processing and implementing convolution neural networks fo…☆13Feb 13, 2019Updated 7 years ago
- ☆21Jul 17, 2023Updated 2 years ago
- E-commerce intelligent search platform. Pinecone/Devpost Hackathon 2023.☆16Aug 30, 2025Updated 8 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Tutorial for understanding CNN basics, video at http://online.codingblocks.com Machine Learning/Deep Learning course.☆12Mar 22, 2019Updated 7 years ago
- Predicting the medal table of the Summer Games☆12Jul 6, 2023Updated 2 years ago
- remaking Aman Kharwal's 60 Python Projects with Source Code☆17Feb 23, 2021Updated 5 years ago
- Keep learning something new☆21Jan 21, 2022Updated 4 years ago
- ☆17Feb 15, 2023Updated 3 years ago
- Random Forest-based "Correlation" measures☆15May 3, 2022Updated 4 years ago
- An end-to-end data engineering pipeline that fetches real-time YouTube analytics and streams them through Kafka for processing with ksqlD…☆16Sep 19, 2023Updated 2 years ago
- A GitHub repo with materials for preparing for DP-420: Microsoft Certified: Azure Cosmos DB Developer Specialty certification Exam.☆17Jul 16, 2024Updated last year
- 【Python / Streamlit】Pokemon Sleep 小幫手(寶可夢潛力計算、食譜篩選、寶可夢資訊)☆14May 4, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Detailed notes and code to learn the basics of machine learning with scikit-learn.☆36Oct 11, 2016Updated 9 years ago
- ☆16Aug 13, 2020Updated 5 years ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆44Jan 4, 2024Updated 2 years ago
- Repository for my Talos Linux/Kubernetes cluster☆22Updated this week
- ☆17Jun 23, 2024Updated last year
- Code and documentation for the demonstration example of the real-time bushfire alerting with the Complex Event Processing (CEP) in Apache…☆26Sep 14, 2018Updated 7 years ago
- meta_llama_2finetuned_text_generation_summarization☆21Jul 21, 2023Updated 2 years ago