☆32Aug 13, 2018Updated 7 years ago
Alternatives and similar repositories for data-engineering
Users that are interested in data-engineering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Jul 13, 2020Updated 5 years ago
- Aqui você encontra os materiais de um curso COMPLETO de Graduação em Ciência da Computação de uma das melhores universidades do Brasil.☆10Aug 7, 2020Updated 5 years ago
- ☆19Dec 16, 2021Updated 4 years ago
- Example repo to create end to end tests for data pipeline.☆25Jun 14, 2024Updated last year
- notebooks produced throughout the Udacity's Nanodegree Data Engineering Course☆74Oct 3, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Tarot widget for website☆12Jan 6, 2023Updated 3 years ago
- My solutions for the Udacity Data Engineering Nanodegree☆34Oct 14, 2019Updated 6 years ago
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆26Nov 8, 2022Updated 3 years ago
- A boilerplate project for Azure Big Data PaaS services☆14Dec 7, 2022Updated 3 years ago
- A project portfolio to accompany my resume☆30Sep 5, 2023Updated 2 years ago
- ☆28Nov 10, 2021Updated 4 years ago
- Notebooks para prática de conceitos de ML☆10Apr 4, 2023Updated 3 years ago
- Browser Automation with Python and Selenium by Packt Publishing☆11Jan 30, 2023Updated 3 years ago
- Introduction to MLflow and Using MLflow with an Anaconda Environment☆11Sep 17, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Source Code for 'Beginning Apache Spark 3' by Hien Luu☆13Oct 14, 2021Updated 4 years ago
- ecommerce GCP Streaming pipeline ― Cloud Storage, Compute Engine, Pub/Sub, Dataflow, Apache Beam, BigQuery and Tableau; GCP Batch pipelin…☆11Mar 9, 2022Updated 4 years ago
- Project to experiment with a microservices architecture based on Apache Kafka☆23Jul 8, 2023Updated 2 years ago
- 3 proyectos de Web Scraping utilizando BeautifulSoup, Selenium, APIs respectivamente.☆10Nov 10, 2020Updated 5 years ago
- ☆41Nov 2, 2021Updated 4 years ago
- Distributed Data Systems with Azure Databricks, published by Packt☆12Jan 18, 2023Updated 3 years ago
- Sentiment Analyzer para Twitter en español mediante NLP y machine learning☆11Jan 25, 2021Updated 5 years ago
- Book Website: Dynamic System Modelling & Analysis with MATLAB & Pythobn☆24Jul 5, 2022Updated 3 years ago
- My MSc project☆14Jun 5, 2011Updated 15 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆47Oct 23, 2021Updated 4 years ago
- Data Brewery is an ETL (Extract-Transform-Load) program that connect to many data sources (cloud services, databases, ...) and manage dat…☆16Jan 21, 2021Updated 5 years ago
- An End-to-End ETL data pipeline that leverages pyspark parallel processing to process about 25 million rows of data coming from a SaaS ap…☆25Dec 7, 2022Updated 3 years ago
- En este proyecto de GitHhub podrás encontrar parte del material que utilizo para impartir las clases del módulo introductorio de Reinforc…☆11Apr 22, 2022Updated 4 years ago
- 🐋 Docker image for AWS Glue Spark/Python☆23Sep 5, 2023Updated 2 years ago
- ☆13Oct 6, 2019Updated 6 years ago
- Google FSI Accelerator Pattern☆13Jun 18, 2024Updated last year
- This repository contains all the resources and solution to quizzes given and asked in IBM Data Science Professional Certification.☆13Sep 16, 2022Updated 3 years ago
- Easy and simple song downloader for downloading a song just by entering name.☆10Aug 24, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆13May 1, 2020Updated 6 years ago
- This sample shows how to create two Azure Container Apps that use OpenAI, LangChain, ChromaDB, and Chainlit using Terraform.☆11May 7, 2024Updated 2 years ago
- Library for converting pandas dataframes into pydantic models☆17Mar 30, 2025Updated last year
- Here I will be exploring various tools and methods that are used in data engineering process with Python.☆21Jan 4, 2021Updated 5 years ago
- ☆21Jan 14, 2016Updated 10 years ago
- A web app that reads a list of urls and displays their current status.☆12Dec 4, 2024Updated last year
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce☆15May 22, 2023Updated 3 years ago