Banana1206 / TikiAPI-WebScraping
Crawl data from the TIKI e-commerce, designing a data warehouse, implementing an ETL (Extract, Transform, Load) process, and loading the data into MySQL.
☆14Updated last year
Alternatives and similar repositories for TikiAPI-WebScraping:
Users that are interested in TikiAPI-WebScraping are comparing it to the libraries listed below
- Machine Translation using Seq2Seq and Transformer Models☆12Updated last year
- Tiểu Luận Chuyên Ngành☆10Updated 7 months ago
- Sample code and documentation for very basic things that I can't remember but want to aggregate in one place☆13Updated 3 years ago
- Nyc_Taxi_Data_Pipeline - DE Project☆94Updated 4 months ago
- Data Pipeline from the Global Historical Climatology Network DataSet☆26Updated 2 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project☆22Updated 2 years ago
- Analyzing Spotify Data with Pyspark and ETL Procedures☆23Updated 4 months ago
- End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore,…☆29Updated 4 months ago
- My documents for self-learning fundamental of Data engineering skills☆12Updated last year
- ☆50Updated 5 months ago
- A Data Engineering Project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, Dbt, Polars, Doc…☆20Updated 3 months ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆34Updated last year
- This repo gives an introduction to setting up streaming analytics using open source technologies☆24Updated last year
- Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and tr…☆10Updated last year
- Fully dockerized Data Warehouse (DWH) using Airflow, dbt, PostgreSQL and dashboard using redash☆24Updated 2 years ago
- This project implements an ELT (Extract - Load - Transform) data pipeline with the goodreads dataset, using dagster (orchestration), spar…☆34Updated last year
- Classwork projects and home works done through Udacity data engineering nano degree☆8Updated 3 years ago
- Sentiment Analysis for Vietnamese Language☆14Updated 4 years ago
- Writes the CSV file to Postgres, read table and modify it. Write more tables to Postgres with Airflow.☆36Updated last year
- Built a real-time streaming pipeline to extract stock data, using Apache Nifi, Debezium, Kafka, and Spark Streaming. Loaded the transform…☆25Updated last year
- End-to-End BI & DW project: Data Warehousing design and modeling (MySQL), ETL (PDI) and Dashboard (Tableau)☆15Updated 4 years ago
- Vietnam stock price crawling☆19Updated 2 years ago
- Code Repository for my 3rd Data Project.☆14Updated last year
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆79Updated 6 months ago
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆117Updated last year
- ☆41Updated 7 months ago
- Building ETL Pipelines with Python☆124Updated 7 months ago
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆24Updated last year
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆107Updated 2 years ago
- End-to-end ELT data engineering project☆20Updated 2 years ago