Banana1206 / TikiAPI-WebScraping
Crawl data from the TIKI e-commerce, designing a data warehouse, implementing an ETL (Extract, Transform, Load) process, and loading the data into MySQL.
☆14Updated last year
Alternatives and similar repositories for TikiAPI-WebScraping:
Users that are interested in TikiAPI-WebScraping are comparing it to the libraries listed below
- Machine Translation using Seq2Seq and Transformer Models☆12Updated last year
- Sample code and documentation for very basic things that I can't remember but want to aggregate in one place☆12Updated 3 years ago
- Tiểu Luận Chuyên Ngành☆11Updated 6 months ago
- This project aims to build a streaming application to perform real-time analytics of Covid-19 related tweets and deploy an ML model for r…☆12Updated 3 years ago
- Nyc_Taxi_Data_Pipeline - DE Project☆89Updated 3 months ago
- Analyzing Spotify Data with Pyspark and ETL Procedures☆23Updated 3 months ago
- Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and tr…☆10Updated last year
- Sentiment Analysis for Vietnamese Language☆14Updated 4 years ago
- ☆28Updated 11 months ago
- ☆27Updated 9 months ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆100Updated 2 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project☆22Updated 2 years ago
- This repo gives an introduction to setting up streaming analytics using open source technologies☆24Updated last year
- Classwork projects and home works done through Udacity data engineering nano degree☆74Updated last year
- ☆19Updated last year
- Built a real-time streaming pipeline to extract stock data, using Apache Nifi, Debezium, Kafka, and Spark Streaming. Loaded the transform…☆25Updated last year
- ☆22Updated 10 months ago
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development☆21Updated 5 years ago
- ☆189Updated last year
- ☆48Updated 4 months ago
- Vietnam stock price crawling☆18Updated 2 years ago
- End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore,…☆25Updated 3 months ago
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆24Updated last year
- ☆60Updated last year
- Simple ETL pipeline using Python☆24Updated last year
- ☆44Updated 3 weeks ago
- Public Docker Images for popular services☆17Updated 3 weeks ago
- Conduct a Report and Analysis on 200,000 sales data points to answer revenue-related questions for the business☆22Updated 3 years ago
- I am using confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti…☆29Updated last year
- ☆14Updated last year