VinhQuocTran / Batdongsan-Scrapping-ETL-PipelineView external linksLinks
☆10Feb 27, 2024Updated last year
Alternatives and similar repositories for Batdongsan-Scrapping-ETL-Pipeline
Users that are interested in Batdongsan-Scrapping-ETL-Pipeline are comparing it to the libraries listed below
Sorting:
- ☆27Mar 22, 2024Updated last year
- ☆11Dec 28, 2020Updated 5 years ago
- Official implementation of the paper "ALTER: Augmentation for Large-Table-Based Reasoning"☆15Aug 26, 2024Updated last year
- ☆49Aug 14, 2024Updated last year
- ☆13May 1, 2024Updated last year
- DSP functions (utilities) with accompanying examples.☆11Apr 24, 2016Updated 9 years ago
- ☆12Nov 18, 2022Updated 3 years ago
- School Project for Course "System Programming"☆15Jan 16, 2022Updated 4 years ago
- A Python Snowpark CLI for loading the TPC-DI dataset into Snowflake. Additional dbt models for building the data warehouse.☆10Sep 4, 2025Updated 5 months ago
- API/Data Platform for Ingesting, Storing, and Serving Data through Postgres, and Litestar☆11Jan 18, 2026Updated last month
- pfSense Lab in UIT University☆10May 29, 2023Updated 2 years ago
- An end-to-end ELT pipeline to store simulated heart rate data inside a data warehouse; uses Kafka for real-time processing, Airbyte for d…☆14May 28, 2024Updated last year
- Docktor is a Web App that deploys an easy-to-use kit of analysis and scanning tools.☆13Nov 1, 2023Updated 2 years ago
- In this repository you'll find Data Science Projects☆10Mar 6, 2024Updated last year
- This repo shows you how to deploy Streamlit application via Ngrok with Colab server.☆13Dec 13, 2024Updated last year
- Exploring BERT with Kaggle disaster tweets dataset.☆12Jun 9, 2024Updated last year
- This repository contains a Docker Compose configuration for running ScyllaDB, a highly scalable NoSQL database for learning and testing.☆13Sep 19, 2024Updated last year
- End to end data pipeline to extract and analyze submissions from any subreddit using Pushshift, python, dbt and BigQuery.☆12Jul 17, 2023Updated 2 years ago
- A fully serverless, event-driven data pipeline that ingests, enriches, validates, and visualizes real-time news data using AWS services. …☆24Aug 10, 2025Updated 6 months ago
- Instructions and code for the workshop "From Big Data to NLP Insights: Unlocking the Power of PySpark and Spark NLP"☆12May 9, 2023Updated 2 years ago
- Co-occurrence analysis in pubmed and faers of two lists of terms.☆17Mar 8, 2025Updated 11 months ago
- ZMK firmware for Urchin and Corne 36 keyboard with nice!nano and nice!view☆17Jan 16, 2026Updated last month
- ☆10Oct 7, 2024Updated last year
- This project is about building a dimensional data warehouse in BigQuery by transforming an OLTP system to an OLAP system, using dbt as ou…☆13Dec 11, 2023Updated 2 years ago
- Code Repository for my 3rd Data Project.☆16Jun 13, 2023Updated 2 years ago
- A simple implementation of the Unix Shell in the C Programming language. This project was coded and tested in Ubuntu 17.04.☆14Mar 19, 2018Updated 7 years ago
- Vietnamese Large Language Model (LLM) fine-tuned for the task of Question Answering within the medical and healthcare domain☆26Mar 1, 2024Updated last year
- Solution for problems in Applied Algorithm course☆18Mar 9, 2023Updated 2 years ago
- Đồ án cuối kì môn khoa học dữ liệu ứng dụng. Thu thập data bằng cách parsing HTML và sử dụng các mô hình học máy để giải quyết câu hỏi đ…☆13Dec 31, 2020Updated 5 years ago
- The Planet of the Bugs training platform creates fake bug scenarios. It allows developers to practice and hone their skill-sets by exposi…☆14Oct 6, 2023Updated 2 years ago
- ☆19Jul 27, 2023Updated 2 years ago
- PyTorch implementation of CVPR 2020 paper (Reference-Based Sketch Image Colorization using Augmented-Self Reference and Dense Semantic Co…☆18Oct 10, 2021Updated 4 years ago
- ☆21Apr 21, 2025Updated 9 months ago
- API for toxic text classification, utilized pre-trained Distilbert and trained on Kaggle datasets. It helps identify and handle toxic con…☆14Apr 30, 2024Updated last year
- My first attempt at a rough ETL pipeline; technologies include spark, GCS, prefect orchestration, and terraform☆14Oct 12, 2022Updated 3 years ago
- ☆22May 3, 2024Updated last year
- Course Material☆20Aug 11, 2025Updated 6 months ago
- My personal project for data engineering zoomcamp☆18Dec 13, 2024Updated last year
- LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception Tasks☆18Oct 3, 2024Updated last year