☆11Feb 27, 2024Updated 2 years ago
Alternatives and similar repositories for Batdongsan-Scrapping-ETL-Pipeline
Users that are interested in Batdongsan-Scrapping-ETL-Pipeline are comparing it to the libraries listed below
- ☆27Mar 22, 2024Updated last year
- Official implementation of the paper "ALTER: Augmentation for Large-Table-Based Reasoning"☆15Aug 26, 2024Updated last year
- ☆11Dec 28, 2020Updated 5 years ago
- ☆49Aug 14, 2024Updated last year
- ☆13May 1, 2024Updated last year
- A Python Snowpark CLI for loading the TPC-DI dataset into Snowflake. Additional dbt models for building the data warehouse.☆10Sep 4, 2025Updated 6 months ago
- ☆12Nov 18, 2022Updated 3 years ago
- School Project for Course "System Programming"☆15Jan 16, 2022Updated 4 years ago
- API/Data Platform for Ingesting, Storing, and Serving Data through Postgres, and Litestar☆11Jan 18, 2026Updated last month
- pfSense Lab in UIT University☆10May 29, 2023Updated 2 years ago
- DSP functions (utilities) with accompanying examples.☆11Apr 24, 2016Updated 9 years ago
- This repo shows you how to deploy Streamlit application via Ngrok with Colab server.☆13Dec 13, 2024Updated last year
- Docktor is a Web App that deploys an easy-to-use kit of analysis and scanning tools.☆13Nov 1, 2023Updated 2 years ago
- End to end data pipeline to extract and analyze submissions from any subreddit using Pushshift, python, dbt and BigQuery.☆12Jul 17, 2023Updated 2 years ago
- In this repository you'll find Data Science Projects☆10Mar 6, 2024Updated 2 years ago
- A fully serverless, event-driven data pipeline that ingests, enriches, validates, and visualizes real-time news data using AWS services. …☆25Aug 10, 2025Updated 7 months ago
- Exploring BERT with Kaggle disaster tweets dataset.☆12Jun 9, 2024Updated last year
- An end-to-end ELT pipeline to store simulated heart rate data inside a data warehouse; uses Kafka for real-time processing, Airbyte for d…☆14May 28, 2024Updated last year
- This repository contains a Docker Compose configuration for running ScyllaDB, a highly scalable NoSQL database for learning and testing.☆13Sep 19, 2024Updated last year
- Instructions and code for the workshop "From Big Data to NLP Insights: Unlocking the Power of PySpark and Spark NLP"☆12May 9, 2023Updated 2 years ago
- ☆10Oct 7, 2024Updated last year
- ZMK firmware for Urchin and Corne 36 keyboard with nice!nano and nice!view☆17Jan 16, 2026Updated last month
- This project is about building a dimensional data warehouse in BigQuery by transforming an OLTP system to an OLAP system, using dbt as ou…☆13Dec 11, 2023Updated 2 years ago
- Co-occurrence analysis in pubmed and faers of two lists of terms.☆17Mar 8, 2025Updated last year
- Code Repository for my 3rd Data Project.☆16Jun 13, 2023Updated 2 years ago
- The Planet of the Bugs training platform creates fake bug scenarios. It allows developers to practice and hone their skill-sets by exposi…☆14Oct 6, 2023Updated 2 years ago
- ☆19Jul 27, 2023Updated 2 years ago
- Vietnamese Large Language Model (LLM) fine-tuned for the task of Question Answering within the medical and healthcare domain☆26Mar 1, 2024Updated 2 years ago
- Solution for problems in Applied Algorithm course☆18Mar 9, 2023Updated 3 years ago
- PyTorch implementation of CVPR 2020 paper (Reference-Based Sketch Image Colorization using Augmented-Self Reference and Dense Semantic Co…☆18Oct 10, 2021Updated 4 years ago
- Final project for the Applied Data Science course. Collects data by parsing HTML and uses machine learning models to answer the question …☆13Dec 31, 2020Updated 5 years ago
- A simple implementation of the Unix Shell in the C Programming language. This project was coded and tested in Ubuntu 17.04.☆15Mar 19, 2018Updated 7 years ago
- ☆22May 3, 2024Updated last year
- My first attempt at a rough ETL pipeline; technologies include spark, GCS, prefect orchestration, and terraform☆14Oct 12, 2022Updated 3 years ago
- API for toxic text classification, utilized pre-trained Distilbert and trained on Kaggle datasets. It helps identify and handle toxic con…☆14Apr 30, 2024Updated last year
- Template for graduate thesis, especially in FPT University☆19Nov 21, 2024Updated last year
- ☆21Apr 21, 2025Updated 10 months ago
- A high-performance, distributed in-memory cache library for Go that synchronizes local LFU/LRU caches across multiple service instances u…☆65Feb 23, 2026Updated 2 weeks ago
- Course Material☆20Aug 11, 2025Updated 6 months ago