Dockerizing a Python Script for Web Scraping and consume the scraped data using FastApi (www.metroscubicos.com)
☆15Dec 16, 2021Updated 4 years ago
Alternatives and similar repositories for data-engineering-challenge-th
Users that are interested in data-engineering-challenge-th are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Challenge Data Engineer☆25Jun 13, 2022Updated 4 years ago
- The goal of this project is to identify students at risk of dropping out the school☆22May 7, 2021Updated 5 years ago
- Build a Content-Based Movie Recommender System (TF-IDF, BM25, BERT)☆15Jun 13, 2022Updated 4 years ago
- A parallel implementation of the bzip2 data compressor in python, this data compression pipeline is using algorithms like Burrows–Wheeler…☆14Jun 29, 2022Updated 3 years ago
- Term Frequency-Inverse Document Frequency from Scratch☆14Sep 19, 2021Updated 4 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- The goal of this project is to track the expenses of Uber Rides and Uber Eats through data Engineering processes using technologies such …☆123Jun 29, 2022Updated 3 years ago
- Repository for the Document streaming capstone projects☆12Nov 17, 2025Updated 6 months ago
- Repository for Apache Spark course at Team Data Science☆17Oct 23, 2020Updated 5 years ago
- NLP Model for predicting 17 different languages☆16Oct 19, 2023Updated 2 years ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on…☆29Jun 13, 2022Updated 4 years ago
- Price Crawler - Tracking Price Inflation☆205Jun 23, 2020Updated 5 years ago
- Welcome! This dbt project is built to be imported to a freshly-initialized dbt project to work through the hands-on zero to dbt lab detai…☆19Apr 27, 2023Updated 3 years ago
- A tool to automatically infer columns data types in .csv files☆37Jan 28, 2023Updated 3 years ago
- All important Python tools a Data Engineer needs☆28Jun 4, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Course Material Data Engineering on AWS Course☆31Sep 9, 2024Updated last year
- Tablas de código postal argentino☆14Jul 16, 2017Updated 8 years ago
- A simple Flask app that uses render_template, to teach different features of Flask.☆10Feb 2, 2023Updated 3 years ago
- Python Wrapper for Survey Solutions API☆11Mar 26, 2026Updated 2 months ago
- Migrated out of GitHub☆11Jan 10, 2021Updated 5 years ago
- Dockerizing an Apache Spark Standalone Cluster☆42Jun 29, 2022Updated 3 years ago
- Download Google Map Satellite Image Using Python☆15Sep 26, 2019Updated 6 years ago
- Downloads OCDS data and stores it on disk☆16Jun 3, 2026Updated last week
- Docker-compose samples☆15May 10, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This repository is a part of http://digiwhist.eu/ project. It contains source codes of a data processing system that ensures a collection…☆11Mar 27, 2023Updated 3 years ago
- ☆38Jul 18, 2023Updated 2 years ago
- A leafletjs plugin to filter geojson marker based on its properties☆14Aug 16, 2016Updated 9 years ago
- Docker Blueprint for a GeoNode Installation☆14Jul 8, 2025Updated 11 months ago
- A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large …☆17Updated this week
- Ultra Fast Multi-Modality Vector Database☆18Feb 21, 2024Updated 2 years ago
- A Flat Data GitHub Action demo repo☆20Feb 11, 2022Updated 4 years ago
- Automatic Table reader. Can extract table data from images.☆15Dec 1, 2018Updated 7 years ago
- Convert audio file to text☆15Jun 18, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Telegram bot to use ChatGPT with vocal commands☆17Mar 8, 2023Updated 3 years ago
- Estándares de trabajo del equipo Datos Argentina.☆12Apr 22, 2021Updated 5 years ago
- Código de @columnistos y sus hermanas☆12May 14, 2024Updated 2 years ago
- ☆11Apr 26, 2022Updated 4 years ago
- Cheat sheet for GDAL/OGR command-line tools☆23Oct 19, 2015Updated 10 years ago
- CLI Java wrapper for the PhotoDNA library☆26Nov 11, 2025Updated 7 months ago
- ICDE 2024 Paper, MetaSQL: A Generate-then-Rank Framework for Natural Language to SQL Translation☆27May 9, 2025Updated last year