Sample Project to Learn Data Engineering
☆10Aug 1, 2021Updated 4 years ago
Alternatives and similar repositories for learnDataEngineering
Users that are interested in learnDataEngineering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Build a Content-Based Movie Recommender System (TF-IDF, BM25, BERT)☆15Jun 13, 2022Updated 3 years ago
- Template for Data Engineering and Data Pipeline projects☆120Jan 1, 2023Updated 3 years ago
- Some example projects for Data Engineers to build, end-to-end.☆39Nov 8, 2023Updated 2 years ago
- ☆16Updated this week
- Dockerizing and Consuming an Apache Livy environment☆13Jun 29, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Repository for the Document streaming capstone projects☆12Nov 17, 2025Updated 6 months ago
- Dockerizing an Apache Spark Standalone Cluster☆42Jun 29, 2022Updated 3 years ago
- Welcome to my data engineering projects repository! Here you will find a collection of data engineering projects that I have worked on.☆25Apr 27, 2023Updated 3 years ago
- A parallel implementation of the bzip2 data compressor in python, this data compression pipeline is using algorithms like Burrows–Wheeler…☆14Jun 29, 2022Updated 3 years ago
- Dockerizing a Python Script for Web Scraping and consume the scraped data using FastApi (www.metroscubicos.com)☆15Dec 16, 2021Updated 4 years ago
- Repository for Apache Spark course at Team Data Science☆17Oct 23, 2020Updated 5 years ago
- Repo for CDC with debezium blog post☆29Sep 15, 2024Updated last year
- Oyedata is a tool to perform OData assessments☆13Aug 3, 2012Updated 13 years ago
- Threat Network Detection in Online Social Networks☆12Jan 20, 2017Updated 9 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Scheduling Big Data Workloads and Data Pipelines in the Cloud with pyDag☆23Sep 19, 2022Updated 3 years ago
- Integrating Symbolic Programming and Neuromorphic Modeling for Edge Labs with NVIDIA Jetson, DGX Spark, and GPU-based DNN/ML Systems☆16Updated this week
- Challenge Data Engineer☆25Jun 13, 2022Updated 3 years ago
- DBCA IT assets management system☆13Updated this week
- Project used to generate ML.NET AutoML code for machine learning.☆11Jul 19, 2021Updated 4 years ago
- Companion to textbook "Decision Support Systems: Introduction to Data Science with Applications"☆26Sep 21, 2022Updated 3 years ago
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/☆13Jul 1, 2024Updated last year
- 🏠 OCR action for Google Assistant/Home☆10Feb 2, 2018Updated 8 years ago
- This is a crawler that can scrap Facebook public posts, comments and also scrap the mentions & reactions.☆10Dec 7, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- This repository contains study materials in the form of presentations (and Python codes) to various Machine Learning techniques and also …☆26Jun 5, 2020Updated 6 years ago
- Project overview, roadmap and initial result reports☆11Aug 6, 2022Updated 3 years ago
- Red Pebble, a Google Assistant & Google Home bot, that tells you everything about the weather on Mars! A Google Actions Hackathon project…