Sample Project to Learn Data Engineering
☆10Aug 1, 2021Updated 4 years ago
Alternatives and similar repositories for learnDataEngineering
Users that are interested in learnDataEngineering are comparing it to the libraries listed below
Sorting:
- Build a Content-Based Movie Recommender System (TF-IDF, BM25, BERT)☆13Jun 13, 2022Updated 3 years ago
- Template for Data Engineering and Data Pipeline projects☆117Jan 1, 2023Updated 3 years ago
- Some example projects for Data Engineers to build, end-to-end.☆38Nov 8, 2023Updated 2 years ago
- ☆16Mar 11, 2026Updated last week
- Dockerizing and Consuming an Apache Livy environment☆13Jun 29, 2022Updated 3 years ago
- Repository for the Document streaming capstone projects☆12Nov 17, 2025Updated 4 months ago
- Dockerizing an Apache Spark Standalone Cluster☆42Jun 29, 2022Updated 3 years ago
- Welcome to my data engineering projects repository! Here you will find a collection of data engineering projects that I have worked on.☆24Apr 27, 2023Updated 2 years ago
- A parallel implementation of the bzip2 data compressor in python, this data compression pipeline is using algorithms like Burrows–Wheeler…☆13Jun 29, 2022Updated 3 years ago
- Repository for Apache Spark course at Team Data Science☆17Oct 23, 2020Updated 5 years ago
- Threat Network Detection in Online Social Networks☆10Jan 20, 2017Updated 9 years ago
- Dockerizing a Python Script for Web Scraping and consume the scraped data using FastApi (www.metroscubicos.com)☆15Dec 16, 2021Updated 4 years ago
- Repo for CDC with debezium blog post☆29Sep 15, 2024Updated last year
- Oyedata is a tool to perform OData assessments☆13Aug 3, 2012Updated 13 years ago
- Scheduling Big Data Workloads and Data Pipelines in the Cloud with pyDag☆23Sep 19, 2022Updated 3 years ago
- Integrating Symbolic Programming and Neuromorphic Modeling for Edge Labs with NVIDIA Jetson, DGX Spark, and GPU-based DNN/ML Systems☆15Mar 3, 2026Updated 2 weeks ago
- Challenge Data Engineer☆25Jun 13, 2022Updated 3 years ago
- DBCA IT assets management system☆13Updated this week
- Project used to generate ML.NET AutoML code for machine learning.☆11Jul 19, 2021Updated 4 years ago
- Companion to textbook "Decision Support Systems: Introduction to Data Science with Applications"☆26Sep 21, 2022Updated 3 years ago
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/☆13Jul 1, 2024Updated last year
- 🏠 OCR action for Google Assistant/Home☆10Feb 2, 2018Updated 8 years ago
- This is a crawler that can scrap Facebook public posts, comments and also scrap the mentions & reactions.☆10Dec 7, 2022Updated 3 years ago
- Project overview, roadmap and initial result reports☆11Aug 6, 2022Updated 3 years ago
- This repository contains study materials in the form of presentations (and Python codes) to various Machine Learning techniques and also …☆26Jun 5, 2020Updated 5 years ago
- The goal of this project is to identify students at risk of dropping out the school☆22May 7, 2021Updated 4 years ago
- Sharp AGI - Dotnet☆13Apr 27, 2023Updated 2 years ago
- Red Pebble, a Google Assistant & Google Home bot, that tells you everything about the weather on Mars! A Google Actions Hackathon project…☆11Oct 11, 2018Updated 7 years ago
- My README profile☆21Jan 11, 2026Updated 2 months ago
- MEMEX Weapons Pilot for the illegal weapons domain.☆15May 20, 2016Updated 9 years ago
- An API that encodes data privacy and protection laws from around the world and returns risk and compliance assessments☆11Jul 3, 2017Updated 8 years ago
- Lin Pengcheng Financial Analyser Homepage (林鹏程财务分析软件)☆12Jan 16, 2021Updated 5 years ago
- A drop-in "fs" replacement for accessing Azure Storage with Node.js "fs" API☆10Jul 9, 2020Updated 5 years ago
- ☆15Jun 20, 2017Updated 8 years ago
- DISCO: Comprehensive and Explainable Disinformation Detection, CIKM 2022☆10May 5, 2023Updated 2 years ago
- how to unit test your PySpark code☆29Mar 26, 2021Updated 4 years ago
- A Flask application for analyzing activity on an online discussion forum, using scraping, indexing, analytics, relational graph and NLP.☆11Nov 24, 2020Updated 5 years ago
- This collaborative resource aims at empowering all actors countering information manipulation to grow and improve.☆16Dec 23, 2025Updated 2 months ago
- Samples on how to import data (Flat Files, CSV, JSON) in Azure SQL☆23Oct 6, 2022Updated 3 years ago