A curated list of data engineering tools for software developers
☆13Jan 8, 2019Updated 7 years ago
Alternatives and similar repositories for awesome-data-engineering
Users that are interested in awesome-data-engineering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Matching algorithms for LightGraphs.jl☆13Oct 21, 2021Updated 4 years ago
- This project deals with vulnerability analysis and classification using machine learning techniques i.e. Natural Language Processing.☆10Feb 21, 2019Updated 7 years ago
- ☆15May 31, 2023Updated 2 years ago
- A curated list of Big data papers reading for anyone who are eager to learn!☆27Dec 22, 2024Updated last year
- Reduce the visibility of elements in a Rust code base☆17Nov 9, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Big Data Resources and References☆13Sep 4, 2024Updated last year
- Perverse implementations of safe Rust traits☆22Dec 21, 2025Updated 3 months ago
- A collection of kafka-resources☆212Jan 30, 2026Updated last month
- A curated list of awesome Programming Best Practices 2023☆11Jan 2, 2023Updated 3 years ago
- A curated list of awesome Snowflake analytic data warehouse learning resources☆23Mar 1, 2021Updated 5 years ago
- Capstone project for Galvanize - Data Science Immersive. 'Project Plotline' looks at the emotional content of movie scripts (web scraping…☆16Sep 27, 2016Updated 9 years ago
- Recon - A fast algorithm to compute Reeb graphs☆16Aug 27, 2014Updated 11 years ago
- Embeddable user comments.☆131Mar 2, 2013Updated 13 years ago
- This Java library has been designed to facilitate leader election within Kafka clusters providing an efficient and robust solution for di…☆30Jun 9, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Big Data Architect’s Handbook, published by Packt☆21Jan 30, 2023Updated 3 years ago
- A simple microservice project using Python, RabbitMQ, Nameko and Flask☆19May 28, 2016Updated 9 years ago
- A curated list of awesome Databricks resources, including Spark☆22Jun 28, 2024Updated last year
- This reference architecture demonstrates the use of AWS Step Functions to orchestrate an Extract Transfer Load (ETL) workflow with AWS La…☆24Jun 16, 2020Updated 5 years ago
- I self petitioned my EB1A and got approved. This repository contains my original petition, RFE response, and link to resources I used.☆27Mar 18, 2026Updated last week
- Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets.☆13Jul 16, 2019Updated 6 years ago
- This is a material for 2 days machine learning workshop conducted in Chennai on Jan 6th and 7th☆15Feb 6, 2018Updated 8 years ago
- Orchestrate is a blockchain Transaction Orchestration system that can operate multiple chains simultaneously☆22Jun 24, 2024Updated last year
- Flow algorithms on LightGraphs☆35Jan 22, 2022Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Zabbix-Monitoring Kafka集群 Brokers服务,Kafka Consumer Monitoring☆11Jun 7, 2017Updated 8 years ago
- A collection of books/articles/resources that helped me growing as a manager☆55Mar 20, 2022Updated 4 years ago
- ☆15Jul 31, 2022Updated 3 years ago
- Predicting air pollution (machine learning project)☆21Feb 19, 2017Updated 9 years ago
- Life-cycle: Internal working of HDFS, SQOOP, HIVE, SPARK, HBASE, KAFKA with code.☆15Sep 10, 2019Updated 6 years ago
- Build ML model with meaningful variables. Use model for predictions☆17Nov 22, 2022Updated 3 years ago
- Real world Machine Learning Projects using TensorFlow by Packt Publishing☆14Jan 15, 2021Updated 5 years ago
- Flooding substrate node with transactions☆27Mar 26, 2022Updated 3 years ago
- Alternative fee mechanics for managing token assets on Substrate.☆27Nov 21, 2019Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Blog for x64dbg.☆13Jun 30, 2025Updated 8 months ago
- Seed project for C++ projects☆27Jul 14, 2021Updated 4 years ago
- This repo stores my Spark Tutorial slides.☆15Feb 8, 2016Updated 10 years ago
- Fundamental analysis for stocks and shares. ETL processes written in python using pandas☆21May 10, 2016Updated 9 years ago
- Political Discourse Analysis (PDA) of Political Speech Transcripts using Natural Language Processing (NLP)☆16Apr 28, 2021Updated 4 years ago
- Ansible Squid role☆13Sep 24, 2018Updated 7 years ago
- Fully automated dev environment setup with dotfiles☆11Apr 7, 2025Updated 11 months ago