Repo to migrate old wiki to, esp for devs and code examples
☆181 · Oct 18, 2016 · Updated 9 years ago
Alternatives and similar repositories for data-engineering-ecosystem
Users that are interested in data-engineering-ecosystem are comparing it to the libraries listed below.
- Sharing interesting and noteworthy Data Engineering content ☆68 · Oct 21, 2016 · Updated 9 years ago
- ☆14 · Jun 27, 2017 · Updated 9 years ago
- An API to Analyze Cab GeoLocation Data and a Simulated App for finding an available cab in Real-Time ☆62 · Feb 23, 2015 · Updated 11 years ago
- Directions and Source code for Insight's Docker workshop. ☆22 · Jun 21, 2022 · Updated 4 years ago
- Red Hat's business logic for maintaining marketing data quality ☆12 · Oct 21, 2021 · Updated 4 years ago
- A way for home buyers to know about factors affecting a state ☆48 · Mar 2, 2019 · Updated 7 years ago
- KDD Hands-On Tutorial (2018) ☆29 · Dec 8, 2022 · Updated 3 years ago
- ☆25 · Aug 23, 2017 · Updated 8 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups ☆898 · May 8, 2022 · Updated 4 years ago
- ☆31 · Jun 4, 2020 · Updated 6 years ago
- Building Scio from scratch step by step ☆20 · May 20, 2019 · Updated 7 years ago
- A curated list of data engineering tools for software developers ☆8,773 · Jun 22, 2026 · Updated last week
- How to build an awesome data engineering team ☆101 · Sep 11, 2019 · Updated 6 years ago
- Random implementation notes ☆34 · Apr 23, 2013 · Updated 13 years ago
- In the Data Science and Engineering program, engineering professionals combine the skills of software programmer, database manager, and s… ☆28 · Nov 4, 2017 · Updated 8 years ago
- WARNING: This repository is no longer maintained. The Insights for Twitter service from IBM Cloud has been sunset. This repository will n… ☆11 · Apr 10, 2019 · Updated 7 years ago
- Examples of deploying scikit, spaCy and Keras (TensorFlow) machine learning models to AWS Lambda with Serverless framework and Python 3. ☆31 · Dec 8, 2022 · Updated 3 years ago
- Data pipeline is a tool to run Data loading pipelines. It is an open sourced app engine app that users can extend to suit their own needs… ☆87 · Feb 11, 2014 · Updated 12 years ago
- Miscellaneous Projects ☆16 · Sep 20, 2020 · Updated 5 years ago
- Projects done in the Data Engineering Nanodegree by Udacity.com ☆275 · Mar 1, 2026 · Updated 3 months ago
- An example todo application using Posh ☆26 · Jun 26, 2016 · Updated 10 years ago
- ☆11 · Jul 15, 2014 · Updated 11 years ago
- AWS lambda functions - utilities ☆12 · Jul 8, 2017 · Updated 8 years ago
- Luigi integration for Google BigQuery ☆15 · Nov 18, 2015 · Updated 10 years ago
- Example end to end data engineering project. ☆1,414 · Dec 8, 2022 · Updated 3 years ago
- This repo contains commands that data engineers use in day to day work. ☆62 · Feb 4, 2023 · Updated 3 years ago
- A Rust based deduplication tool ☆34 · Jun 26, 2025 · Updated last year
- ☆11 · Jan 8, 2023 · Updated 3 years ago
- Code to build a simple analytics data pipeline with Python ☆102 · Mar 11, 2017 · Updated 9 years ago
- Some thoughts on how to use machine learning in production ☆70 · May 17, 2017 · Updated 9 years ago
- The Data Engineering Cookbook ☆15,162 · Jun 12, 2026 · Updated 2 weeks ago
- A list of useful resources to learn Data Engineering from scratch ☆3,996 · Jun 19, 2024 · Updated 2 years ago
- ☆14 · Nov 28, 2022 · Updated 3 years ago
- pip-installable SQLite extensions ☆15 · Feb 23, 2023 · Updated 3 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow ☆167 · Jun 16, 2020 · Updated 6 years ago
- Data Mining and Analytics in Intelligent Business Services, UC Berkeley School of Information ☆20 · May 17, 2013 · Updated 13 years ago
- calling R from a Rails app ☆10 · Mar 17, 2016 · Updated 10 years ago
- Python library bindings for the Semantics3 APIs ☆21 · Mar 17, 2022 · Updated 4 years ago
- ☆51 · May 21, 2026 · Updated last month