Repo to migrate the old wiki to, especially for devs and code examples
☆183 · Oct 18, 2016 · Updated 9 years ago
Alternatives and similar repositories for data-engineering-ecosystem
Users that are interested in data-engineering-ecosystem are comparing it to the libraries listed below.
- Sharing interesting and noteworthy Data Engineering content ☆69 · Oct 21, 2016 · Updated 9 years ago
- VM based deployment for prototyping Big Data tools on Amazon Web Services ☆130 · May 15, 2020 · Updated 6 years ago
- ☆14 · Jun 27, 2017 · Updated 8 years ago
- An API to Analyze Cab GeoLocation Data and a Simulated App for finding an available cab in Real-Time ☆62 · Feb 23, 2015 · Updated 11 years ago
- ☆13 · Oct 23, 2018 · Updated 7 years ago
- A way for home buyers to know about factors affecting a state ☆48 · Mar 2, 2019 · Updated 7 years ago
- Clone Git repository faster. Eliminates the repetitive typing of git clone and copy-pasting the url ☆16 · Dec 17, 2017 · Updated 8 years ago
- ☆25 · Aug 23, 2017 · Updated 8 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups ☆898 · May 8, 2022 · Updated 4 years ago
- This is a repo with links to everything you'd ever want to learn about data engineering ☆12 · Dec 3, 2024 · Updated last year
- A curated list of data engineering tools for software developers ☆8,650 · Updated this week
- Building Scio from scratch step by step ☆20 · May 20, 2019 · Updated 7 years ago
- ☆31 · Jun 4, 2020 · Updated 5 years ago
- How to build an awesome data engineering team ☆101 · Sep 11, 2019 · Updated 6 years ago
- In the Data Science and Engineering program, engineering professionals combine the skills of software programmer, database manager, and s… ☆29 · Nov 4, 2017 · Updated 8 years ago
- Udacity Data Engineering Nano Degree (DEND) ☆188 · Jan 20, 2020 · Updated 6 years ago
- Examples of deploying scikit, spaCy and Keras (TensorFlow) machine learning models to AWS Lambda with Serverless framework and Python 3. ☆31 · Dec 8, 2022 · Updated 3 years ago
- My Data Engineering project @ Insight Data Science ☆10 · Jul 23, 2018 · Updated 7 years ago
- Variational Factorization Machines ☆17 · Dec 20, 2016 · Updated 9 years ago
- Luigi integration for Google BigQuery ☆15 · Nov 18, 2015 · Updated 10 years ago
- Example end to end data engineering project. ☆1,410 · Dec 8, 2022 · Updated 3 years ago
- Provides different code samples for Apache Beam and DataFlow ☆14 · Sep 29, 2023 · Updated 2 years ago
- Twitter-Kafka Data Pipeline ☆16 · Nov 19, 2024 · Updated last year
- DEPRECATED! - utility classes to set up logging in a Spotify compatible way ☆24 · Jan 21, 2025 · Updated last year
- ☆11 · Jan 8, 2023 · Updated 3 years ago
- Introduction to Scientific Python ☆15 · Oct 10, 2019 · Updated 6 years ago
- A library, that provides Conflict Free Replicated Data Types (CRDTs) for distributed Python applications. ☆17 · Jan 10, 2019 · Updated 7 years ago
- Code to build a simple analytics data pipeline with Python ☆102 · Mar 11, 2017 · Updated 9 years ago
- Some thoughts on how to use machine learning in production ☆71 · May 17, 2017 · Updated 9 years ago
- The Data Engineering Cookbook ☆15,088 · Jan 17, 2026 · Updated 4 months ago
- Cloudformation template for deploying Presto on AWS ☆13 · Jul 20, 2020 · Updated 5 years ago
- A list of useful resources to learn Data Engineering from scratch ☆3,993 · Jun 19, 2024 · Updated last year
- The open source version of the Amazon Redshift Cluster Management Guide. ☆48 · Jun 15, 2023 · Updated 2 years ago
- Resources for software/backend/data learning | #SE | #DE | #DS ☆17 · Nov 16, 2025 · Updated 6 months ago
- Companion code for the Mastering Advanced Scala book https://leanpub.com/mastering-advanced-scala ☆35 · Mar 20, 2021 · Updated 5 years ago
- ☆14 · Nov 28, 2022 · Updated 3 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow ☆166 · Jun 16, 2020 · Updated 5 years ago
- ☆13 · Feb 26, 2025 · Updated last year
- Source code for 'Practical Business Analytics Using SAS' by Shailendra Kadre and Venkat Reddy Konasani ☆10 · Mar 28, 2017 · Updated 9 years ago