roman79 / DublinDataEngineering
Open-source resources for data engineering, machine learning, and data science, inspired by [The Open-Source Data Science Masters](http://datasciencemasters.org/).
☆8Updated 7 years ago
Alternatives and similar repositories for DublinDataEngineering:
Users who are interested in DublinDataEngineering are comparing it to the repositories listed below.
- Notes, Ideas, and Projects related to my Springboard data science career track☆11Updated 7 years ago
- Udacity Data Pipeline Exercises☆15Updated 4 years ago
- This repository contains code examples for the course CS 20SI: TensorFlow for Deep Learning Research.☆12Updated 8 years ago
- Learn to build a data pipeline with Airflow to automate data wrangling - A Udacity Data Engineer Nanodegree project☆8Updated 5 years ago
- ☆18Updated 7 years ago
- Repo for building a Docker-based Airflow image. Containers support multiple features like writing logs to local or S3 folder and Initializi…☆32Updated 5 years ago
- A tutorial on creating a Python-based prediction web app☆30Updated 4 years ago
- Source code for 'PySpark Recipes' by Raju Kumar Mishra☆25Updated 5 years ago
- A presentation on the key points to consider when preparing for a data science job interview☆40Updated 6 years ago
- Code to build a simple analytics data pipeline with Python☆102Updated 8 years ago
- Some AWS EMR examples☆16Updated 7 years ago
- Create data models, build data warehouses and data lakes, automate data pipelines, and work with massive datasets.☆13Updated 5 years ago
- Systems Puzzle for the Insight DevOps Engineering program☆5Updated 6 years ago
- Real-time fraud detection in Venmo payments.☆27Updated 7 years ago
- Simple sentiment analysis model with PySpark☆42Updated 7 years ago
- Build an end-to-end machine learning pipeline to predict accessibility of playgrounds in NYC☆15Updated 4 years ago
- Sharing interesting and noteworthy Data Engineering content☆67Updated 8 years ago
- Real-time social media data analytics with Apache Spark, Python, Kafka, Pandas, etc.☆51Updated 8 years ago
- Use Kafka and Apache Spark Streaming to perform clickstream analytics☆76Updated 5 years ago
- Example of using Airflow to schedule downloading data from S3 and launching Spark jobs☆15Updated 8 years ago
- ☆63Updated 6 years ago
- 🛠️ My solutions to Datacamp Projects☆9Updated 6 years ago
- Contains code and presentation for my interactive hack session, 'Effective Feature Engineering: A Structured Approach to Building Better …☆30Updated 4 years ago
- In the Data Science and Engineering program, engineering professionals combine the skills of software programmer, database manager, and s…☆27Updated 7 years ago
- Build and Deploy Machine Learning Models on the Cloud☆17Updated 7 years ago
- Example custom model image trainable and distributable via AWS SageMaker☆36Updated last year
- Data Engineering Capstone☆16Updated 5 years ago
- Data Science Mini-Projects with Python☆15Updated 7 years ago
- Examples and assignments discussed in the data science course taken at Algorithmica.☆108Updated 8 months ago
- ☆46Updated 3 years ago