InsightDataScience / Awesome-Data-Engineering-Content
Sharing interesting and noteworthy Data Engineering content
☆65Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for Awesome-Data-Engineering-Content
- Repo to migrate old wiki to, esp for devs and code examples☆185Updated 8 years ago
- An API to Analyze Cab GeoLocation Data and a Simulated App for finding an available cab in Real-Time☆63Updated 9 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆66Updated 8 years ago
- Challenge for those applying to the Software Engineer, Big Data position☆34Updated 13 years ago
- AWS Big Data Certification☆25Updated last year
- ☆26Updated 7 years ago
- How to build an awesome data engineering team☆99Updated 5 years ago
- ☆37Updated 8 years ago
- A simple introduction to using spark ml pipelines☆26Updated 6 years ago
- pyspark sample scripts☆17Updated 5 years ago
- Learn the pyspark API through pictures and simple examples☆168Updated 3 years ago
- Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University☆154Updated last week
- ☆26Updated 10 months ago
- Real-world Spark pipelines examples☆83Updated 6 years ago
- ETL with Python - Taught at DWH course 2017 (TAU)☆101Updated 7 years ago
- Udacity Data Pipeline Exercises☆15Updated 4 years ago
- This repository contains code examples for the course CS 20SI: TensorFlow for Deep Learning Research.☆12Updated 7 years ago
- Updated repository☆157Updated 2 years ago
- Examples To Help You Learn Apache Spark☆77Updated 6 years ago
- Course materials for my data pipeline video course with O'Reilly☆194Updated 7 years ago
- Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets.☆13Updated 5 years ago
- A curated list of data science blogs☆44Updated 5 years ago
- Educational notes,Hands on problems w/ solutions for hadoop ecosystem☆86Updated 5 years ago
- Appendix☆14Updated 9 years ago
- Frank Kane's Taming Big Data with Apache Spark and Python, published by Packt☆118Updated last year
- This repository contains code files specifically IPython notebooks for the assignments in the course "Introduction to Big Data with Apach…☆114Updated 3 months ago
- Repository used for Spark Trainings☆53Updated last year
- Source material for Data Science for Telecom Tutorial at Strata Singapore 2015☆102Updated 8 years ago