Sharing interesting and noteworthy Data Engineering content
☆70Oct 21, 2016Updated 9 years ago
Alternatives and similar repositories for Awesome-Data-Engineering-Content
Users that are interested in Awesome-Data-Engineering-Content are comparing it to the libraries listed below
Sorting:
- Repo to migrate old wiki to, esp for devs and code examples☆183Oct 18, 2016Updated 9 years ago
- ☆14Jun 27, 2017Updated 8 years ago
- vinyl recommendation engine based on Discogs and engineered data☆16Jul 3, 2017Updated 8 years ago
- Challenge for those applying to the Software Engineer, Big Data position☆35Oct 12, 2011Updated 14 years ago
- A tutorial for using Hadoop with Python and Hive☆10May 26, 2015Updated 10 years ago
- Miscellaneous Projects☆16Sep 20, 2020Updated 5 years ago
- A curated list of data engineering tools for software developers☆8,385Feb 21, 2026Updated 3 weeks ago
- How to build an awesome data engineering team☆101Sep 11, 2019Updated 6 years ago
- ☆11Jan 20, 2023Updated 3 years ago
- The open source version of the Amazon Redshift Cluster Management Guide.☆48Jun 15, 2023Updated 2 years ago
- Best practices for engineering ML pipelines.☆36Jun 20, 2022Updated 3 years ago
- 😎 Awesome list about all things in front end we use here at Dude☆12Oct 6, 2022Updated 3 years ago
- ☆12Nov 4, 2023Updated 2 years ago
- Solutions for the SQL problems presents in the book "SQL Practice Problems: 57 beginning, intermediate, and advanced challenges for you t…☆52Mar 3, 2020Updated 6 years ago
- This repo outlines a method for differentiating between anomalies and expected outliers using the Microsoft Anomaly Detection API and Bin…☆10Jun 11, 2017Updated 8 years ago
- Some thoughts on how to use machine learning in production☆71May 17, 2017Updated 8 years ago
- Amazon Redshift offers a common query interface against data stored in fast, local storage as well as data from high-capacity, inexpensiv…☆13Nov 26, 2018Updated 7 years ago
- Some AWS EMR examples☆16Jan 18, 2018Updated 8 years ago
- ☆10Feb 15, 2017Updated 9 years ago
- Python library bindings for the Semantics3 APIs☆21Mar 17, 2022Updated 4 years ago
- CLI for creating databases for Data Quality Dashboards.☆19Oct 26, 2019Updated 6 years ago
- distributed rate limiter for traffic control☆13Feb 2, 2018Updated 8 years ago
- PyCon 2016 Tutorial Session -- Making Connections with Natural Language Processing☆12May 26, 2016Updated 9 years ago
- "Programmers are not to be measured by their ingenuity and their logic but by the completeness of their case analysis." ― Alan J. Perlis☆96Jan 8, 2022Updated 4 years ago
- Computing term cooccurrence in MEDLINE☆18Apr 13, 2021Updated 4 years ago
- scripts for personal reference☆19Dec 26, 2022Updated 3 years ago
- ☆16Oct 23, 2019Updated 6 years ago
- ☆26Aug 23, 2017Updated 8 years ago
- Hello world for writing Ethereum apps!☆11Oct 19, 2017Updated 8 years ago
- Signature Extractor☆11Mar 9, 2023Updated 3 years ago
- Machine learning☆11Jan 12, 2018Updated 8 years ago
- Welcome to my independent research repository!☆17Nov 18, 2016Updated 9 years ago
- how I FOIA (and maybe how you can too!)☆21Mar 20, 2018Updated 8 years ago
- Learn to use the Unix command-line tools and Bash shell scripting☆27Apr 25, 2020Updated 5 years ago
- ☆10May 3, 2025Updated 10 months ago
- Big Data for Data Engineers Coursera Specialization from Yandex☆101Mar 15, 2023Updated 3 years ago
- Data Mining and Analytics in Intelligent Business Services, UC Berkeley School of Information☆20May 17, 2013Updated 12 years ago
- ☆11Aug 2, 2019Updated 6 years ago
- Implementation of basic algorithms in Python, including error handling and basic OOP concepts.☆14Oct 17, 2017Updated 8 years ago