Sharing interesting and noteworthy Data Engineering content
☆69 · Oct 21, 2016 · Updated 9 years ago
Alternatives and similar repositories for Awesome-Data-Engineering-Content
Users that are interested in Awesome-Data-Engineering-Content are comparing it to the libraries listed below.
- Repo to migrate old wiki to, esp for devs and code examples ☆182 · Oct 18, 2016 · Updated 9 years ago
- ☆14 · Jun 27, 2017 · Updated 8 years ago
- vinyl recommendation engine based on Discogs and engineered data ☆16 · Jul 3, 2017 · Updated 8 years ago
- Introduction to data analysis using Pandas ☆13 · Nov 20, 2016 · Updated 9 years ago
- natural language processing with link-grammar ☆18 · Sep 30, 2009 · Updated 16 years ago
- A curated list of data engineering tools for software developers ☆8,692 · May 14, 2026 · Updated 3 weeks ago
- How to build an awesome data engineering team ☆101 · Sep 11, 2019 · Updated 6 years ago
- This repo outlines a method for differentiating between anomalies and expected outliers using the Microsoft Anomaly Detection API and Bin… ☆10 · Jun 11, 2017 · Updated 8 years ago
- Demo of embedding words and documents into vector space ☆18 · Aug 20, 2024 · Updated last year
- Some thoughts on how to use machine learning in production ☆71 · May 17, 2017 · Updated 9 years ago
- Amazon Redshift offers a common query interface against data stored in fast, local storage as well as data from high-capacity, inexpensiv… ☆13 · Nov 26, 2018 · Updated 7 years ago
- Some AWS EMR examples ☆16 · Jan 18, 2018 · Updated 8 years ago
- Collection of my favorite Python packages from 2020 ☆11 · Jan 12, 2021 · Updated 5 years ago
- Python Machine Learning: Tips, Tricks, and Techniques, published by Packt ☆21 · Jan 15, 2021 · Updated 5 years ago
- ☆22 · Sep 10, 2018 · Updated 7 years ago
- A list of all projects by UW CSE students. ☆10 · Feb 8, 2016 · Updated 10 years ago
- distributed rate limiter for traffic control ☆12 · Feb 2, 2018 · Updated 8 years ago
- PyCon 2016 Tutorial Session -- Making Connections with Natural Language Processing ☆12 · May 26, 2016 · Updated 10 years ago
- ☆16 · Oct 23, 2019 · Updated 6 years ago
- ☆25 · Aug 23, 2017 · Updated 8 years ago
- Hello world for writing Ethereum apps! ☆11 · Oct 19, 2017 · Updated 8 years ago
- Machine learning ☆11 · Jan 12, 2018 · Updated 8 years ago
- A collection of resources about deep reinforcement learning ☆26 · Feb 24, 2017 · Updated 9 years ago
- A Serverless function for posting to a Slack Webhook in response to a Mailgun route ☆11 · Oct 12, 2016 · Updated 9 years ago
- ☆22 · Jul 31, 2019 · Updated 6 years ago
- Welcome to my independent research repository! ☆17 · Nov 18, 2016 · Updated 9 years ago
- ☆10 · May 3, 2025 · Updated last year
- Big Data for Data Engineers Coursera Specialization from Yandex ☆99 · Mar 15, 2023 · Updated 3 years ago
- Data Mining and Analytics in Intelligent Business Services, UC Berkeley School of Information ☆20 · May 17, 2013 · Updated 13 years ago
- A flask based python app offering flight recommendations ☆10 · Sep 26, 2016 · Updated 9 years ago
- Data Pipeline is a Python application for replicating data from source to target databases ☆18 · Nov 1, 2017 · Updated 8 years ago
- Notes, Ideas, and Projects related to my Springboard data science career track ☆11 · Jun 23, 2017 · Updated 8 years ago
- ☆15 · Jul 22, 2018 · Updated 7 years ago
- Capture, save, and analyze AWS Redshift performance metrics ☆17 · Oct 6, 2017 · Updated 8 years ago
- Random implementation notes ☆34 · Apr 23, 2013 · Updated 13 years ago
- Build end-to-end Machine Learning pipeline to predict accessibility of playgrounds in NYC ☆15 · Jul 9, 2020 · Updated 5 years ago
- A very WIP decompilation of Shin Megami Tensei 1 for the Playstation. ☆13 · Aug 23, 2025 · Updated 9 months ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP ☆15 · Oct 12, 2016 · Updated 9 years ago
- A self-paced workshop designed to allow you to get hands on with building a real-time data platform using serverless technologies such as… ☆22 · Dec 3, 2018 · Updated 7 years ago