gpestana / the-zen-of-data-pipelines
exploring how to design, build and maintaining sane data pipelines
☆22Updated 7 years ago
Alternatives and similar repositories for the-zen-of-data-pipelines:
Users that are interested in the-zen-of-data-pipelines are comparing it to the libraries listed below
- Interesting papers I'd like to implement (or at least have implementations of)☆122Updated 3 years ago
- Personal collection of interesting links & research papers related to data science (particularly in the area of Deep Learning)☆41Updated 9 years ago
- Hadoop training material from free MapR courses.☆54Updated 7 years ago
- Content for architecting a data science platform for products using Luigi, Spark & Flask.☆163Updated 5 years ago
- The ultimate twitter streaming data collector☆40Updated 8 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc.☆58Updated 3 years ago
- Quickly detect already witnessed data.☆157Updated 6 months ago
- Material for some talks I have given☆62Updated 4 months ago
- All Kaggle competitions☆91Updated 8 years ago
- We're All Database Engineers☆15Updated 8 years ago
- An analysis of historical Hacker News data to determine the ranking algorithm☆85Updated 7 years ago
- Implementations of mathematical functions, formulas and concepts☆92Updated 5 years ago
- Sharing interesting and noteworthy Data Engineering content☆66Updated 8 years ago
- Example scripts for various deep learning APIs.☆28Updated 9 years ago
- T4 is now in production as Quilt 3☆64Updated 5 years ago
- A general-purpose data analysis engine radically changing the way batch and stream data is processed☆7Updated 6 years ago
- Topic Modeling over Paul Graham's essays☆12Updated 6 years ago
- Sample repo for luigi tasks & config☆36Updated 8 years ago
- ☆24Updated 6 years ago
- A module which fairly distributes a list of arbitrary objects among a set of targets, considering weights.☆77Updated 7 years ago
- beer recommendation engine project for Metis☆18Updated 2 years ago
- Download *ALL* the submissions from Hacker News☆50Updated 10 years ago
- A Python implementation of a political forecasting model by Scholz, Calbert & Smith.☆11Updated 8 years ago
- Good reads for an overall understanding of distributed systems. SBU Fall 2014.☆42Updated 9 years ago
- Bloom Filter Demo☆21Updated 7 years ago
- A smart bot for Slack that helps users stay on topic.☆54Updated 7 years ago
- Web client for Babelfish server☆23Updated 2 years ago
- Collection of pointers to slides and repositories from speakers at PyData Berlin 2016☆37Updated 8 years ago
- A simple data consistency checker☆30Updated 8 years ago
- Log mailer is a program I made to email log files.☆46Updated 6 years ago