gpestana / the-zen-of-data-pipelines
exploring how to design, build and maintaining sane data pipelines
☆22Updated 7 years ago
Alternatives and similar repositories for the-zen-of-data-pipelines:
Users that are interested in the-zen-of-data-pipelines are comparing it to the libraries listed below
- Sharing interesting and noteworthy Data Engineering content☆66Updated 8 years ago
- A primer for data science tools in Python☆56Updated 5 years ago
- Personal collection of interesting links & research papers related to data science (particularly in the area of Deep Learning)☆41Updated 9 years ago
- Interesting papers I'd like to implement (or at least have implementations of)☆122Updated 3 years ago
- Hadoop training material from free MapR courses.☆54Updated 7 years ago
- A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support☆262Updated 7 years ago
- Reading list of research papers on blockchains, P2P networks, consensus etc☆80Updated 8 years ago
- Content for architecting a data science platform for products using Luigi, Spark & Flask.☆163Updated 5 years ago
- Schedule for talks, workshops, etc. w/ links to past talk slides and videos.☆26Updated 7 years ago
- Example scripts for various deep learning APIs.☆28Updated 9 years ago
- Implementations of mathematical functions, formulas and concepts☆92Updated 5 years ago
- beer recommendation engine project for Metis☆18Updated 2 years ago
- Material for some talks I have given☆62Updated 5 months ago
- Python CLI to apply word2vec to all sorts of text documents.☆48Updated 7 years ago
- decentralized defense against changing data☆18Updated 8 years ago
- IPython notebooks of common data structures and algorithms☆73Updated 8 years ago
- ☆46Updated 7 years ago
- Collection of pointers to slides and repositories from speakers at PyData Berlin 2016☆37Updated 8 years ago
- The ultimate twitter streaming data collector☆40Updated 8 years ago
- ☆66Updated 7 years ago
- Code for PyData Talk on "Classifying Products Based on Images and Text using Keras"☆30Updated 7 years ago
- A Python implementation of a political forecasting model by Scholz, Calbert & Smith.☆11Updated 8 years ago
- Javascript Developer library for interacting with Computable Protocol☆28Updated 2 years ago
- Docker-izing Data Science Applications CodeLab for QCon AI 2018☆13Updated 6 years ago
- Time series analysis with Apache Spark based on Chronix |☆38Updated 7 years ago
- Anomaly detection training suite☆119Updated 9 years ago
- Machine Learning papers from arXiv hosted on IPFS website.☆45Updated 7 years ago
- A curated list of awesome datasets for papers/experiments/validation.☆90Updated 8 years ago
- Good reads for an overall understanding of distributed systems. SBU Fall 2014.☆42Updated 9 years ago
- Python forecasting and smoothing library☆67Updated 5 years ago