gpestana / the-zen-of-data-pipelines
Exploring how to design, build, and maintain sane data pipelines
☆22 · Updated 6 years ago
Related projects
Alternatives and complementary repositories for the-zen-of-data-pipelines
- A Python implementation of a political forecasting model by Scholz, Calbert & Smith. ☆11 · Updated 8 years ago
- Topic Modeling over Paul Graham's essays ☆12 · Updated 6 years ago
- A module which fairly distributes a list of arbitrary objects among a set of targets, considering weights. ☆77 · Updated 7 years ago
- A smart bot for Slack that helps users stay on topic. ☆54 · Updated 7 years ago
- Deep learning certificate part 1 ☆10 · Updated 2 years ago
- Example scripts for various deep learning APIs. ☆28 · Updated 9 years ago
- Notebook version of an article on the Fast Forward Labs blog ☆61 · Updated 7 years ago
- An analysis of historical Hacker News data to determine the ranking algorithm ☆85 · Updated 7 years ago
- Hadoop training material from free MapR courses. ☆54 · Updated 7 years ago
- Beer recommendation engine project for Metis ☆18 · Updated 2 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc. ☆57 · Updated 3 years ago
- JavaScript developer library for interacting with Computable Protocol ☆28 · Updated last year
- Material for some talks I have given ☆62 · Updated last month
- Quickly detect already witnessed data. ☆157 · Updated 3 months ago
- ☆10 · Updated 8 years ago
- This repository contains research conducted at Vocapouch that we want to share with the world. ☆22 · Updated 7 years ago
- The ultimate Twitter streaming data collector ☆40 · Updated 8 years ago
- Geo-Located Data: Extracting Patterns from Mobile Data using Scikit-Learn and Cassandra ☆30 · Updated 6 years ago
- A polite, minimal interface for sending Python objects to and from Amazon S3. ☆57 · Updated 8 years ago
- Quick informal survey at the Los Angeles Machine Learning meetup about tools used for machine learning. ☆51 · Updated 9 years ago
- Datasets and notebooks ☆13 · Updated 8 years ago
- Generating the next read for our book club, with data science! ☆40 · Updated 8 years ago
- Presentation and code from the Cryptocurrency Technical Trading strategy meeting, Dec 7th 2017 ☆16 · Updated 6 years ago
- Luigi Plugin for Hubot ☆35 · Updated 8 years ago
- knyfe is a Python utility for rapid exploration of datasets. ☆54 · Updated 9 years ago
- Introduction to common Probabilistic Algorithms: Approximate Counting, Flajolet-Martin, LogLog, HyperLogLog, Bloom Filters ☆60 · Updated 7 years ago
- Load a LinkedIn network with Python's py2neo into a Neo4j database, serve it via Node.js, and display it with sigma.js ☆29 · Updated 11 years ago
- ☆24 · Updated 8 years ago