wrobstory / DataEngArchSimple
PDX Data Science Meetup March 2016 Presentation
☆10 Updated 8 years ago
Alternatives and similar repositories for DataEngArchSimple:
Users that are interested in DataEngArchSimple are comparing it to the libraries listed below
- Source Material for using Python and Hadoop together ☆13 Updated 7 years ago
- PyData Seattle 2015: Python Data Bikeshed ☆127 Updated 9 years ago
- Materials for my PyData Seattle talk ☆21 Updated 9 years ago
- Presentation at Perth Data Science Meetup, February 2015 ☆72 Updated 9 years ago
- Small R package for accessing Redshift ☆68 Updated 8 years ago
- PyData NYC 2015 conference ☆94 Updated 9 years ago
- ☆34 Updated 8 years ago
- Sample repo for luigi tasks & config ☆36 Updated 8 years ago
- My talk at Strata 2014 in Santa Clara, CA ☆73 Updated 11 years ago
- Portland Python Meetup March 2015 ☆40 Updated 9 years ago
- Repository for exploratory data transformation & visualization talk ☆27 Updated 8 years ago
- Docker container with a PyData stack and JupyterHub server ☆37 Updated 8 years ago
- A Python library for creating fast, repeatable and self-documenting data analysis pipelines. ☆239 Updated last week
- Modeling Social Data, Applied Mathematics, Columbia University (Spring 2015) ☆33 Updated 5 years ago
- VM Setup stuff for http://bit.ly/22giU4y ☆9 Updated 8 years ago
- repository for code related to the end-to-end data analysis in python workshop, from the Open Data Science Conference 2015 ☆15 Updated 9 years ago
- Template for creating a tweetbot with AWS Lambda ☆39 Updated 7 years ago
- An external PySpark module that works like R's read.csv or pandas' read_csv, with automatic type inference and null value handling. Parse… ☆89 Updated 9 years ago
- Material for some talks I have given ☆62 Updated 5 months ago
- A dashboard of key metrics for the USA ☆69 Updated 6 years ago
- spark backend for dplyr ☆48 Updated 9 years ago
- Code for Pythonic visualization blog post ☆40 Updated 7 years ago
- field experiments tutorial ☆27 Updated 10 years ago
- Arbalest is a Python data pipeline orchestration library for Amazon S3 and Amazon Redshift. It automates data import into Redshift and ma… ☆41 Updated 9 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for… ☆20 Updated 7 years ago
- A Topic Modeling toolbox ☆92 Updated 8 years ago
- JSON -> Relational DB Column Types ☆63 Updated 2 years ago
- A couple projects using scikit-learn illustrating project decision making. ☆15 Updated 8 years ago
- PDF and python files for creating time maps and downloading tweets ☆59 Updated 4 years ago
- ☆84 Updated 6 years ago