AvantFinCo / data-engineer-interview
Do you have what it takes to be an Avant data engineer?
☆13Updated 9 years ago
Alternatives and similar repositories for data-engineer-interview:
Users that are interested in data-engineer-interview are comparing it to the libraries listed below
- Size of datasets used for analytics based on 10 years of surveys by KDnuggets.☆16Updated 9 years ago
- Source Material for using Python and Hadoop together☆13Updated 7 years ago
- A couple projects using scikit-learn illustrating project decision making.☆15Updated 8 years ago
- ☆34Updated 8 years ago
- Updated 9 years ago
- spark backend for dplyr☆48Updated 9 years ago
- Apache Zeppelin on Kubernetes.☆28Updated 5 years ago
- ☆24Updated 6 years ago
- Analyzing Clickstream Data using Markov Chains and data mining SPACE algorithm☆29Updated 6 years ago
- A collection of data science examples implemented across a variety of languages and libraries.☆33Updated 9 years ago
- A short guide for transitioning from Python to Scala☆65Updated 9 years ago
- Small R package for accessing Redshift☆68Updated 8 years ago
- PolYamoR is the first forward-reverse automated translation system between Python and R☆16Updated 7 years ago
- Sample repo for luigi tasks & config☆36Updated 8 years ago
- Analytics on Apache Projects for Diversity☆18Updated 5 years ago
- Common post-estimation tasks for scikit-learn☆17Updated 8 years ago
- Materials for dask talk at PyData NYC☆15Updated 9 years ago
- Materials for the R Programming Workshop that I teach at The University of Chicago☆34Updated 9 years ago
- Showcase for using H2O and R for churn prediction (inspired by ZhouFang928 examples)☆58Updated 7 years ago
- open source version of the Bonsai library☆26Updated 8 years ago
- Simple scripts to setup a fresh data science box using an Ubuntu 12.04.* LTS 64-bit server running on an EC2☆163Updated 11 years ago
- Supporting materials/code examples for my course in data engineering for machine learning.☆38Updated 2 years ago
- Because you're computing conversion rates wrong☆16Updated 7 years ago
- PyData Madrid 2016 material for the talk: A Primer to recommendation Systems☆37Updated 8 years ago
- ☆20Updated 7 years ago
- An in depth tutorial on sklearn's Pipeline and FeatureUnion classes.☆16Updated 7 years ago
- Materials for my PyData Seattle talk☆21Updated 9 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for…☆20Updated 7 years ago
- Materials for the PyData San Francisco 2016 visualization tutorial☆14Updated 8 years ago
- 12 Week Data Science Immersive☆27Updated 9 years ago