AvantFinCo / data-engineer-interview
Do you have what it takes to be an Avant data engineer?
☆13Updated 9 years ago
Alternatives and similar repositories for data-engineer-interview:
Users that are interested in data-engineer-interview are comparing it to the libraries listed below
- Source Material for using Python and Hadoop together☆13Updated 7 years ago
- Size of datasets used for analytics based on 10 years of surveys by KDnuggets.☆16Updated 9 years ago
- ☆24Updated 6 years ago
- ☆34Updated 8 years ago
- Deep Learning for Pugs☆74Updated 7 years ago
- Materials for my PyData Seattle talk☆21Updated 9 years ago
- Articles on Data Science, Jupyter, and Pandas☆18Updated 9 years ago
- Site for a Data Science class taught by Allen Downey☆44Updated 2 years ago
- A collection of data science examples implemented across a variety of languages and libraries.☆33Updated 9 years ago
- Updated 9 years ago
- A short guide for transitioning from Python to Scala☆65Updated 9 years ago
- spark backend for dplyr☆48Updated 9 years ago
- A couple projects using scikit-learn illustrating project decision making.☆15Updated 8 years ago
- Common post-estimation tasks for scikit-learn☆17Updated 8 years ago
- Showcase for using H2O and R for churn prediction (inspired by ZhouFang928 examples)☆58Updated 7 years ago
- Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for…☆20Updated 7 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52Updated 8 years ago
- EuroScipy 2014 tutorial: Introduction to predictive analytics with pandas and scikit-learn☆84Updated 10 years ago
- Sample repo for luigi tasks & config☆36Updated 8 years ago
- ☆11Updated 8 years ago
- Materials for dask talk at PyData NYC☆15Updated 9 years ago
- Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016☆26Updated 8 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- Tuning GBMs (hyperparameter tuning) and impact on out-of-sample predictions☆21Updated 7 years ago
- Wiki of links and data science resources started in datascientists.slack.com☆14Updated 9 years ago
- Materials fort Strata NYC 2016 scikit-learn tutorial☆15Updated 8 years ago
- Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn.☆69Updated 10 years ago
- open source version of the Bonsai library☆26Updated 9 years ago
- Scripts to Analyze Pronto's Data Release☆24Updated 9 years ago
- Fast, accurate, lightweight, multi-core ML in Python, leveraging Vowpal Wabbit☆21Updated 6 years ago