vsmolyakov / pysparkLinks
spark (scala and python)
☆18Updated 5 years ago
Alternatives and similar repositories for pyspark
Users that are interested in pyspark are comparing it to the libraries listed below
Sorting:
- Techniques & resources for training interpretable ML models, explaining ML models, and debugging ML models.☆21Updated 3 years ago
- Sky Cast: A Comparison of Modern Techniques for Forecasting Time Series☆68Updated 7 years ago
- Slides and materials for most of my talks by year☆92Updated 2 years ago
- ☆30Updated 7 years ago
- Repo for PyData 2019 Tutorial - New Trends in Estimation and Inference☆26Updated 6 years ago
- Code to 1) scrap wikipedia page view counts, and to 2) conduct time series analysis with GAM☆47Updated 8 years ago
- Multiple linear regression with statistical inference, residual analysis, direct CSV loading, and other features☆34Updated 6 years ago
- Common data science and data engineering utilities to help us perform analytics. Our toolbox for data scientists, licensed under Apache-2…☆30Updated 7 years ago
- Visualization ideas for data science☆20Updated 7 years ago
- DeepLearningfromScratch2018☆20Updated 7 years ago
- Contains code for understanding TensorFlow workflow and basics☆51Updated 7 years ago
- KnowledgeRepo + JupyterLab☆48Updated 2 weeks ago
- Doing Bayesian statistics in Python!☆67Updated 7 years ago
- Automated Data Science and Machine Learning library to optimize workflow.☆105Updated 2 years ago
- Example PyMC3 project for performing Bayesian data analysis using a probabilistic programming approach to machine learning.☆105Updated 6 years ago
- Bayesian statistics seminars☆29Updated 8 years ago
- Slides and notebooks for my tutorial at PyData London 2018☆21Updated 7 years ago
- ☆16Updated 6 years ago
- Tutorial covering a new workflow available going from pandas to scikit-learn☆39Updated 2 years ago
- This is all my random garbage.☆26Updated 2 years ago
- ☆90Updated 4 years ago
- Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc☆51Updated 9 years ago
- Work for Mastering Large Datasets with Python☆20Updated 2 years ago
- ☆26Updated 8 years ago
- Survival Analysis with non-parametric, semi-parametric, and parametric models☆40Updated 7 years ago
- Predict taxi trip duration based on historical trips using automated feature engineering☆62Updated 5 years ago
- Tutorial on multilevel modeling, using Gelman radon example☆58Updated 10 years ago
- PyDataLondonTutorial☆26Updated 9 years ago
- Deploy AutoML as a service using Flask☆226Updated 8 years ago
- Bayesian Inference and parameter estimation in quant finance.☆43Updated 6 years ago