vsmolyakov / pysparkLinks
spark (scala and python)
☆18Updated 6 years ago
Alternatives and similar repositories for pyspark
Users that are interested in pyspark are comparing it to the libraries listed below
Sorting:
- This is all my random garbage.☆26Updated 2 years ago
- ☆30Updated 8 years ago
- Predict taxi trip duration based on historical trips using automated feature engineering☆62Updated 5 years ago
- basic pandas tutorials☆52Updated 8 years ago
- Bayesian statistics seminars☆29Updated 8 years ago
- Set of iPython and Jupyter extensions to improve user experience☆50Updated 6 years ago
- Code to 1) scrap wikipedia page view counts, and to 2) conduct time series analysis with GAM☆47Updated 8 years ago
- ☆26Updated last year
- Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc☆52Updated 9 years ago
- A tutorial to create python based prediction web app☆30Updated 5 years ago
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn fr…☆57Updated 4 years ago
- Slides and notebooks for my tutorial at PyData London 2018☆21Updated 7 years ago
- Accelerate data science☆117Updated 4 years ago
- ☆14Updated 6 years ago
- Tutorial repo for the article "ML in Production"☆30Updated 2 years ago
- Code repository supporting the medium blog☆12Updated 5 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- big data technologies comparisons for cleaning, manipulating and generally wrangling data in purpose of analysis and machine learning.☆65Updated 5 years ago
- Slides and materials for most of my talks by year☆92Updated 2 years ago
- Automated Exploratory Data Analysis. Simplifying Data Exploration☆36Updated 5 years ago
- Contains code for understanding TensorFlow workflow and basics☆51Updated 7 years ago
- Files for Python Talk☆24Updated 9 years ago
- Pandas integration with sklearn☆21Updated 8 years ago
- REST API (and possible UI) for Machine Learning workflows☆61Updated 6 years ago
- Python package for dynamic system estimation of time series☆40Updated 5 years ago
- Just a boilerplate for PySpark and Flask☆35Updated 7 years ago
- Doing Bayesian statistics in Python!☆67Updated 7 years ago
- Repo for PyData 2019 Tutorial - New Trends in Estimation and Inference☆26Updated 6 years ago
- ☆19Updated 4 years ago
- Predict whether a student will correctly answer a problem based on past performance using automated feature engineering☆32Updated 5 years ago