vsmolyakov / pyspark
spark (scala and python)
☆18Updated 5 years ago
Alternatives and similar repositories for pyspark:
Users that are interested in pyspark are comparing it to the libraries listed below
- ☆15Updated 2 years ago
- An example of how the LIME algorithm can be used to provide real-world insight into the decision processes of a 'black-box' machine learn…☆15Updated 6 years ago
- pyspark sample scripts☆17Updated 6 years ago
- feng - feature engineering for machine-learning champions☆27Updated 8 years ago
- Bayesian statistics seminars☆30Updated 7 years ago
- ☆30Updated 7 years ago
- Data Science and Machine Learning with Python - Hands On from Udemy☆14Updated 7 years ago
- Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc☆51Updated 8 years ago
- ☆15Updated 6 years ago
- I am teaching a Learning ML workshop for some folks @ Belong.co. Creating this repo to organise the course material.☆23Updated 6 years ago
- Teaching materials for the text analytics course☆19Updated 6 years ago
- Brian Farris' Talk on Reinforcement Learning and Multi-Armed Bandits for the Data Incubator☆30Updated 6 years ago
- A Scalable Data Cleaning Library for PySpark.☆26Updated 5 years ago
- Machine Learning encoders for feature transformation & engineering: target encoder, weight of evidence, label encoder.☆23Updated 4 years ago
- Companion code for my PyData talk: "Introduction to Probabilistic Programming with PyMC3"☆13Updated 5 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Updated 7 years ago
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag☆29Updated 8 years ago
- Spark and Python (PySpark) Examples☆39Updated 3 years ago
- Repository for medium article☆22Updated last year
- Techniques & resources for training interpretable ML models, explaining ML models, and debugging ML models.☆21Updated 2 years ago
- Python package for dynamic system estimation of time series☆40Updated 4 years ago
- Sample Notebooks for PipelineAI☆44Updated 2 years ago
- Using Luigi to create a Machine Learning Pipeline using the Rossman Sales data from Kaggle☆33Updated 8 years ago
- Pydata Dallas 2015 Scikit-Learn Tutorial☆62Updated 9 years ago
- Sample techniques for a variety of feature extraction methods☆32Updated 3 years ago
- Project template for highly effective data science workflows☆29Updated 11 months ago
- Repo for PyData 2019 Tutorial - New Trends in Estimation and Inference☆25Updated 5 years ago
- Material and slides for Boston NLP meetup May 23rd 2016☆17Updated 8 years ago
- Pandas integration with sklearn☆21Updated 8 years ago
- Material for UW Extension Data Science 350☆19Updated 7 years ago