vsmolyakov / pysparkLinks
spark (scala and python)
☆18Updated 5 years ago
Alternatives and similar repositories for pyspark
Users that are interested in pyspark are comparing it to the libraries listed below
Sorting:
- ☆26Updated last year
- ☆15Updated 2 years ago
- ☆30Updated 7 years ago
- Brian Farris' Talk on Reinforcement Learning and Multi-Armed Bandits for the Data Incubator☆30Updated 7 years ago
- Tutorial repo for the article "ML in Production"☆30Updated 2 years ago
- pyspark sample scripts☆17Updated 6 years ago
- My work on UCSD CSE 250B Principles of Artificial Intelligence: Learning Algorithms☆13Updated 5 years ago
- Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc☆51Updated 8 years ago
- Public course material☆35Updated 6 years ago
- Predict taxi trip duration based on historical trips using automated feature engineering☆62Updated 4 years ago
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag☆29Updated 9 years ago
- ☆16Updated 4 years ago
- Code to 1) scrap wikipedia page view counts, and to 2) conduct time series analysis with GAM☆47Updated 7 years ago
- This is a work in progress Pytorch implementation of the recently proposed ES-RNN by Slawek Smyl, winner of the M4 competition☆12Updated 6 years ago
- Spark and Python (PySpark) Examples☆39Updated 3 years ago
- 32/2384 Solution to Kaggle Mercari Competition (solo silver medal winner)☆21Updated 7 years ago
- Data Science and Machine Learning with Python - Hands On from Udemy☆14Updated 8 years ago
- Bayesian statistics seminars☆30Updated 8 years ago
- Sample Notebooks for PipelineAI☆44Updated 2 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data☆20Updated 7 years ago
- PySpark Machine Learning Examples☆44Updated 7 years ago
- Show how to perform fast retraining with LightGBM in different business cases☆54Updated 5 years ago
- Building an API with the FastAPI framework to serve a scikit-learn model.☆18Updated 6 years ago
- This is all my random garbage.☆26Updated last year
- notebooks for nlp-on-spark☆13Updated 8 years ago
- Predict the poverty of households in Costa Rica using automated feature engineering.☆23Updated 4 years ago
- Materials for Machine Learning with H2O Open Platform at ODSC Masterclass Summit 2017☆12Updated 8 years ago
- Machines and people collaborating together through Jupyter notebooks.☆18Updated 7 years ago
- helpful resources for (big) data science☆33Updated 3 years ago
- These are the IPython notebook files for the CSC 432 Spring '13 course.☆23Updated 10 years ago