Code for Packt Publishing's Spark for Data Science Cookbook.
☆22Jun 19, 2017Updated 8 years ago
Alternatives and similar repositories for SparkforDataScienceCookbook
Users that are interested in SparkforDataScienceCookbook are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Master complex big data processing, stream analytics, and machine learning with Apache Spark☆18Jan 30, 2023Updated 3 years ago
- Mastering Spark for Data Science, published by Packt☆50Apr 22, 2026Updated last month
- ☆104Nov 26, 2019Updated 6 years ago
- High-Performance Computing with Python 3.x, published by Packt☆15Dec 15, 2025Updated 5 months ago
- ☆26Jan 2, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆24Apr 29, 2016Updated 10 years ago
- Apache Spark 2 for Beginners, published by Packt☆33Oct 31, 2022Updated 3 years ago
- Multi Channel Attribution☆10Mar 7, 2017Updated 9 years ago
- A standalone Magento DevOps environment built with Vagrant and Puppet from a vanilla Ubuntu 12.04 LTS box.☆39Feb 10, 2014Updated 12 years ago
- Cis Recommender☆16May 1, 2012Updated 14 years ago
- Data Science and Machine Learning with Python - Hands On from Udemy☆14May 11, 2017Updated 9 years ago
- Play framework template based on SB-Admin-2☆13Mar 13, 2015Updated 11 years ago
- This example uses the lightfm recommender system library to train a hybrid content-based + collaborative algorithm that uses the WARP los…☆10Mar 24, 2017Updated 9 years ago
- ☆10Aug 20, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Introduction to Data Science - Bill Howe -- Spring 2013/4☆13Sep 17, 2014Updated 11 years ago
- Using Twitter Sentiment Analysis Data & Santiment's Blockchain Activity Data to to Multivariate Time Series Forecasting on Altcoin's USD …☆14Feb 10, 2023Updated 3 years ago
- An example of how to create modules in Play 2.4.x or 2.3.x☆12Jul 27, 2015Updated 10 years ago
- Google Analytics plugin for sending events to Snowplow☆17Sep 30, 2020Updated 5 years ago
- SmartFD: Efficient and Scalable Functional Dependency Discovery on Distributed Data-Parallel Platforms☆18Aug 23, 2018Updated 7 years ago
- web crawler☆14Sep 27, 2022Updated 3 years ago
- Apache Spark 2x Machine Learning Cookbook, published by Packt☆33Jul 23, 2025Updated 10 months ago
- It consists of all code examples discussed as part of deep learning course taken at algorithmica☆11Oct 1, 2020Updated 5 years ago
- Spark with Scala example projects☆34Apr 17, 2019Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- My terminal setup and config files☆14Oct 30, 2018Updated 7 years ago
- Simple sentiment analysis model with PySpark☆43Mar 13, 2018Updated 8 years ago
- Model explanation provides the ability to interpret the effect of the predictors on the composition of an individual score.☆13Jan 21, 2021Updated 5 years ago
- ☆11Jan 30, 2023Updated 3 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆40Jun 29, 2017Updated 8 years ago
- A genetic algorithm to optimize your baseball and football daily fantasy sports lineups☆16Nov 10, 2017Updated 8 years ago
- Sequence to Sequence Learning Model☆14Jan 9, 2016Updated 10 years ago
- A backgammon game using scala, the play-framework, websockets, and d3.js☆23Dec 1, 2012Updated 13 years ago
- Accept Stripe payments in Magento 1☆20Jun 18, 2018Updated 7 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Ambari Service definition for an Jupyter (IPython3) Notebook service☆41Jul 1, 2016Updated 9 years ago
- Anomaly Detection in Network Traffic using different clustering algorithm.☆18Jun 8, 2017Updated 9 years ago
- The data pipeline services extracting & transforming data from our museum and collections.☆16Updated this week
- Churn Prediction with PySpark using MLlib and ML Packages☆58Feb 4, 2016Updated 10 years ago
- Detect duplicated items。内容排重框架。☆11Apr 30, 2015Updated 11 years ago
- Advanced Layered Navigation for Magento2☆18May 26, 2017Updated 9 years ago
- A Scalable Data Cleaning Library for PySpark.☆29Apr 4, 2019Updated 7 years ago