Code for Packt Publishing's Spark for Data Science Cookbook.
☆22Jun 19, 2017Updated 8 years ago
Alternatives and similar repositories for SparkforDataScienceCookbook
Users that are interested in SparkforDataScienceCookbook are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for 'Pro Spark Streaming' by Zubair Nabi☆11Mar 27, 2017Updated 9 years ago
- Atomic Scala Book Solutions - for Beginners and first time Functional Programmers☆12Mar 10, 2020Updated 6 years ago
- Master complex big data processing, stream analytics, and machine learning with Apache Spark☆18Jan 30, 2023Updated 3 years ago
- Code repository for Large Scale Machine Learning with Spark by Packt☆20Oct 31, 2022Updated 3 years ago
- ☆104Nov 26, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆26Jan 2, 2024Updated 2 years ago
- ☆24Apr 29, 2016Updated 10 years ago
- Apache Spark 2 for Beginners, published by Packt☆33Oct 31, 2022Updated 3 years ago
- Multi Channel Attribution☆10Mar 7, 2017Updated 9 years ago
- A standalone Magento DevOps environment built with Vagrant and Puppet from a vanilla Ubuntu 12.04 LTS box.☆39Feb 10, 2014Updated 12 years ago
- Code repository for Practical Machine Learning Cookbook, published by Packt☆35Jan 30, 2023Updated 3 years ago
- Data Science and Machine Learning with Python - Hands On from Udemy☆14May 11, 2017Updated 9 years ago
- Building Recommendation Engines by Packt☆34Jan 14, 2021Updated 5 years ago
- This example uses the lightfm recommender system library to train a hybrid content-based + collaborative algorithm that uses the WARP los…☆10Mar 24, 2017Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆10Aug 20, 2018Updated 7 years ago
- Apache Spark 2x Machine Learning Cookbook, published by Packt☆33Jul 23, 2025Updated 10 months ago
- It consists of all code examples discussed as part of deep learning course taken at algorithmica☆11Oct 1, 2020Updated 5 years ago
- My terminal setup and config files☆14Oct 30, 2018Updated 7 years ago
- GitHub notifications tracker for Telegram. Pushes GitHub notifications to Telegram.☆13Jan 25, 2018Updated 8 years ago
- Simple sentiment analysis model with PySpark☆43Mar 13, 2018Updated 8 years ago
- ☆11Jan 30, 2023Updated 3 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆40Jun 29, 2017Updated 8 years ago
- A genetic algorithm to optimize your baseball and football daily fantasy sports lineups☆16Nov 10, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Redshift Proof of Concepts☆10Aug 27, 2015Updated 10 years ago
- ☆11Aug 6, 2018Updated 7 years ago
- Accept Stripe payments in Magento 1☆20Jun 18, 2018Updated 7 years ago
- Inchoo_CustomLinkedProducts Magento module☆28Sep 14, 2015Updated 10 years ago
- Java course from Tallinn Technical University, 2005-2011☆12Dec 26, 2011Updated 14 years ago
- SparkER: an Entity Resolution framework for Apache Spark☆66Mar 29, 2024Updated 2 years ago
- Churn Prediction with PySpark using MLlib and ML Packages☆58Feb 4, 2016Updated 10 years ago
- Dynamic pricing for selling perishable goods☆65Dec 7, 2017Updated 8 years ago
- Call Detail Record (CDR) is the information captured by the telecom companies during the Call, SMS and Internet activity. These informati…☆17Jun 8, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A Scalable Data Cleaning Library for PySpark.☆29Apr 4, 2019Updated 7 years ago
- This repository includes end-to-end labs on how to use GCP for applied data science☆14Aug 28, 2018Updated 7 years ago
- Use Machine Learning to infer best betting strategy to play the NBA fantasy sport game on DraftKings☆20Jun 16, 2022Updated 3 years ago
- Tensor-based Spectral LDA on Spark☆18Jun 5, 2018Updated 7 years ago
- Implementation of a Recommendation Engine for Reddit☆12Nov 19, 2014Updated 11 years ago
- A small demo project that users can deploy to Modulus.io☆18Feb 9, 2015Updated 11 years ago
- Mastering Apache Spark 2x, published by Packt☆17Jan 30, 2023Updated 3 years ago