Machine Learning and Data Analysis Case Studies using Spark.
☆72Mar 22, 2021Updated 5 years ago
Alternatives and similar repositories for Data-Science-with-Spark
Users that are interested in Data-Science-with-Spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An awesome list of high-quality open datasets in public domains (on-going). By everyone, for everyone!☆33Mar 9, 2017Updated 9 years ago
- Machine Learning Implementations in Python☆65Jun 24, 2021Updated 4 years ago
- Create interactive quizzes using Shiny and Rmarkdown☆16Aug 18, 2024Updated last year
- The objective of the project is to obtain the prediction of delivery date and freight cost based on the historic trend and attributes.The…☆10Aug 1, 2019Updated 6 years ago
- Analyzing and calculating key marketing metrics with SQL and Python☆14Feb 24, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Hypothesis testing (Parametric/Non-Parametric)☆11Oct 8, 2019Updated 6 years ago
- Statistical Hypothesis Testing with the Pingouin Python Library.☆11Aug 25, 2022Updated 3 years ago
- ☆17Mar 18, 2018Updated 8 years ago
- A command-line batch interface to the RuleFit statistical model building program.☆20Jan 30, 2017Updated 9 years ago
- Extracting LinkedIn comments from any post and export it to Excel file☆23Oct 17, 2018Updated 7 years ago
- This repository contains the bunch of cheat sheets of diffenrent python libraries which are used in order to develop data science applica…☆20Nov 1, 2017Updated 8 years ago
- Lista de enlaces a datasets relacionados con Colombia☆28Mar 17, 2016Updated 10 years ago
- Minimal example to setup a Jenkins-CI pipeline for data science projects on OpenShift in a couple of minutes.☆27Jan 7, 2025Updated last year
- Personalization with deep learning in 100 lines of code☆15Mar 31, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code, Examples, Templates and Scripts for DataWorksSummit 2017 Sydney Talk☆17Sep 19, 2017Updated 8 years ago
- ☆12Sep 20, 2017Updated 8 years ago
- Datasets and notebooks☆13Oct 26, 2016Updated 9 years ago
- ☆13Dec 26, 2022Updated 3 years ago
- Build an accurate sentiment model using Python with scikit-learn☆10Sep 8, 2016Updated 9 years ago
- ☆101Jun 25, 2018Updated 7 years ago
- Our solution to the data science hackathon by McKinsey, Prohack by our team D1D, which was ranked 4th on public leaderboard and 25th on p…☆10Jun 21, 2020Updated 5 years ago
- This project, "Detecting Anomaly in ECG Data Using AutoEncoder with PyTorch," focuses on leveraging an LSTM-based Autoencoder for identif…☆16Jan 13, 2024Updated 2 years ago
- The dataset is of a Global Pharmacy Company. The dataset comprises of Historical sales, Product Information and products which need forec…☆28Aug 27, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Data Science Case Studies☆18Jan 31, 2021Updated 5 years ago
- ☆10May 3, 2025Updated 11 months ago
- Example to create lineage in Atlas with sqoop and spark☆14Apr 5, 2017Updated 9 years ago
- Fast Python Collaborative Filtering for Implicit Datasets☆15Oct 17, 2016Updated 9 years ago
- A comprehensive guide to applying statistical techniques in machine learning, including data preprocessing, model development, evaluation…☆27Jan 29, 2025Updated last year
- Tools for performing hyperparameter search with Scikit-Learn and Dask http://dask-searchcv.readthedocs.io☆11Nov 16, 2017Updated 8 years ago
- A tutorial for using Hadoop with Python and Hive☆10May 26, 2015Updated 10 years ago
- Galvanize DSI Capstone: Subreddit Recommender☆15Jan 15, 2019Updated 7 years ago
- Infuse AI into your application. Create and deploy a customer churn prediction model with IBM Cloud Private for Data, Db2 Warehouse, Spar…☆18Sep 17, 2025Updated 7 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- colored table output in R terminal☆17Sep 27, 2022Updated 3 years ago
- A repository with different graph processing tehnologies☆11Nov 30, 2015Updated 10 years ago
- Welcome to some case study of data science projects - (Personal Projects).☆23Dec 30, 2025Updated 3 months ago
- Repo for practical data science problems approaches, including notebook demo and working scripts | #DS | #analysis☆12Oct 13, 2020Updated 5 years ago
- ☆16Apr 3, 2019Updated 7 years ago
- ☆10Oct 17, 2021Updated 4 years ago
- Resizes text elements proportionally to fit any element☆13Sep 12, 2017Updated 8 years ago