Machine Learning and Data Analysis Case Studies using Spark.
☆72Mar 22, 2021Updated 5 years ago
Alternatives and similar repositories for Data-Science-with-Spark
Users who are interested in Data-Science-with-Spark are comparing it to the libraries listed below.
- Data Science Learning Notes☆11Oct 18, 2023Updated 2 years ago
- Machine Learning Implementations in Python☆65Jun 24, 2021Updated 4 years ago
- Analyzing and calculating key marketing metrics with SQL and Python☆14Feb 24, 2019Updated 7 years ago
- Statistical Hypothesis Testing with the Pingouin Python Library.☆11Aug 25, 2022Updated 3 years ago
- Repository used for Spark Trainings☆54Apr 21, 2023Updated 2 years ago
- ☆17Mar 18, 2018Updated 8 years ago
- Data science analysis for Etsy Marketplace data☆23Aug 25, 2016Updated 9 years ago
- A repository for applying ML to optimize supply chain management, covering demand forecasting, inventory, logistics, and supplier selec…☆25Mar 20, 2024Updated 2 years ago
- Extracting LinkedIn comments from any post and export it to Excel file☆23Oct 17, 2018Updated 7 years ago
- This repository contains a bunch of cheat sheets for different Python libraries which are used in order to develop data science applica…☆20Nov 1, 2017Updated 8 years ago
- This repository demonstrates how data science can help identify employee attrition, which is part of Human Resource Management☆15May 20, 2019Updated 6 years ago
- Repo for the Deep Reinforcement Learning Nanodegree program☆12Jun 12, 2023Updated 2 years ago
- Kaggling Home Credit Default Risk in a pipeline fashion.☆12Sep 20, 2018Updated 7 years ago
- This repository focuses on saving my LinkedIn articles and stuff that I find "USEFUL" on LinkedIn.☆156Jan 18, 2023Updated 3 years ago
- I'm learning how to build data pipelines to work with large datasets. (:☆14Mar 4, 2022Updated 4 years ago
- json files and rest calls to add custom atlas types and create entities☆12Mar 27, 2017Updated 9 years ago
- Our solution to the data science hackathon by McKinsey, Prohack by our team D1D, which was ranked 4th on public leaderboard and 25th on p…☆10Jun 21, 2020Updated 5 years ago
- Huemul BigDataGovernance is a framework that works on top of Spark, Hive, and HDFS. It enables the implementation of a corporate strategy …☆11Apr 21, 2023Updated 2 years ago
- ROS node for the Parrot Bebop (1/2) remote operation☆12Apr 30, 2019Updated 6 years ago
- List of interesting links about ML Algorithms, Data Science, Network Analysis, and others.☆13May 9, 2023Updated 2 years ago
- Spark stream data processing; can receive data from flume-ng and Kafka☆11Sep 16, 2015Updated 10 years ago
- Reddit Data Science Project Ideas☆11Dec 28, 2019Updated 6 years ago
- Data Science Case Studies☆18Jan 31, 2021Updated 5 years ago
- ☆10May 3, 2025Updated 10 months ago
- A comprehensive guide to applying statistical techniques in machine learning, including data preprocessing, model development, evaluation…☆27Jan 29, 2025Updated last year
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- Slides and code for PyCon Canada 2016 talk.☆11Nov 12, 2016Updated 9 years ago
- Getting started with machine learning.☆15Dec 25, 2019Updated 6 years ago
- An example showing how to integrate Apache Kafka with Akka Streams and Akka HTTP.☆15Sep 28, 2016Updated 9 years ago
- Bebop 2 custom firmware☆17Jul 7, 2019Updated 6 years ago
- Infuse AI into your application. Create and deploy a customer churn prediction model with IBM Cloud Private for Data, Db2 Warehouse, Spar…☆18Sep 17, 2025Updated 6 months ago
- A list of take home challenges for data science interviews I've compiled☆17Sep 6, 2018Updated 7 years ago
- datascience oriented utilities: histograms, aggregations, plots, data manipulation, and other common tasks.☆40Apr 19, 2023Updated 2 years ago
- ☆16Apr 3, 2019Updated 6 years ago
- Resizes text elements proportionally to fit any element☆13Sep 12, 2017Updated 8 years ago
- Python scripts to facilitate easy working☆11Mar 23, 2026Updated last week
- Hellinger distance decision tree☆10Jun 15, 2015Updated 10 years ago
- Churn Prediction with PySpark using MLlib and ML Packages☆58Feb 4, 2016Updated 10 years ago
- Group project for the WorldQuant University module, risk management.☆13Feb 3, 2019Updated 7 years ago