geekidharsh / predicting-harddrive-failures-using-ml
Predicting Hard Drive failure using SMART Metrics
☆8 Updated 4 years ago
Related projects:
- It is important that credit card companies are able to recognize fraudulent credit card transactions so that customers are not charged fo… ☆11 Updated 3 years ago
- Tools for extracting metadata from Tableau Desktop workbook files. ☆11 Updated 2 years ago
- ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS) ☆13 Updated 5 years ago
- classify crime into different categories using PySpark ☆21 Updated 5 years ago
- Explore integration between Watson Studio and Cognos Analytics ☆13 Updated 4 years ago
- Improving the development of Spark applications deployed as jobs on AWS services like Glue and EMR ☆12 Updated last year
- SQL ☆13 Updated 7 years ago
- This is our first project using Apache Spark. In this project, we downloaded the datasets from KAGG… ☆13 Updated 2 years ago
- Problem statement: the objective of this task is to detect hate speech in tweets. For the sake of simplicity, we say a tweet contains hate… ☆12 Updated 5 years ago
- IBGE - 2010 Census - Location and corresponding census tract code ☆10 Updated 3 years ago
- Code for my presentation: Using PySpark to Process Boat Loads of Data ☆20 Updated 6 years ago
- MLinProduction SageMaker workshop hosted in April 2020 ☆15 Updated 4 years ago
- PySpark, Databricks, h2o, MLlib ☆18 Updated 8 years ago
- Contains source files used in the Spark with Python course ☆18 Updated 5 years ago
- A sample node.js project for beginners. ☆11 Updated last month
- A portfolio of useful Tableau visualizations and dashboards is located in this repo. The README.md file within this repo contains a summa… ☆17 Updated 5 years ago
- Build and run Spark Structured Streaming pipelines in Hadoop - project using PySpark. ☆12 Updated 5 years ago
- Capstone Project for Udacity Data Engineering Nanodegree ☆9 Updated 5 years ago
- Simplify Big Data Analytics with Amazon EMR, published by Packt ☆14 Updated last year
- ☆16 Updated last year
- Collection of Databricks and Jupyter Notebooks ☆22 Updated 6 months ago
- Partly lecture and partly a hands-on tutorial and workshop, this is a three-part series on how to get started with MLflow. In this four p… ☆37 Updated 3 years ago
- An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks. ☆12 Updated 3 years ago
- A simple Spark TDD example ☆25 Updated 7 years ago
- PySpark DataFrame made easy ☆15 Updated 2 years ago
- ☆11 Updated 4 years ago
- ☆24 Updated this week
- Example project for consuming AWS Kinesis streaming data and saving it to Amazon Redshift using Apache Spark ☆11 Updated 6 years ago
- Create Interactive Dashboards with Streamlit and Python Coursera ☆10 Updated 4 years ago
- Business Data Analysis by HiPIC of CalStateLA ☆19 Updated 5 years ago