schio / Data-Science
data-science-from-scratch of joelgrus
☆20Updated 6 years ago
Alternatives and similar repositories for Data-Science:
Users that are interested in Data-Science are comparing it to the libraries listed below
- simple solution based on Gradient Boost and Random Forest, rank 24/3251 (top 1%) within 60 lines of python code☆14Updated 5 years ago
- Library for Multi-instance Multi-label learning☆9Updated 11 months ago
- Repo for the Advanced Python Skills course that I created (hosted in Udemy and Skillshare)☆15Updated 4 years ago
- Scalable Data Analysis in Python with Dask, by Packt publishing☆11Updated 4 years ago
- ☆10Updated 5 years ago
- Examples and hacks inspired by the book Data Science from Scratch by Joel Grus☆27Updated 6 years ago
- Sequence pattern discovery using Generalized Sequential Pattern Mining Algorithm☆10Updated 4 years ago
- Notes on "Data Science from Scratch" by Joel Grus☆11Updated 8 years ago
- Small example on how you can detect multicollinearity☆13Updated 3 years ago
- Knowledge Discovery in Database. * In this project we focused our analysis on applying data analysis techniques, create visualizations an…☆10Updated 7 years ago
- ☆9Updated 5 years ago
- A abstract text classification library using language models. Build your fine-tuned text classifier in 5 steps.☆10Updated 4 years ago
- Machine Learning based model to predict Insurance Pure Premium☆12Updated 8 years ago
- Predicting if a employee is going to leave☆9Updated 7 years ago
- A library of techniques for local interpretation of machine learning models☆9Updated 2 years ago
- Hands-on Introduction to Machine Learning with Python, Pandas, Matplotlib and Scikit-Learn☆11Updated 5 years ago
- ACM RecSys Challenge 2016: Job recommender system for XING☆8Updated 8 years ago
- Creating a hybrid recommender system using LightFM. Learn how to tackle the cold start problem.☆13Updated 3 years ago
- I have written Machine learning equations from scratch in python using Andrew Ng Coursera dataset. Andrew-Ng-Coursera-Machine-learning-in…☆20Updated 3 years ago
- IMDB Movie Reviews Large Dataset - 50k Reviews☆10Updated 4 years ago
- BigTweet is an agent-based social simulator for rumor spreading models and rumor control strategies in Twitter with support for Big Data …☆10Updated 8 years ago
- Fraud detection in credit card payments and auto insurance claims using PySpark☆13Updated 6 years ago
- Code Repository for The Kaggle Book 2nd Edition, Published by Packt☆11Updated last week
- ☆8Updated 9 years ago
- Building Machine Learning Systems with Python by Packt Publishing☆51Updated 2 years ago
- ☆26Updated 2 years ago
- An Efficient Gaussian Kernel Based Fuzzy-Rough Set Approach for Feature Selection☆11Updated 6 years ago
- Mobile Artificial Intelligence Projects, published by Packt☆11Updated 2 years ago
- Why are our best and most experienced employees leaving prematurely?☆12Updated 7 years ago
- Anomaly detection system for medical insurance claims data☆15Updated 7 years ago