aprial / growth-workshop
t test
☆10Updated 10 years ago
Alternatives and similar repositories for growth-workshop:
Users that are interested in growth-workshop are comparing it to the libraries listed below
- Dato/Turi DS Conf talk on NLP and Elasticsearch analysis of reviews, plus JS implementation☆45Updated 8 years ago
- content discovery... IN 3D☆49Updated 7 years ago
- Code for PyData Talk on "Classifying Products Based on Images and Text using Keras"☆30Updated 7 years ago
- Analysis pipeline for quick ML analyses.☆11Updated 6 years ago
- Material and slides for Boston NLP meetup May 23rd 2016☆17Updated 8 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆30Updated 9 years ago
- feng - feature engineering for machine-learning champions☆27Updated 8 years ago
- JPMML-SparkML plugin for converting LightGBM-Spark models to PMML☆41Updated 3 years ago
- Fraud Detection using ensemble of Statistical, Network analysis and Machine learning approach.☆68Updated 10 years ago
- ☆11Updated 6 years ago
- Algorithms for "schema matching"☆26Updated 8 years ago
- Public code files for the DDL blog☆56Updated 6 years ago
- ☆36Updated 9 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- Jupyter notebook containing code from text preprocessing blog post☆10Updated 8 years ago
- Additional files for the Otto Group Challenge hosted by Kaggle☆36Updated 10 years ago
- implement some outlier detection algorithms☆11Updated 9 years ago
- Code & Data for Introduction to Machine Learning with Scikit-Learn☆81Updated 6 years ago
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 9 years ago
- A simple tool for plotting Spark ML's Decision Trees☆41Updated 3 years ago
- AXA Driver Telematics Challenge on Kaggle.com☆51Updated 7 years ago
- PMML evaluator library for the Apache Hive data warehouse software (legacy codebase)☆13Updated 10 years ago
- A Scalable Data Cleaning Library for PySpark.☆27Updated 5 years ago
- PyTennessee 2014: Statistical Data Analysis in Python☆85Updated 10 years ago
- Posts, presentations and papers I've written.☆39Updated 4 years ago
- Containing codes of participation in Kaggle competitions.☆37Updated 9 years ago
- Code & Data for V3 of the Fast data Processing with Spark 2 book☆15Updated 8 years ago
- Another, hopefully better, implementation of ALS on Spark☆14Updated 9 years ago
- Quora Kaggle Competition : Natural Language Processing using word2vec embeddings, scikit-learn and xgboost for training☆18Updated 6 years ago