Cheng-Lin-Li / Spark
Python 2.7 code and learning notes for Spark 2.1.1
☆24Updated 6 years ago
Alternatives and similar repositories for Spark:
Users who are interested in Spark are comparing it to the repositories listed below
- Finding customer lookalikes using Machine Learning in PySpark☆33Updated 6 years ago
- ☆63Updated 3 years ago
- ☆113Updated 7 years ago
- Kaggle: Quora Insincere Questions Classification - detect toxic content to improve online conversations☆36Updated 6 years ago
- Text Classification through CNN, RNN & HAN using Keras☆238Updated 5 years ago
- Top 1% solution to the Toxic Comment Classification Challenge on Kaggle.☆193Updated 6 years ago
- Collection of Deep Learning Text Classification Models in Keras; Includes a GPU tutorial.☆14Updated 6 years ago
- Yelp round-10 review comments classification using deep learning (LSTM and CNN) and natural language processing.☆75Updated 5 years ago
- ☆34Updated 5 years ago
- Top 1% rankings (22/3270) code sharing for Kaggle competition Sberbank Russian Housing Market: https://www.kaggle.com/c/sberbank-russian-…☆35Updated 7 years ago
- A parallel distributed implementation of DBSCAN on Spark using Python☆75Updated 6 years ago
- Used two different methods to predict the sentiment (positive or negative) of movie reviews.☆56Updated 6 years ago
- Recommender built using keras☆35Updated 6 years ago
- All my experiments with AI and ML☆118Updated 6 years ago
- ☆137Updated 6 years ago
- Experiments on how to use machine learning to rank a product catalog☆84Updated 7 years ago
- data analysis, big data development, cloud, and any other cool things!☆30Updated 6 months ago
- This repository contains the implementation of paper "Hierarchical Attentional Hybrid Neural Networks for Document Classification"☆59Updated 3 years ago
- Text classification with Convolutional Neural Networks on Yelp, IMDB & sentence polarity dataset v1.0☆118Updated 3 years ago
- ☆32Updated 5 years ago
- Kaggle competition: TalkingData AdTracking Fraud Detection Challenge☆13Updated 6 years ago
- Code for Kaggle Jigsaw Toxic Comment, 34th / 4551 (Top 1%) Solution☆41Updated 6 years ago
- A collection of Medium posts☆55Updated 6 years ago
- insight data engineering fellow project☆14Updated 8 years ago
- Address imbalanced classes in machine learning projects.☆66Updated 6 years ago
- Accompanying code for the Medium article☆165Updated 5 years ago
- data preparation☆90Updated 6 years ago
- Sentiment analysis with LSTM recurrent neural networks.☆52Updated 5 years ago
- A handy Python wrapper of the famous VMSP algorithm for mining maximal sequential patterns.☆35Updated 7 years ago
- [ACM-WSDM] 3rd place solution at WSDM Cup 2019, Fake News Classification on Kaggle.☆63Updated 5 years ago