TF-IDF with Spark for the Kaggle popcorn competition
☆10Jul 1, 2015Updated 10 years ago
Alternatives and similar repositories for tf-idf-spark-and-python
Users that are interested in tf-idf-spark-and-python are comparing it to the libraries listed below
Sorting:
- I developed this case study only in 7 days with Pyspark (Spark 1.6.0) SQL & MLlib. I used Databricks cluster and AWS. %90 AUC is achieved…☆17May 7, 2016Updated 9 years ago
- Code for the Kaggle competition "Bag of Words Meets Bags of Popcorn"☆50Jul 1, 2015Updated 10 years ago
- Different entries to kaggle contests using Apache Spark☆13Jun 5, 2017Updated 8 years ago
- Kaggle's click through rate prediction with Spark Pipeline API☆23Feb 10, 2016Updated 10 years ago
- ☆11Sep 6, 2019Updated 6 years ago
- Bosch Kaggle competion: Reduce manufacturing failures (https://www.kaggle.com/c/bosch-production-line-performance)☆24Nov 13, 2016Updated 9 years ago
- ☆14Dec 22, 2015Updated 10 years ago
- ☆10Jul 21, 2017Updated 8 years ago
- Russian coreference resolution made as simple and accessible as could be☆12Sep 3, 2022Updated 3 years ago
- A naïve ant colony simulation. Only useful as a pretty simulation, not sophisticated enough for real ant colony optimisation or anything …☆14Jul 25, 2011Updated 14 years ago
- Collect iftop metrics and send them via telegraf format (and then import them how you like)☆12Aug 9, 2022Updated 3 years ago
- ☆11May 15, 2017Updated 8 years ago
- ☆12Jun 17, 2019Updated 6 years ago
- Automatically exported from code.google.com/p/jbirch☆12Sep 6, 2022Updated 3 years ago
- ☆13Jan 17, 2024Updated 2 years ago
- A regularized version of RBM for unsupervised feature selection.☆13Nov 20, 2019Updated 6 years ago
- A basic tutorial on GIT from CodingForEntrepreneurs.com☆10Jul 1, 2014Updated 11 years ago
- an example of integrating Spark Streaming with Google Pub/Sub and Google Datastore☆17Mar 22, 2017Updated 8 years ago
- A dataset of news headlines for detecting causalities☆14May 9, 2022Updated 3 years ago
- NyxOS is a 16-bit minimalistic OS☆10Jan 1, 2019Updated 7 years ago
- Gibbs sampling inference to LDA☆19Apr 4, 2014Updated 11 years ago
- Convolutional Neural Network model for Sentiment Analysis of IMDB movie reviews☆65Dec 20, 2016Updated 9 years ago
- R Code + Jupyter notebook for replicating analysis of when and where arrests in San Francisco occur.☆23Dec 7, 2015Updated 10 years ago
- ☆13Jun 30, 2019Updated 6 years ago
- Publication delays at PLOS and 3,475 other journals☆20Jul 7, 2015Updated 10 years ago
- Kaggle-Bag of Words Meets Bags of Popcorn☆23Jul 2, 2015Updated 10 years ago
- Click through rate prediction☆19Feb 14, 2017Updated 9 years ago
- solution for the 5th place of cikm cup 2014☆19Jan 28, 2015Updated 11 years ago
- Reinforcement Learning applied to the Snake Game☆11Jul 17, 2014Updated 11 years ago
- 人人网资料备份python脚本☆10Dec 7, 2018Updated 7 years ago
- Spark 2.0 Python Machine Learning examples☆98Oct 7, 2019Updated 6 years ago
- Sketch code along the reading of Mining Massive Datasets☆14Feb 9, 2016Updated 10 years ago
- Implementation of Fast Orthogonal Search (FOS) Algorithm in MATLAB☆14Aug 4, 2019Updated 6 years ago
- 1st place solution for GramEval-2020☆14Jan 13, 2023Updated 3 years ago
- This is a repository in which we take part in the big data competition, focusing on recommendation system.☆17May 24, 2016Updated 9 years ago
- Empirical tests of various bandit algorithms.☆16Dec 6, 2014Updated 11 years ago
- ☆12May 24, 2016Updated 9 years ago
- cs249_Parker_Proj1☆11Jun 13, 2014Updated 11 years ago
- 44th place solution in "Santander Customer Satisfaction"☆11May 16, 2016Updated 9 years ago