ybenoit / scikit-learn-to-spark-mlLinks
Notebook comparing scikit-learn and Spark ML for building Machine Learning Pipelines
☆13Updated 9 years ago
Alternatives and similar repositories for scikit-learn-to-spark-ml
Users that are interested in scikit-learn-to-spark-ml are comparing it to the libraries listed below
Sorting:
- Large-scale topic discovery with Sampled-MinHashing☆10Updated 5 years ago
- A neural text process python lib for context-based feature extraction on Seq-Tagging data.☆10Updated 6 years ago
- The top 10 solution to the "Growing Instability: Classifying Crisis Reports" challenge☆21Updated 7 years ago
- The experiment software underlying two papers published at ECIR-2015 and SEMEVAL-2015.☆37Updated 10 years ago
- Social Media and Text Analytics Course at UPenn☆24Updated 2 years ago
- An Information Extraction Framework with Deep Learning developed at New York University☆15Updated 8 years ago
- Different approaches to computing document similarity☆28Updated 8 years ago
- A toolkit for generating paraphrase vector representations for words in context☆23Updated 10 years ago
- ☆16Updated 7 years ago
- Code for "So similar and yet incompatible: Toward the automated identification of semantically compatible words" in NAACL 2015 proceedi…☆11Updated 10 years ago
- A Deep and Autoregressive Approach for Topic Modeling of Multimodal Data, TPAMI, http://arxiv.org/abs/1409.3970☆39Updated 9 years ago
- Experiments on english wikipedia. GloVe and word2vec.☆13Updated 9 years ago
- Machine learning prediction of movies genres using Gensim's Doc2Vec and PyMongo - (Python, MongoDB)☆36Updated 2 years ago
- ☆13Updated 7 years ago
- tag doc using topN words with lda☆10Updated 9 years ago
- Variants of Multi-Perspective Convolutional Neural Networks☆22Updated last year
- Training scripts for paper Miceli Barone et al. 2017 "Deep Architectures for Neural Machine Translation"☆11Updated 7 years ago
- Risk Minimization Algorithms in Structured Prediction (JMLR 2016)☆13Updated 8 years ago
- ☆29Updated 10 years ago
- Neural Network Models for Multi-label learning☆17Updated 4 years ago
- Distributed LDA, takes raw text as input and outputs topic word table.☆16Updated 9 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"☆21Updated 9 years ago
- ☆10Updated 9 years ago
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow☆13Updated 8 years ago
- An aspiring attempt to generate a continuous space of sentences with DenseNet☆26Updated 8 years ago
- ☆25Updated 9 years ago
- Experiment code for AAAI paper: A Neural Probabilistic Model for Context Based Citation Recommendation☆9Updated 7 years ago
- ☆10Updated 9 years ago
- ☆18Updated 8 years ago
- Sequential convolutional architectures for text classification☆29Updated 9 years ago