dreyco676 / nlp_spark
Natural Language Processing with Spark's MLlib
☆62Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for nlp_spark
- Sample repo for luigi tasks & config☆36Updated 8 years ago
- A short guide for transitioning from Python to Scala☆65Updated 8 years ago
- Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn.☆69Updated 9 years ago
- Pydata Dallas 2015 Scikit-Learn Tutorial☆62Updated 9 years ago
- A Topic Modeling toolbox☆93Updated 8 years ago
- Data and code for "Fast Data Applications with Spark and Python"☆25Updated 8 years ago
- Code & Data for Introduction to Machine Learning with Scikit-Learn☆81Updated 6 years ago
- Pydata NYC 2014 Scikit Learn Tutorial☆64Updated 9 years ago
- Code reference from my Qbox blog posts.☆87Updated 9 years ago
- My machine learning model for the See Click Predict Fix Kaggle competition☆31Updated 7 years ago
- ☆41Updated 4 years ago
- Scripts to Analyze Pronto's Data Release☆25Updated 8 years ago
- An external PySpark module that works like R's read.csv or Panda's read_csv, with automatic type inference and null value handling. Parse…☆90Updated 8 years ago
- PyData Madrid 2016 material for the talk: A Primer to recommendation Systems☆37Updated 8 years ago
- Python forecasting and smoothing library☆68Updated 5 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyPata Paris 2015☆50Updated 9 years ago
- Additional files for the Otto Group Challenge hosted by Kaggle☆36Updated 9 years ago
- Tools for performing hyperparameter search with Scikit-Learn and Dask http://dask-searchcv.readthedocs.io☆11Updated 6 years ago
- Word2Vec models with Twitter data using Spark. Blog:☆65Updated 5 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52Updated 8 years ago
- DePy 2015 Talk☆117Updated 6 years ago
- ☆146Updated 8 years ago
- Dato/Turi DS Conf talk on NLP and Elasticsearch analysis of reviews, plus JS implementation☆43Updated 8 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy"☆15Updated 8 years ago
- Tools, wrappers, etc... for data science with a concentration on text processing☆206Updated 2 years ago
- My capstone project for Galvanize (Zipfian Academy)☆38Updated 5 years ago
- Code and Notebooks for the Natural Language Processing with Python course.☆66Updated 6 years ago
- Content for architecting a data science platform for products using Luigi, Spark & Flask.☆163Updated 4 years ago
- Collection of dask example notebooks☆57Updated 6 years ago