aaronbenz / caspanda
Using Pandas easily with Cassandra
☆23Updated 7 years ago
Alternatives and similar repositories for caspanda:
Users that are interested in caspanda are comparing it to the libraries listed below
- Relatively simple text classification powered by spaCy☆41Updated 9 years ago
- An implementation of the multi-armed bandit optimization pattern as a Flask extension☆81Updated last month
- feng - feature engineering for machine-learning champions☆27Updated 8 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy"☆15Updated 8 years ago
- Collection of dask example notebooks☆58Updated 7 years ago
- A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data.☆13Updated 9 years ago
- AsyncIO serving for data science models☆24Updated 2 years ago
- A Cython implementation of the affine gap string distance☆57Updated 2 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013)☆55Updated 3 years ago
- a django app to persist and retrieve scikit learn machine learning models☆48Updated 2 years ago
- A Topic Modeling toolbox☆92Updated 9 years ago
- An automated ingestion service for blogs to construct a corpus for NLP research.☆87Updated 6 years ago
- A short guide for transitioning from Python to Scala☆65Updated 9 years ago
- Natural Language Processing with Spark's MLlib☆62Updated 7 years ago
- Slack notifications for the Luigi workflow manager☆46Updated 3 years ago
- Tools, wrappers, etc... for data science with a concentration on text processing☆206Updated 2 years ago
- Dask powered gridsearch and pipeline a la scikit-learn☆42Updated 9 years ago
- A pandas.DataFrame-based ORM.☆85Updated 3 years ago
- Experimental parallel data analysis toolkit.☆121Updated 3 years ago
- Inline, interactive graphs inside jupyter/ipython notebooks☆16Updated 7 years ago
- A cookiecutter template for Apache Spark applications written in Scala☆10Updated 6 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers.☆30Updated 9 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 11 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- Code reference from my Qbox blog posts.☆87Updated 9 years ago
- Sentiment analysis made easy; built on top off solid libraries.☆24Updated 8 years ago
- Tools for performing hyperparameter search with Scikit-Learn and Dask http://dask-searchcv.readthedocs.io☆11Updated 7 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 6 years ago
- SigOpt wrappers for scikit-learn methods☆75Updated last year