Parsely / probablyLinks
Probabilistic Data Structures in Python (originally presented at PyData 2013)
☆55Updated 4 years ago
Alternatives and similar repositories for probably
Users that are interested in probably are comparing it to the libraries listed below
Sorting:
- A Topic Modeling toolbox☆92Updated 9 years ago
- Collection of dask example notebooks☆57Updated 7 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 7 years ago
- A library that allows serialization of SciKit-Learn estimators into PMML☆72Updated 6 years ago
- Fast, easy and intuitive machine learning prototyping.☆124Updated 11 years ago
- An implementation of the multi-armed bandit optimization pattern as a Flask extension☆81Updated 3 weeks ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 9 years ago
- Sample repo for luigi tasks & config☆36Updated 9 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆53Updated 7 years ago
- Python forecasting and smoothing library☆67Updated 6 years ago
- Multidimensional data explorer and visualization tool.☆56Updated 8 years ago
- Tools, wrappers, etc... for data science with a concentration on text processing☆207Updated 3 years ago
- Natural Language Processing with Spark's MLlib☆63Updated 8 years ago
- Refinery - A locally deployable open-source web platform for analysis of large document collections☆101Updated 9 years ago
- Word2Vec models with Twitter data using Spark. Blog:☆66Updated 7 years ago
- A Machine Learning API with native redis caching and export + import using S3. Analyze entire datasets using an API for building, trainin…☆100Updated 3 years ago
- Talk on "Tree models with Scikit-Learn: Great learners with little assumptions" presented at PyPata Paris 2015☆50Updated 10 years ago
- feng - feature engineering for machine-learning champions☆27Updated 8 years ago
- Simple approximate-nearest-neighbours in Python using locality sensitive hashing.☆142Updated 13 years ago
- PyTennessee 2014: Statistical Data Analysis in Python☆85Updated 11 years ago
- Material and slides for Boston NLP meetup May 23rd 2016☆17Updated 9 years ago
- My capstone project for Galvanize (Zipfian Academy)☆38Updated 7 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc.☆59Updated 4 years ago
- A Python library for dealing with splittable files☆42Updated 6 years ago
- These are the IPython notebook files for the CSC 432 Spring '13 course.☆23Updated 10 years ago
- Material for some talks I have given☆61Updated last year
- Source code for the tutorial series at http://www.thoughtly.co/blog/prototype☆32Updated 10 years ago
- Scripts to Analyze Pronto's Data Release☆23Updated 10 years ago
- Common post-estimation tasks for scikit-learn☆17Updated 9 years ago
- Collection of pointers to slides and repositories from speakers at PyData Berlin 2016☆37Updated 9 years ago