blaze / datafabric
A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data.
☆13 Updated 9 years ago
Alternatives and similar repositories for datafabric
Users who are interested in datafabric compare it to the libraries listed below.
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead ☆53 Updated 7 years ago
- Collection of dask example notebooks ☆57 Updated 7 years ago
- Probabilistic Data Structures in Python (originally presented at PyData 2013) ☆55 Updated 4 years ago
- A service implementing the Carbon protocol and storing time series data using kairos ☆42 Updated 4 years ago
- Generate ipywidgets from Parameterized objects in the notebook ☆35 Updated 6 years ago
- Ansible role to deploy and configure Airflow ☆41 Updated 3 weeks ago
- A python module that will check for package updates. ☆30 Updated 4 years ago
- A Machine Learning API with native redis caching and export + import using S3. Analyze entire datasets using an API for building, trainin… ☆100 Updated 3 years ago
- python parallel map on kubernetes ☆33 Updated 8 years ago
- Deploy Dask on Marathon ☆10 Updated 8 years ago
- Data analysis and reporting tool for quick access to custom charts and tables in Jupyter Notebooks and in the shell. ☆123 Updated last week
- Task scheduling and blocked algorithms for parallel processing ☆17 Updated 3 weeks ago
- A pandas.DataFrame-based ORM. ☆85 Updated 3 years ago
- T4 is now in production as Quilt 3 ☆64 Updated 6 years ago
- Fast, easy and intuitive machine learning prototyping. ☆124 Updated 11 years ago
- Pandas Msgpack ☆24 Updated 3 years ago
- An implementation of the multi-armed bandit optimization pattern as a Flask extension ☆81 Updated 2 weeks ago
- Proposals for new Jupyter subprojects to enter into incubation ☆18 Updated 5 years ago
- Start a cluster in EC2 for dask.distributed ☆105 Updated 5 years ago
- Airflow plugin to transfer arbitrary files between operators ☆78 Updated 7 years ago
- Task Orchestration Tool Based on SWF and boto3 ☆39 Updated 7 years ago
- Experimental docker-compose setup to bootstrap distributed on a docker-swarm cluster. ☆92 Updated 8 years ago
- High Level Kafka Scanner ☆19 Updated 8 years ago
- A Python library for dealing with splittable files ☆42 Updated 6 years ago
- A cookiecutter template for Apache Spark applications written in Scala ☆10 Updated 7 years ago
- AsyncIO serving for data science models ☆24 Updated 3 years ago
- Modularly extensible semantic metadata validator ☆84 Updated 10 years ago
- A Python wrapper for MADlib (http://madlib.net) - an open source library for scalable in-database machine learning algorithms ☆63 Updated 5 years ago
- A Topic Modeling toolbox ☆92 Updated 9 years ago
- Language defining a data description protocol ☆186 Updated 2 years ago