blaze / datafabric
A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data.
☆13Updated 9 years ago
Alternatives and similar repositories for datafabric:
Users that are interested in datafabric are comparing it to the libraries listed below
- Fast and modular async task library for Google App Engine.☆37Updated last year
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 6 years ago
- Proposals for new Jupyter subprojects to enter into incubation☆18Updated 4 years ago
- Deploy Dask on Marathon☆10Updated 8 years ago
- Enhance your feature engineering workflow with Kodiak☆19Updated last year
- A python module that will check for package updates.☆28Updated 3 years ago
- A Jupyter Lab extension for rendering tabular data☆35Updated 7 years ago
- A library for constructing finite state machines☆56Updated 9 years ago
- A cookiecutter template for Apache Spark applications written in Scala☆10Updated 6 years ago
- A service implementing the Carbon protocol and storing time series data using kairos☆42Updated 4 years ago
- A Pachyderm deep learning tutorial for conference workshops☆19Updated 7 years ago
- Utilities and examples to asssist in working with PySpark and Cassandra.☆36Updated 10 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- from zero to storm cluster for realtime classification using sklearn☆12Updated 10 years ago
- A small timeseries transformation API built on Flask and Pandas☆86Updated 2 years ago
- Collection of dask example notebooks☆58Updated 7 years ago
- Interactive performance benchmarking in Jupyter☆33Updated 3 months ago
- High Level Kafka Scanner☆19Updated 7 years ago
- Simple spill-to-disk dictionary☆17Updated 8 years ago
- Invoke Pandas plotting by piping in SQL output via PSQL (Can be used with Postgres or Greenplum or any SQL engine).☆16Updated 10 years ago
- Spavro is a (sp)eedier avro library -- Spavro is a fork of the official Apache AVRO python 2 implementation with the goal of greatly impr…☆26Updated last year
- Dockerfiles for building docker images☆27Updated 4 months ago
- Pandas Msgpack☆23Updated 2 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/)☆11Updated 10 years ago
- Task Orchestration Tool Based on SWF and boto3☆38Updated 6 years ago
- Docker container to make running Luigi tasks real easy.☆11Updated 8 years ago
- Library to convert OpenTSDB data to pandas datastructures☆15Updated 9 years ago
- Optional extensions for petl based on third party libraries.☆44Updated 9 years ago
- Fast, easy and intuitive machine learning prototyping.☆124Updated 10 years ago
- Functional Airflow DAG definitions.☆38Updated 7 years ago