Byhiras / pyavroc
☆49Updated this week
Related projects: ⓘ
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆53Updated 6 years ago
- Utils around luigi.☆65Updated 3 years ago
- Luigi Plugin for Hubot☆35Updated 8 years ago
- S3-backed notebook manager for IPython☆29Updated 7 years ago
- Unified interface for local and distributed ndarrays☆158Updated 5 years ago
- A wrapper for libhdfs3 to interact with HDFS from Python☆136Updated 3 years ago
- python implementation of the parquet columnar file format.☆22Updated last week
- Partitioned storage system based on blosc. **No longer actively maintained.**☆153Updated 7 years ago
- ☆51Updated this week
- Battle-tested Apache Storm Multi-Lang implementation for Python☆71Updated 2 years ago
- Scheduled task execution on top of AWS Data Pipeline☆43Updated 9 years ago
- Apache Mesos backend for Dask scheduling library☆28Updated 6 years ago
- C++ native client for Impala and Hive, with Python / pandas bindings☆72Updated 6 years ago
- SQL on dataframes - pandas and dask☆64Updated 6 years ago
- ☆57Updated this week
- Python collections supporting parallel map/reduce style methods☆40Updated 11 months ago
- A query and aggregation framework for Bcolz (W2013-01)☆56Updated 2 months ago
- ☆37Updated this week
- Tools for writing, submitting, debugging, and monitoring Storm topologies in pure Python☆247Updated last year
- A python RPC client stack☆45Updated 2 years ago
- Cython based wrapper for libavro☆25Updated 4 years ago
- Simple spill-to-disk dictionary☆60Updated 2 years ago
- fast online statistics collection☆78Updated 2 years ago
- Task Orchestration Tool Based on SWF and boto3☆38Updated 5 years ago
- vbench: A tool for benchmarking your code through time, for showing performance improvement or regressions☆246Updated 6 years ago
- Utilities to work with Scala/Java code with py4j☆40Updated 8 months ago
- A Python wrapper for Cascading☆222Updated 4 years ago
- Briefly - A Python Meta-programming Library for Job Flow Control☆105Updated 6 years ago
- Cloud ready pure-python streaming data pipeline library☆154Updated 3 weeks ago