bloomreach / brieflyLinks
Briefly - A Python Meta-programming Library for Job Flow Control
☆106Updated 7 years ago
Alternatives and similar repositories for briefly
Users that are interested in briefly are comparing it to the libraries listed below
Sorting:
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 7 years ago
- Utils around luigi.☆66Updated 2 months ago
- Serializes data into a JSON format using AVRO schema.☆138Updated 3 years ago
- Airflow plugin to transfer arbitrary files between operators☆78Updated 7 years ago
- Task Orchestration Tool Based on SWF and boto3☆38Updated 7 years ago
- Cloud ready pure-python streaming data pipeline library☆153Updated 3 weeks ago
- Partitioned storage system based on blosc. **No longer actively maintained.**☆154Updated 9 years ago
- A pipeline abstraction for Python☆168Updated 4 years ago
- Tools for writing, submitting, debugging, and monitoring Storm topologies in pure Python☆246Updated 2 years ago
- fast online statistics collection☆78Updated 3 months ago
- Simple spill-to-disk dictionary☆61Updated 3 years ago
- A Python library for dealing with splittable files☆42Updated 5 years ago
- A wrapper for libhdfs3 to interact with HDFS from Python☆137Updated 4 years ago
- python implementation of the parquet columnar file format.☆21Updated last month
- Performance metrics, based on Coda Hale's Yammer metrics☆196Updated 2 years ago
- Application instrumentation and logging, with a geological bent.☆145Updated 3 years ago
- Package and ship relocatable python virtualenvs, like a boss.☆169Updated 6 years ago
- A wrapper around gitpython to produce pandas dataframes for analysis☆191Updated 4 months ago
- A Directed Acyclic Graph task dependency scheduler designed to simplify complex distributed pipelines☆131Updated 7 years ago
- S3-backed notebook manager for IPython☆29Updated 8 years ago
- A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data.☆13Updated 9 years ago
- Docker image for an IPython 3/Jupyter Notebook and Terminal with full Anaconda Install☆86Updated 6 years ago
- Python library for class-based schema definition, object serialization and data validation☆61Updated 10 years ago
- SQL on dataframes - pandas and dask☆64Updated 7 years ago
- mito ETL tool☆163Updated 4 years ago
- A Postgres-backed ContentsManager implementation for Jupyter☆150Updated 2 years ago
- A pure Python implementation of Apache Spark's RDD and DStream interfaces.☆271Updated last year
- A python RPC client stack☆45Updated 4 years ago
- Start a cluster in EC2 for dask.distributed☆106Updated 5 years ago
- Python Multi-Process Execution Pool: concurrent asynchronous execution pool with custom resource constraints (memory, timeouts, affinity,…☆168Updated 6 years ago