grailbio / reflow
A language and runtime for distributed, incremental data processing in the cloud
☆965Updated last year
Alternatives and similar repositories for reflow:
Users that are interested in reflow are comparing it to the libraries listed below
- Robust, flexible and resource-efficient pipelines using Go and the commandline☆1,087Updated 6 months ago
- A serverless cluster computing system for the Go programming language☆553Updated last year
- Bigmachine is a library for self-managing serverless computing in Go☆200Updated last year
- Quilt is a data mesh for connecting people with actionable data☆1,330Updated this week
- An open source platform for managing and analyzing biomedical big data☆400Updated this week
- Specification for the Workflow Description Language (WDL).☆794Updated this week
- Scientific workflow engine designed for simplicity & scalability. Trivially transition between one off use cases to massive scale product…☆1,009Updated this week
- Open-source command-line tool to run batch computing tasks and workflows on backend services such as Google Cloud.☆265Updated 5 months ago
- A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Doc…☆2,527Updated last year
- Repository for the CWL standards. Use https://cwl.discourse.group/ for support 😊☆1,459Updated 2 months ago
- Distributed Stream Processing☆1,475Updated 3 years ago
- A golang expression evaluator & Library to build SQL query engine based functionality.☆865Updated last year
- columnar storage + NoSQL OLAP engine | https://logv.org☆306Updated 5 months ago
- Distributed Named Pipes☆454Updated 7 years ago
- TrailDB is an efficient tool for storing and querying series of events☆1,091Updated 4 years ago
- HyperMinHash: Bringing intersections to HyperLogLog☆302Updated 6 years ago
- Build platforms that flexibly mix SQL, batch, and stream processing paradigms☆740Updated last week
- A scalable, efficient, cross-platform (Linux/macOS) and easy-to-use workflow engine in pure Python.☆907Updated this week
- TileDB☆79Updated last year
- Data workflow tool, like a "Make for data"☆1,480Updated 2 years ago
- Berkeley Tree Database (BTrDB) server☆912Updated 3 years ago
- EliasDB a graph-based database.☆1,010Updated 2 years ago
- Kasper is a lightweight library for processing Kafka topics.☆440Updated 7 years ago
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform☆260Updated last year
- A SQLite vtable extension to read Parquet files☆270Updated 3 years ago
- Ko: A generic type-safe language for concurrent, stateful, deadlock-free systems and protocol manipulations☆307Updated last year
- Temporal graph store abstraction layer.☆982Updated last year
- Control Data Store☆268Updated 3 weeks ago
- Arc is an opinionated framework for defining data pipelines which are predictable, repeatable and manageable.☆169Updated last year
- A general-purpose data analysis engine radically changing the way batch and stream data is processed☆7Updated 6 years ago