grailbio / reflow
A language and runtime for distributed, incremental data processing in the cloud
☆965Updated last year
Related projects ⓘ
Alternatives and complementary repositories for reflow
- A serverless cluster computing system for the Go programming language☆550Updated last year
- Bigmachine is a library for self-managing serverless computing in Go☆199Updated last year
- Robust, flexible and resource-efficient pipelines using Go and the commandline☆1,071Updated 2 months ago
- MacroBase: A Search Engine for Fast Data☆661Updated last year
- Quilt is a data mesh for connecting people with actionable data☆1,328Updated this week
- Build platforms that flexibly mix SQL, batch, and stream processing paradigms☆717Updated 3 weeks ago
- Vectorized processing for Apache Arrow☆486Updated 2 years ago
- Distributed Stream Processing☆1,480Updated 3 years ago
- columnar storage + NoSQL OLAP engine | https://logv.org☆305Updated 2 months ago
- 🐎 A serverless MapReduce framework written for AWS Lambda☆694Updated 2 years ago
- A golang expression evaluator & Library to build SQL query engine based functionality.☆862Updated last year
- Distributed Named Pipes☆453Updated 7 years ago
- Brushing and linking for big data☆947Updated last week
- A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Doc…☆2,528Updated 8 months ago
- Unified Resource Scheduler to co-schedule mixed types of workloads such as batch, stateless and stateful jobs in a single cluster for bet…☆643Updated last year
- HyperLogLog with lots of sugar (Sparse, LogLog-Beta bias correction and TailCut space reduction) brought to you by Axiom☆942Updated 2 months ago
- Sparser: Raw Filtering for Faster Analytics over Raw Data☆432Updated 6 years ago
- Molecule is a Go library for parsing protobufs in an efficient and zero-allocation manner.☆406Updated 5 months ago
- Data workflow tool, like a "Make for data"☆1,482Updated 2 years ago
- Berkeley Tree Database (BTrDB) server☆908Updated 3 years ago
- HyperMinHash: Bringing intersections to HyperLogLog☆302Updated 6 years ago
- you're invited to a data party!☆1,106Updated 2 years ago
- Skycfg is an extension library for the Starlark language that adds support for constructing Protocol Buffer messages.☆647Updated 2 months ago
- Probabilistic data structures for processing continuous, unbounded streams.☆1,594Updated 3 years ago
- A distributed, fault-tolerant pipeline for observability data☆1,736Updated 7 months ago
- Repository for the CWL standards. Use https://cwl.discourse.group/ for support 😊☆1,454Updated 2 months ago
- Ko: A generic type-safe language for concurrent, stateful, deadlock-free systems and protocol manipulations☆307Updated last year
- Control Data Store☆264Updated 3 months ago
- Arc is an opinionated framework for defining data pipelines which are predictable, repeatable and manageable.☆169Updated 8 months ago
- Blb is a distributed object storage system designed for use on bare metal in cluster computing environments.☆625Updated last year