grailbio / reflow
A language and runtime for distributed, incremental data processing in the cloud
☆967Updated last year
Alternatives and similar repositories for reflow:
Users that are interested in reflow are comparing it to the libraries listed below
- Bigmachine is a library for self-managing serverless computing in Go☆201Updated last year
- Robust, flexible and resource-efficient pipelines using Go and the commandline☆1,095Updated 8 months ago
- A serverless cluster computing system for the Go programming language☆553Updated last year
- An open source platform for managing and analyzing biomedical big data☆402Updated this week
- Build platforms that flexibly mix SQL, batch, and stream processing paradigms☆746Updated this week
- Repository for the CWL standards. Use https://cwl.discourse.group/ for support 😊☆1,463Updated 4 months ago
- 🐎 A serverless MapReduce framework written for AWS Lambda☆694Updated 3 years ago
- A scalable, efficient, cross-platform (Linux/macOS) and easy-to-use workflow engine in pure Python.☆909Updated this week
- Quilt is a data mesh for connecting people with actionable data☆1,334Updated this week
- Scientific workflow engine designed for simplicity & scalability. Trivially transition between one off use cases to massive scale product…☆1,013Updated this week
- HyperMinHash: Bringing intersections to HyperLogLog☆304Updated 7 years ago
- MacroBase: A Search Engine for Fast Data☆666Updated 2 years ago
- HyperLogLog with lots of sugar (Sparse, LogLog-Beta bias correction and TailCut space reduction) brought to you by Axiom☆972Updated last month
- A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Doc…☆2,525Updated last year
- columnar storage + NoSQL OLAP engine | https://logv.org☆306Updated 7 months ago
- UI for interactive data analysis | https://snorkel.logv.org☆163Updated last year
- Distributed Stream Processing☆1,479Updated 4 years ago
- Language and framework for high performance computational pipelines.☆149Updated this week
- Berkeley Tree Database (BTrDB) server☆911Updated 3 years ago
- Vectorized processing for Apache Arrow☆485Updated 3 years ago
- Sparser: Raw Filtering for Faster Analytics over Raw Data☆432Updated 6 years ago
- Distributed Named Pipes☆455Updated 7 years ago
- Unified Resource Scheduler to co-schedule mixed types of workloads such as batch, stateless and stateful jobs in a single cluster for bet…☆647Updated last year
- Open-source command-line tool to run batch computing tasks and workflows on backend services such as Google Cloud.☆268Updated 7 months ago
- TileDB☆79Updated last year
- A key/value store for serving static batch data☆175Updated last year
- ZetaSQL - Analyzer Framework for SQL☆2,387Updated 3 weeks ago
- Ko: A generic type-safe language for concurrent, stateful, deadlock-free systems and protocol manipulations☆307Updated last year
- biogo is a bioinformatics library for Go☆394Updated last year
- A golang expression evaluator & Library to build SQL query engine based functionality.☆867Updated last year