asavinov / bistro
A general-purpose data analysis engine radically changing the way batch and stream data is processed
☆7 Updated 6 years ago
Alternatives and similar repositories for bistro
Users who are interested in bistro are comparing it to the libraries listed below.
- HyperMinHash: Bringing intersections to HyperLogLog ☆303 Updated 7 years ago
- The Chronix Server implementation that is based on Apache Solr. ☆265 Updated 5 years ago
- A blazing fast ACID compliant NoSQL DataLake with support for storing 17 formats of data. Full SQL and DML capabilities along with Java s… ☆175 Updated last year
- Time series analysis with Apache Spark based on Chronix ☆38 Updated 8 years ago
- Probabilistic data structures server. The data model is key-value, where values are: Bloomfilters, LinearCounters, HyperLogLogs, CountMin… ☆25 Updated 9 years ago
- A simple data consistency checker ☆30 Updated 8 years ago
- invesdwin-context modules that provide persistence features ☆43 Updated last month
- Go implementation of MIDAS: Microcluster-Based Detector of Anomalies in Edge Streams ☆188 Updated 5 years ago
- UI for interactive data analysis | https://snorkel.logv.org ☆163 Updated last year
- opensource distributed database with base JPA implementation and event processing support ☆75 Updated last year
- A column oriented, embarrassingly distributed relational event database. ☆240 Updated 7 years ago
- Arc is an opinionated framework for defining data pipelines which are predictable, repeatable and manageable. ☆169 Updated last year
- ☆27 Updated 3 years ago
- The Apache Storm implementation of the Bullet backend ☆40 Updated 2 years ago
- Query engine for TrailDB ☆51 Updated 6 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra. ☆113 Updated 3 years ago
- The Chronix storage based on Apache Lucene ☆47 Updated 7 years ago
- A learned index structure ☆53 Updated 4 years ago
- Wayeb is a Complex Event Processing and Forecasting (CEP/F) engine written in Scala. ☆150 Updated last year
- Demonstrating the importance of laying out data in memory. ☆47 Updated 7 years ago
- poor man's kafka (plus in-place mutations and search) ☆109 Updated 2 years ago
- Integration of Samza and Luwak ☆99 Updated 10 years ago
- ☆76 Updated 8 years ago
- jpgAgent - A PostgreSQL job scheduler. ☆45 Updated 2 years ago
- Quickly detect already witnessed data. ☆157 Updated 11 months ago
- Monitoring and back pressure for task execution ☆17 Updated 7 years ago
- A collection of datasets and databases ☆24 Updated 7 years ago
- A highly configurable Google Cloud Dataflow pipeline that writes data into Google Big Query table from Pub/Sub ☆67 Updated 7 years ago
- Bloofi: A java implementation of multidimensional Bloom filters ☆79 Updated 9 years ago
- Compilation and rule-based optimization framework for relational algebra. Raco is the language, optimization, and query translation layer… ☆72 Updated 7 years ago