asavinov / bistro
A general-purpose data analysis engine radically changing the way batch and stream data is processed
☆7 · Updated 6 years ago
Alternatives and similar repositories for bistro
Users who are interested in bistro compare it to the libraries listed below.
- UI for interactive data analysis | https://snorkel.logv.org ☆163 · Updated last year
- Time series analysis with Apache Spark based on Chronix ☆38 · Updated 8 years ago
- The Chronix Server implementation that is based on Apache Solr. ☆265 · Updated 5 years ago
- A blazing fast ACID compliant NoSQL DataLake with support for storing 17 formats of data. Full SQL and DML capabilities along with Java s… ☆175 · Updated last year
- Feature engineering and machine learning: together at last! ☆24 · Updated 4 years ago
- A simple data consistency checker ☆30 · Updated 8 years ago
- Query engine for TrailDB ☆51 · Updated 6 years ago
- Arc is an opinionated framework for defining data pipelines which are predictable, repeatable and manageable. ☆169 · Updated last year
- Go implementation of MIDAS: Microcluster-Based Detector of Anomalies in Edge Streams ☆188 · Updated 5 years ago
- invesdwin-context modules that provide persistence features ☆43 · Updated 2 weeks ago
- HyperMinHash: Bringing intersections to HyperLogLog ☆304 · Updated 7 years ago
- The Chronix storage based on Apache Lucene ☆47 · Updated 7 years ago
- Probabilistic data structures server. The data model is key-value, where values are: Bloomfilters, LinearCounters, HyperLogLogs, CountMin… ☆25 · Updated 9 years ago
- The Apache Storm implementation of the Bullet backend ☆40 · Updated 2 years ago
- afctl helps to manage and deploy Apache Airflow projects faster and smoother. ☆130 · Updated 2 years ago
- ScalienDB is a scalable, replicated datastore. ☆86 · Updated 12 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra. ☆113 · Updated 3 years ago
- Bender - Serverless ETL Framework ☆185 · Updated last year
- A collection of datasets and databases ☆24 · Updated 6 years ago
- Integration of Samza and Luwak ☆99 · Updated 10 years ago
- ☆27 · Updated 2 years ago
- ☆76 · Updated 8 years ago
- A column oriented, embarrassingly distributed relational event database. ☆240 · Updated 7 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor… ☆41 · Updated 2 years ago
- Doradus is a REST service that extends a Cassandra NoSQL database with a graph-based data model, advanced indexing and search features, a… ☆204 · Updated 9 years ago
- Bloofi: A Java implementation of multidimensional Bloom filters ☆79 · Updated 9 years ago
- Java INtegrated Query in parlance with LINQ is an ultra minimalistic library for Java inspired from and mimicking the .NET LINQ. While LI… ☆85 · Updated 3 years ago
- Quickly detect already witnessed data. ☆157 · Updated 10 months ago
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams. ☆63 · Updated 8 years ago
- poor man's kafka (plus in-place mutations and search) ☆109 · Updated 2 years ago