asavinov / bistro
A general-purpose data analysis engine radically changing the way batch and stream data is processed
☆7Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for bistro
- BigTable, Document and Graph Database with Full Text Search☆185Updated 6 years ago
- The Chronix Server implementation that is based on Apache Solr.☆264Updated 5 years ago
- Feature engineering and machine learning: together at last!☆23Updated 3 years ago
- A blazing fast ACID compliant NoSQL DataLake with support for storing 17 formats of data. Full SQL and DML capabilities along with Java s…☆174Updated 8 months ago
- Time series analysis with Apache Spark based on Chronix |☆38Updated 7 years ago
- UI for interactive data analysis | https://snorkel.logv.org☆161Updated 8 months ago
- invesdwin-context modules that provide persistence features☆43Updated this week
- Wayeb is a Complex Event Processing and Forecasting (CEP/F) engine written in Scala.☆150Updated last year
- Arc is an opinionated framework for defining data pipelines which are predictable, repeatable and manageable.☆169Updated 9 months ago
- A simple data consistency checker☆30Updated 7 years ago
- Java INtegrated Query in parlance with LINQ is an ultra minimalistic library for Java inspired from and mimicking the .NET LINQ. While LI…☆85Updated 2 years ago
- Go implementation of MIDAS: Microcluster-Based Detector of Anomalies in Edge Streams☆187Updated 4 years ago
- HyperMinHash: Bringing intersections to HyperLogLog☆302Updated 6 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc.☆57Updated 3 years ago
- afctl helps to manage and deploy Apache Airflow projects faster and smoother.☆130Updated 2 years ago
- Probabilistic data structures server. The data model is key-value, where values are: Bloomfilters, LinearCounters, HyperLogLogs, CountMin…☆24Updated 8 years ago
- A collection of datasets and databases☆24Updated 6 years ago
- The Apache Storm implementation of the Bullet backend☆40Updated last year
- GHCJS front-end for queryparser☆80Updated 6 years ago
- poor man's kafka (plus in-place mutations and search)☆110Updated 2 years ago
- A totally proof-of-concept FoundationDB based network block device backend☆115Updated 6 years ago
- Query engine for TrailDB☆51Updated 5 years ago
- A column oriented, embarrassingly distributed relational event database.☆240Updated 6 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra.☆113Updated 3 years ago
- A Directed Acyclic Graph task dependency scheduler designed to simplify complex distributed pipelines☆130Updated 6 years ago
- A highly configurable Google Cloud Dataflow pipeline that writes data into Google Big Query table from Pub/Sub☆67Updated 6 years ago
- A learned index structure☆52Updated 3 years ago