uwescience / myria
Myria is a scalable Analytics-as-a-Service platform based on relational algebra.
☆113Updated 3 years ago
Alternatives and similar repositories for myria:
Users that are interested in myria are comparing it to the libraries listed below
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆51Updated 7 years ago
- Compilation and rule-based optimization framework for relational algebra. Raco is the language, optimization, and query translation layer…☆72Updated 7 years ago
- ☆92Updated 9 years ago
- Quark is a data virtualization engine over analytic databases.☆98Updated 7 years ago
- People. Places. Things. Graphs.☆92Updated 10 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- Probabilistic data structures server. The data model is key-value, where values are: Bloomfilters, LinearCounters, HyperLogLogs, CountMin…☆25Updated 9 years ago
- Secondary index on HBase☆18Updated 9 years ago
- SociaLite: query language for large-scale graph analysis and data mining☆110Updated 8 years ago
- zenvisage's foundational framework☆69Updated 2 years ago
- Bloofi: A java implementation of multidimensional Bloom filters☆79Updated 9 years ago
- SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams.☆426Updated 8 years ago
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆63Updated 8 years ago
- Graph Analytics Engine☆260Updated 10 years ago
- FlashX is a collection of big data analytics tools that perform data analytics in the form of graphs and matrices.☆233Updated 4 years ago
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka☆29Updated 8 years ago
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments.☆24Updated 8 years ago
- Cascading on Apache Flink®☆54Updated last year
- Provides a SQL interface to your TinkerPop enabled graph db☆74Updated last year
- Muppet☆126Updated 3 years ago
- Distributed DataFrame: Productivity = Power x Simplicity For Scientists & Engineers, on any Data Engine☆168Updated 4 years ago
- Simulating the performance of various streaming algorithms. #experimentalmathematics☆59Updated 7 years ago
- Large scale query engine benchmark☆99Updated 8 years ago
- Code to allow running BIDMach on Spark including HDFS integration and lightweight sparse model updates (Kylix).☆15Updated 4 years ago
- ☆95Updated 9 years ago
- The Musketeer workflow manager.☆41Updated 6 years ago
- Reduce your data. A unix filter for algebird-powered aggregation.☆138Updated 7 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 9 years ago
- Graphulo: Accumulo library of matrix math primitives and graph algorithms☆78Updated 10 months ago