UBOdin / mimir
Data-ish exploration through SQL+Uncertainty
☆27 Updated 2 years ago
Alternatives and similar repositories for mimir:
Users who are interested in mimir compare it to the libraries listed below.
- A collection of Scala graph libraries and adapters for graph databases. ☆15 Updated 8 years ago
- Use Cascading Taps and Scalding DSL with Spark ☆49 Updated 8 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ. ☆32 Updated 2 years ago
- Embedded Kafka for testing and quick prototyping. ☆14 Updated 9 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor… ☆41 Updated 2 years ago
- Example code for building your own MemSQL Streamliner Pipelines ☆23 Updated 8 years ago
- Example project which simulates an interesting analytics use case using MemSQL Pipelines. ☆14 Updated 8 years ago
- Advanced Analytics Engine for NoSQL Data ☆402 Updated 11 years ago
- Dependency and data pipeline management framework for Spark and Scala ☆15 Updated 8 years ago
- An example project for doing grid search in MLlib ☆13 Updated 10 years ago
- Provided Guidance on Creating End to End Solutions for Common SILK Use Cases ☆13 Updated 9 years ago
- Utilities for writing tests that use Apache Spark. ☆24 Updated 6 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌 ☆28 Updated 4 years ago
- Scriptable scheduler for periodical Hadoop workflows ☆22 Updated 7 years ago
- Spark Streaming ETL jobs for Mozilla Telemetry ☆18 Updated 5 years ago
- [NOT MAINTAINED] DataExpress is a simple, Scala-based cross database ETL toolkit supporting Postgres, MySql, Oracle, SQLServer, and Sqlit… ☆72 Updated 5 years ago
- ☆39 Updated 8 years ago
- A library to store metadata of relational databases including the schema, statistics, and integrity constraints. ☆25 Updated 6 years ago
- phData Pulse application log aggregation and monitoring ☆13 Updated 5 years ago
- something to help you spark ☆65 Updated 6 years ago
- Argument parsing in Scala ☆83 Updated 2 years ago
- Cascading on Apache Flink® ☆54 Updated last year
- Rete-based rule engine in Scala ☆35 Updated 13 years ago
- A quotation-based Scala DSL for scalable data analysis. ☆63 Updated 2 years ago
- Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API. ☆111 Updated 5 years ago
- Scala library for accessing various file, batch systems, job schedulers and grid middlewares. ☆27 Updated last week
- Diff tables taking account of their structure ☆11 Updated last year
- ETL orchestration platform with recoverability and process monitoring features ☆9 Updated 7 years ago
- Library and a Framework for building fast, scalable, fault-tolerant Data APIs based on Akka, Avro, ZooKeeper and Kafka ☆25 Updated 4 years ago
- A scalable, distributed Time Series Database. ☆28 Updated 10 years ago