UBOdin / mimirLinks
Data-ish exploration through SQL+Uncertainty
☆27Updated 3 years ago
Alternatives and similar repositories for mimir
Users that are interested in mimir are comparing it to the libraries listed below
Sorting:
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆29Updated 5 years ago
- Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.☆113Updated 5 years ago
- Data-Driven Spark allows quick data exploration based on Apache Spark.☆29Updated 8 years ago
- Embedded Kafka for testing and quick prototyping.☆14Updated 9 years ago
- something to help you spark☆64Updated 7 years ago
- Rete-based rule engine in Scala☆35Updated 14 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆42Updated 2 years ago
- Cascading on Apache Flink®☆54Updated last year
- Functional, Typesafe, Declarative Data Pipelines☆139Updated 7 years ago
- Advanced Analytics Engine for NoSQL Data☆403Updated 12 years ago
- Simple Samza Job Using Confluent Platform☆14Updated 9 years ago
- [NOT MAINTAINED] DataExpress is a simple, Scala-based cross database ETL toolkit supporting Postgres, MySql, Oracle, SQLServer, and Sqlit…☆73Updated 6 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ.☆32Updated 3 years ago
- Fluent Scala DSL for Google's Cloud Dataflow SDK☆56Updated 10 years ago
- SynapseGrid is a framework for constructing dynamic low latency data flow systems.☆123Updated 4 years ago
- Twitter Streaming API Example with Kafka Streams in Scala☆49Updated 9 years ago
- Use Cascading Taps and Scalding DSL with Spark☆49Updated 8 years ago
- Provides a SQL interface to your TinkerPop enabled graph db☆75Updated 2 years ago
- Apache Amaterasu☆56Updated 6 years ago
- A collection of Apache Parquet add-on modules☆30Updated last week
- A quotation-based Scala DSL for scalable data analysis.☆63Updated 3 years ago
- A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster.☆47Updated 9 years ago
- Dependency and data pipeline management framework for Spark and Scala☆15Updated 8 years ago
- Pig on Apache Spark☆82Updated 10 years ago
- Standalone alternatives to Kafka Connect Connectors☆45Updated this week
- Use SQL to transform your avro schema/records☆28Updated 7 years ago
- Library and a Framework for building fast, scalable, fault-tolerant Data APIs based on Akka, Avro, ZooKeeper and Kafka☆24Updated 5 years ago
- Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or what…☆96Updated 6 years ago
- A collection of Scala graph libraries and adapters for graph databases.☆15Updated 8 years ago
- Scala implementation of docopt language☆37Updated 7 years ago