retroryan / SparkAtScaleLinks
SparkAtScale
☆11Updated 9 years ago
Alternatives and similar repositories for SparkAtScale
Users that are interested in SparkAtScale are comparing it to the libraries listed below
Sorting:
- The Schema Repo is a RESTful web service for storing and serving mappings between schema identifiers and schema definitions.☆154Updated 3 years ago
- kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)☆95Updated 6 years ago
- A tool for testing the DataStax Spark Connector against Apache Cassandra or DSE☆25Updated 2 years ago
- Delimited file loader for Cassandra☆199Updated 6 years ago
- ☆26Updated 6 years ago
- Cassandra Dataset Manager☆32Updated 9 years ago
- ☆76Updated 10 years ago
- Cassandra schema migration tool for java☆100Updated 3 years ago
- Example projects for using Spark and Cassandra With DSE Analytics☆58Updated 4 months ago
- With Cassandra-3.4+ the tracing implementation can be replaced with zipkin tracing as provided by this project.☆48Updated 3 years ago
- Documentation tool for Avro schemas☆150Updated 6 years ago
- Cassandra Node Diagnostics Tools☆50Updated 8 years ago
- Tools for working with sstables☆103Updated 2 months ago
- Showcase for IoT Platform Blog☆60Updated 7 years ago
- A Bulk Data Pipeline out of Cassandra☆324Updated 6 years ago
- Tools for parsing, creating and doing other fun stuff with sstables☆163Updated 8 years ago
- Coral is a real-time analytics and data science platform. It transforms streaming events and extract patterns from data via RESTful APIs.…☆148Updated 6 years ago
- Google Dataflow Runner for Apache Flink™ (deprecated; please use the up-to-date Beam Runner)☆88Updated 9 years ago
- Generic tools and script to help operating Cassandra cluster☆56Updated 10 years ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt…☆138Updated 3 years ago
- ☆17Updated 10 years ago
- Simplify getting Zeppelin up and running☆56Updated 9 years ago
- A utility for generating Oozie workflows from a YAML definition☆49Updated 6 years ago
- https://github.com/apache/incubator-myriad is our new home. See☆253Updated 10 years ago
- Wikipedia stream-processing demo using Kafka Connect and Kafka Streams.☆74Updated 8 years ago
- Avro to JSON Schema, and back☆136Updated last year
- Fabric-based framework for deploying and managing SolrCloud clusters in the cloud.☆90Updated 6 years ago
- Random implementation notes☆33Updated 12 years ago
- Generates more or less realistic log data for testing simple aggregation queries.☆263Updated 2 years ago
- Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or what…☆96Updated 6 years ago