retroryan / SparkAtScale
SparkAtScale
☆11Updated 8 years ago
Alternatives and similar repositories for SparkAtScale
Users who are interested in SparkAtScale are comparing it to the libraries listed below.
- kafka-connect-s3: Ingest data from Kafka to object stores (S3)☆94Updated 6 years ago
- The Schema Repo is a RESTful web service for storing and serving mappings between schema identifiers and schema definitions.☆155Updated 3 years ago
- ☆76Updated 10 years ago
- https://github.com/apache/incubator-myriad is our new home. See☆253Updated 9 years ago
- ☆26Updated 5 years ago
- Example projects for using Spark and Cassandra With DSE Analytics☆58Updated 2 years ago
- Cassandra Node Diagnostics Tools☆51Updated 7 years ago
- Showcase for IoT Platform Blog☆60Updated 6 years ago
- A Bulk Data Pipeline out of Cassandra☆323Updated 6 years ago
- Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or what…☆96Updated 5 years ago
- ☆17Updated 9 years ago
- Fabric-based framework for deploying and managing SolrCloud clusters in the cloud.☆90Updated 6 years ago
- Kafka consumer lag-checking application for monitoring, written in Scala and Akka HTTP; a wrap around the Kafka consumer group command. I…☆197Updated 2 years ago
- Coral is a real-time analytics and data science platform. It transforms streaming events and extracts patterns from data via RESTful APIs.…☆147Updated 5 years ago
- ☆68Updated 9 years ago
- Wikipedia stream-processing demo using Kafka Connect and Kafka Streams.☆75Updated 7 years ago
- DataStax Enterprise running in a Docker Container☆47Updated 7 years ago
- Cassandra Dataset Manager☆32Updated 8 years ago
- Examples on how to use the command line tools in Avro Tools to read and write Avro files☆154Updated last year
- Documentation tool for Avro schemas☆149Updated 5 years ago
- Cassandra schema migration tool for Java☆98Updated 3 years ago
- A tool for testing the DataStax Spark Connector against Apache Cassandra or DSE☆25Updated 2 years ago
- Hadoop output committers for S3☆109Updated 5 years ago
- [DEPRECATED] Script used to manage Hadoop and Spark instances on Google Compute Engine☆109Updated 5 years ago
- [PROJECT IS NO LONGER MAINTAINED] Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a …☆328Updated 3 years ago
- Google Dataflow Runner for Apache Flink™ (deprecated; please use the up-to-date Beam Runner)☆88Updated 9 years ago
- Delimited file loader for Cassandra☆198Updated 5 years ago
- Mirror of Apache Apex malhar☆132Updated 5 years ago
- ☆22Updated 8 years ago
- Cassandra Utilities from ProtectWise☆31Updated 7 years ago