snowplow-archive / schema-guru
JSONs -> JSON Schema
☆151Updated 4 years ago
Alternatives and similar repositories for schema-guru:
Users that are interested in schema-guru are comparing it to the libraries listed below
- A decisioning and response platform☆70Updated 3 years ago
- Iglu is a machine-readable, open-source schema repository for JSON Schema from the team at Snowplow☆209Updated last week
- Tools for working with parquet, impala, and hive☆134Updated 4 years ago
- Contains all JSON Schemas, Avros and Thrifts for Iglu Central☆120Updated last week
- Apache Spark AWS Lambda Executor (SAMBA)☆44Updated 6 years ago
- Mirrors a Kinesis stream to Amazon S3 using the KCL☆42Updated 6 months ago
- A Kafka-Connect Sink for S3 with no Hadoop dependencies.☆57Updated 2 years ago
- Avro to JSON Schema, and back☆133Updated 11 months ago
- Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.☆111Updated 5 years ago
- Amazon Kinesis Aggregators provides a simple way to create real time aggregations of data on Amazon Kinesis.☆150Updated 3 years ago
- Redshift Ops Console☆92Updated 9 years ago
- SQL for many helpful Redshift UDFs, and the scripts for generating and testing those UDFs☆125Updated 6 years ago
- Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or what…☆95Updated 5 years ago
- kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)☆95Updated 5 years ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt…☆135Updated 2 years ago
- Integration of Samza and Luwak☆99Updated 10 years ago
- Unofficial Apache NiFi Docker images☆50Updated 7 years ago
- Scheduled task execution on top of AWS Data Pipeline☆43Updated 10 years ago
- Cubes over ElasticSearch. Aggregation library for Business Intelligence☆20Updated 10 years ago
- Kafka Connect Connector for Jenkins Open Source Continuous Integration Tool☆30Updated 2 years ago
- An easily-deployable, single-instance version of Snowplow☆127Updated 2 months ago
- A schema store service that tracks and manages all the schemas used in the Data Pipeline☆87Updated 4 years ago
- Docker image for AirBnB's Caravel☆34Updated 8 years ago
- Bender - Serverless ETL Framework☆185Updated last year
- The Schema Repo is a RESTful web service for storing and serving mappings between schema identifiers and schema definitions.☆156Updated 2 years ago
- Run templatable playbooks of Hadoop/Spark/et al jobs on Amazon EMR☆19Updated 11 months ago
- Spotify's Luigi + Amazon's SWF integration☆16Updated 9 years ago
- Experiments in Streaming☆60Updated 8 years ago
- Supporting material (code, schemas etc) for Unified Log Processing (Manning Publications)☆98Updated 2 years ago
- ☆33Updated last year