snowplow / iglu
Iglu is a machine-readable, open-source schema repository for JSON Schema from the team at Snowplow.
☆209 Updated 2 weeks ago
Alternatives and similar repositories for iglu:
Users who are interested in iglu are comparing it to the libraries listed below.
- A decisioning and response platform ☆70 Updated 3 years ago
- Avro to JSON Schema, and back ☆133 Updated 11 months ago
- Contains all JSON Schemas, Avros and Thrifts for Iglu Central ☆120 Updated this week
- Mirrors a Kinesis stream to Amazon S3 using the KCL ☆42 Updated 6 months ago
- JSONs -> JSON Schema ☆151 Updated 4 years ago
- Docker images for Snowplow, Iglu and associated projects ☆61 Updated 3 years ago
- The Schema Repo is a RESTful web service for storing and serving mappings between schema identifiers and schema definitions. ☆155 Updated 2 years ago
- DEPRECATED. PLEASE USE https://github.com/confluentinc/kafka-connect-bigquery. A Kafka Connect BigQuery sink connector ☆153 Updated last year
- Bender - Serverless ETL Framework ☆185 Updated last year
- kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3) ☆95 Updated 5 years ago
- A tool for describing pure data pipelines that enables avoiding repeating work (incrementality) and keeping old data around (provenance) ☆71 Updated 4 years ago
- Stores Snowplow enriched events in Redshift, Snowflake and Databricks ☆31 Updated last month
- Documentation tool for Avro schemas ☆148 Updated 5 years ago
- Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API. ☆111 Updated 5 years ago
- JSON schema parser for Apache Spark ☆81 Updated 2 years ago
- Google BigQuery support for Spark, SQL, and DataFrames ☆155 Updated 5 years ago
- ☆33 Updated last year
- Tools for working with parquet, impala, and hive ☆134 Updated 4 years ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an… ☆61 Updated 6 months ago
- Kinesis Connector for Structured Streaming ☆136 Updated 8 months ago
- Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt… ☆135 Updated 2 years ago
- UNRELEASED. An opinionated framework for analytics-on-write on event streams using key-value storage ☆14 Updated 9 years ago
- Library offering http based query on top of Kafka Streams Interactive Queries ☆69 Updated last year
- Scala + Druid: Scruid. A library that allows you to compose queries in Scala, and parse the result back into typesafe classes. ☆115 Updated 3 years ago
- Airflow declarative DAGs via YAML ☆132 Updated last year
- Bulletproof Apache Spark jobs with fast root cause analysis of failures. ☆72 Updated 3 years ago
- Deploy Presto on the cloud easily, using Terraform and Packer ☆44 Updated 2 years ago
- A Giter8 template for scio ☆31 Updated last month
- Ephemeral Hadoop clusters using Google Compute Platform ☆135 Updated 2 years ago
- Fast JSON to Avro converter ☆61 Updated 6 years ago