flyteorg / datacatalogLinks
Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flytes memoization system
☆53Updated 2 years ago
Alternatives and similar repositories for datacatalog
Users that are interested in datacatalog are comparing it to the libraries listed below
Sorting:
- Control Plane for Flyte. Flyteadmin is a gRPC + REST Service written in golang and uses a RDBMs to store meta information and management …☆39Updated 2 years ago
- A apache commons style library in Golang, use by the Flyte project. Contains utilities for metrics, pflags, config management, storage ab…☆60Updated 2 years ago
- A distributed graph-based platform to automatically collect, discover, explore and relate multi-cluster Kubernetes resources and metadata…☆213Updated 2 years ago
- FlytePropeller is a Kubernetes native operator, that executes Flyte Workflows and Tasks. It has its own kubectl-flyte CLI to interact and…☆47Updated 2 years ago
- Apache Pinot Golang Client managed by StarTree☆33Updated 2 weeks ago
- The Flyte data-sidecar that helps move the input and output data intelligently between containers☆10Updated 2 years ago
- Airbyte is the go-sdk/cdk to help build connectors quickly in go. This package abstracts away much of the "protocol" away from the user a…☆41Updated last year
- Go Client for Hive Metastore☆14Updated 3 years ago
- An HFile-backed Key-Value Server☆43Updated 6 years ago
- A Cloud Native Query Engine. Serverless, if it fits your case.☆54Updated 2 years ago
- Altinity Dashboard helps you manage ClickHouse installations controlled by clickhouse-operator.☆68Updated last week
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆60Updated 3 years ago
- Kafka replicator is a tool used to mirror and backup Kafka topics across regions☆17Updated 2 years ago
- Beneath is a serverless real-time data platform ⚡️☆84Updated 3 years ago
- Pulsar Beam is a streaming service via HTTP built on Apache Pulsar.☆60Updated 3 years ago
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor …☆70Updated 5 months ago
- Golang based remote data frames access (over gRPC or HTTP stream)☆28Updated last month
- A library for Spark DataFrame using MinIO Select API☆99Updated 6 years ago
- A high-performance, reliable and extensible logging agent for uploading data to Kafka, Pulsar, etc.☆185Updated 2 weeks ago
- A tool for describing pure data pipelines that enables avoiding repeating work (incrementality) and keeping old data around (provenance)☆72Updated 5 years ago
- States Language on Cadence☆63Updated 6 years ago
- A home for LinkedIn's changes to Apache Iceberg☆63Updated 3 weeks ago
- Dremio Flight connector. Access Dremio using Arrow flight☆39Updated 5 years ago
- An Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes☆53Updated 5 years ago
- Export Airflow metrics (from mysql) in prometheus format☆29Updated 9 months ago
- This repo provides a starting point for building applications using SingleStore, Redpanda (by Vectorized), and the Go language. SingleSto…☆23Updated last year
- A Golang client for RedisAI☆25Updated last year
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 3 years ago
- Kubernetes (K8s) Operator for PrestoDB☆46Updated 4 years ago
- Cloud Storage Connector integrates Apache Pulsar with cloud storage.☆29Updated last month