flyteorg / datacatalog
Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flyte's memoization system.
☆53Updated last year
Alternatives and similar repositories for datacatalog
Users that are interested in datacatalog are comparing it to the libraries listed below
- Control plane for Flyte. Flyteadmin is a gRPC + REST service written in Golang that uses an RDBMS to store meta information and management …☆38Updated last year
- An Apache Commons-style library in Golang, used by the Flyte project. Contains utilities for metrics, pflags, config management, storage ab…☆60Updated last year
- FlytePropeller is a Kubernetes-native operator that executes Flyte Workflows and Tasks. It has its own kubectl-flyte CLI to interact and…☆47Updated last year
- Apache Pinot Golang Client managed by StarTree☆31Updated last month
- This repo provides a starting point for building applications using SingleStore, Redpanda (by Vectorized), and the Go language. SingleSto…☆23Updated last year
- An HFile-backed Key-Value Server☆42Updated 6 years ago
- Go Client for Hive Metastore☆14Updated 2 years ago
- A distributed graph-based platform to automatically collect, discover, explore and relate multi-cluster Kubernetes resources and metadata…☆213Updated 2 years ago
- Golang based remote data frames access (over gRPC or HTTP stream)☆28Updated this week
- A Cloud Native Query Engine. Serverless, if it fits your case.☆54Updated 2 years ago
- The Flyte data-sidecar that helps move the input and output data intelligently between containers☆11Updated last year
- Airbyte is the go-sdk/cdk to help build connectors quickly in Go. This package abstracts much of the "protocol" away from the user a…☆39Updated last year
- Specification of the IR for Flyte workflows and tasks. Also Interfaces for all backend services. https://docs.flyte.org/projects/flyteidl…☆28Updated last year
- Data Catalog for Databases and Data Warehouses☆35Updated last year
- A high-performance, reliable and extensible logging agent for uploading data to Kafka, Pulsar, etc.☆184Updated last week
- TalariaDB is a distributed, highly available, and low latency time-series database for Presto☆225Updated last year
- Explore Apache Kafka data pipelines in Kubernetes.☆46Updated last month
- Beneath is a serverless real-time data platform ⚡️☆84Updated 3 years ago
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆70Updated 6 months ago
- Repository containing Cloud Storage Connectors for Apache Kafka®☆39Updated last week
- A curated list of awesome things related to the Cadence and Temporal Workflow Engines☆84Updated 4 years ago
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- Demos of Materialize, the operational data warehouse.☆52Updated 5 months ago
- Distributed SQL query engine written in Go for big data☆91Updated 7 years ago
- Connectors for capturing data from external data sources☆72Updated last week
- Pulsar Beam is a streaming service via HTTP built on Apache Pulsar.☆60Updated 3 years ago
- Serverless query engine☆140Updated 2 years ago
- Flyte Backend Plugins contributed by the Flyte community.☆29Updated last year
- Python framework for Cadence Workflow Service☆150Updated 3 years ago
- A library for Spark DataFrame using MinIO Select API☆98Updated 5 years ago