flyteorg / datacatalogLinks
Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flytes memoization system
☆54Updated last year
Alternatives and similar repositories for datacatalog
Users that are interested in datacatalog are comparing it to the libraries listed below
Sorting:
- Control Plane for Flyte. Flyteadmin is a gRPC + REST Service written in golang and uses a RDBMs to store meta information and management …☆39Updated last year
- A apache commons style library in Golang, use by the Flyte project. Contains utilities for metrics, pflags, config management, storage ab…☆60Updated last year
- The Flyte data-sidecar that helps move the input and output data intelligently between containers☆10Updated last year
- FlytePropeller is a Kubernetes native operator, that executes Flyte Workflows and Tasks. It has its own kubectl-flyte CLI to interact and…☆46Updated last year
- Flyte Backend Plugins contributed by the Flyte community.☆29Updated last year
- Repository containing Cloud Storage Connectors for Apache Kafka®☆35Updated last week
- Specification of the IR for Flyte workflows and tasks. Also Interfaces for all backend services. https://docs.flyte.org/projects/flyteidl…☆28Updated last year
- Highly configurable Helm Presto Chart☆24Updated 5 years ago
- Apache Pinot Golang Client managed by StarTree☆31Updated 2 months ago
- Presto & Alluxio Dockers for blazing fast analytics☆13Updated 5 years ago
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆61Updated 6 months ago
- Data Catalog for Databases and Data Warehouses☆35Updated last year
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- Cloud Storage Connector integrates Apache Pulsar with cloud storage.☆28Updated last month
- Kubernetes (K8s) Operator for PrestoDB☆46Updated 3 years ago
- Pulsar weekly community update☆10Updated 3 years ago
- Explore Apache Kafka data pipelines in Kubernetes.☆46Updated 3 months ago
- An Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes☆53Updated 5 years ago
- Continuously synchronize directories from remote object store to local filesystem☆105Updated 3 months ago
- A tool for describing pure data pipelines that enables avoiding repeating work (incrementality) and keeping old data around (provenance)☆72Updated 5 years ago
- Opinionated serverless event analytics pipeline☆43Updated 2 years ago
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- Export Airflow metrics (from mysql) in prometheus format☆29Updated 2 months ago
- Airflow on Kubernetes Operator☆88Updated 2 years ago
- An HFile-backed Key-Value Server☆42Updated 6 years ago
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆60Updated 2 years ago
- A testing framework for Trino☆26Updated 3 months ago
- Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.☆79Updated this week
- States Language on Cadence☆63Updated 5 years ago
- A library for Spark DataFrame using MinIO Select API☆98Updated 5 years ago