flyteorg / datacatalog
Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flyte's memoization system.
☆54Updated last year
Alternatives and similar repositories for datacatalog:
Users who are interested in datacatalog are comparing it to the libraries listed below.
- Control Plane for Flyte. Flyteadmin is a gRPC + REST service written in Golang that uses an RDBMS to store meta information and management …☆39Updated last year
- An Apache Commons-style library in Golang, used by the Flyte project. Contains utilities for metrics, pflags, config management, storage ab…☆60Updated last year
- The Flyte data-sidecar that helps move the input and output data intelligently between containers☆10Updated last year
- FlytePropeller is a Kubernetes-native operator that executes Flyte Workflows and Tasks. It has its own kubectl-flyte CLI to interact and…☆46Updated last year
- Flyte Backend Plugins contributed by the Flyte community.☆28Updated last year
- Specification of the IR for Flyte workflows and tasks. Also Interfaces for all backend services. https://docs.flyte.org/projects/flyteidl…☆28Updated last year
- Go Client for Hive Metastore☆14Updated 2 years ago
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆62Updated 3 months ago
- Data Catalog for Databases and Data Warehouses☆33Updated last year
- Apache Pinot Golang Client managed by StarTree☆28Updated last week
- Demos of Materialize, the operational data warehouse.☆51Updated last week
- Presto & Alluxio Dockers for blazing fast analytics☆13Updated 5 years ago
- Highly configurable Helm Presto Chart☆24Updated 5 years ago
- Trino (f.k.a PrestoSQL) dialect for SQLAlchemy.☆25Updated 2 years ago
- Golang based remote data frames access (over gRPC or HTTP stream)☆28Updated last month
- Bigtable data source for Apache Arrow DataFusion☆23Updated 2 years ago
- Pulsar weekly community update☆10Updated 2 years ago
- A library for Spark DataFrame using MinIO Select API☆97Updated 5 years ago
- Kubernetes (K8s) Operator for PrestoDB☆46Updated 3 years ago
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- An Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes☆53Updated 4 years ago
- Kafka replicator is a tool used to mirror and backup Kafka topics across regions☆15Updated 2 years ago
- Beneath is a serverless real-time data platform ⚡️☆84Updated 3 years ago
- Demonstration of a Hive Input Format for Iceberg☆26Updated 4 years ago
- Traffic routing for Trino Clusters☆26Updated 2 weeks ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆97Updated 2 years ago
- Cloud Storage Connector integrates Apache Pulsar with cloud storage.☆27Updated this week
- [ARCHIVED] The Presto adapter plugin for dbt Core☆33Updated last year
- Dom's Data Build Tool☆69Updated last year