flyteorg / datacatalog
Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flyte's memoization system.
☆54 Updated last year
Related projects
Alternatives and complementary repositories for datacatalog
- Control Plane for Flyte. Flyteadmin is a gRPC + REST service written in Go that uses an RDBMS to store meta information and management … ☆39 Updated last year
- The Flyte data-sidecar that helps move the input and output data intelligently between containers ☆10 Updated last year
- Data Catalog for Databases and Data Warehouses ☆31 Updated 10 months ago
- FlytePropeller is a Kubernetes-native operator that executes Flyte Workflows and Tasks. It has its own kubectl-flyte CLI to interact and… ☆47 Updated last year
- Flyte Backend Plugins contributed by the Flyte community. ☆28 Updated last year
- Specification of the IR for Flyte workflows and tasks. Also interfaces for all backend services. https://docs.flyte.org/projects/flyteidl… ☆28 Updated last year
- Dremio Flight connector. Access Dremio using Arrow Flight ☆40 Updated 3 years ago
- Apache Pinot Golang Client managed by StarTree ☆28 Updated 7 months ago
- Presto & Alluxio Dockers for blazing fast analytics ☆13 Updated 5 years ago
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor… ☆66 Updated 8 months ago
- Demos of Materialize, the operational data warehouse. ☆50 Updated 2 months ago
- A library for Spark DataFrame using MinIO Select API ☆96 Updated 5 years ago
- Go Client for Hive Metastore ☆14 Updated last year
- Cloud Storage Connector integrates Apache Pulsar with cloud storage. ☆28 Updated this week
- Connectors for capturing data from external data sources ☆50 Updated this week
- Trino (f.k.a. PrestoSQL) dialect for SQLAlchemy. ☆25 Updated 2 years ago
- Dom's Data Build Tool ☆69 Updated last year
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating) ☆62 Updated 6 months ago
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started! ☆62 Updated 2 years ago
- DynoYARN is a framework to run simulated YARN clusters and workloads for YARN scale testing. ☆58 Updated last year
- Serverless multi-protocol + multi-destination event collection system. ☆195 Updated last month
- Beneath is a serverless real-time data platform ⚡️ ☆83 Updated 2 years ago
- Apiary provides modules which can be combined to create a federated cloud data lake ☆36 Updated 7 months ago
- A testing framework for Trino ☆26 Updated this week
- Highly configurable Helm Presto Chart ☆24 Updated 5 years ago
- The sane way of building a data layer in Airflow ☆24 Updated 4 years ago
- This repo provides a starting point for building applications using SingleStore, Redpanda (by Vectorized), and the Go language. SingleSto… ☆22 Updated 8 months ago
- Export Airflow metrics (from MySQL) in Prometheus format ☆29 Updated 2 years ago
- ☆60 Updated this week