DataCakeCloud / PolycatLinks
Polycat is a cutting-edge cloud-native metastore system, purpose-built to cater to the demands of modern data management in lakehouse deployments. It offers a comprehensive solution for organizations that need to manage metadata from multiple data sources across different clouds, all in one unified platform.
☆18Updated last year
Alternatives and similar repositories for Polycat
Users that are interested in Polycat are comparing it to the libraries listed below
Sorting:
- Remote Shuffle Service for Flink☆191Updated 3 years ago
- ☆393Updated last year
- Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.☆1,025Updated this week
- Compass is a task diagnosis platform for bigdata☆404Updated last year
- Web ui for Apache Paimon.☆140Updated last year
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…☆256Updated 2 years ago
- alibabacloud-maxcompute-tool-migrate☆30Updated last year
- Testing Sandbox for Hadoop Ecosystem Components☆43Updated last month
- Uniffle is a high performance, general purpose Remote Shuffle Service.☆436Updated last week
- The Lineage Analysis system for FlinkSQL supports advanced syntax such as Watermark, UDTF, CEP, Windowing TVFs, and CTAS.☆410Updated last month
- Spark Connector for Apache Doris☆103Updated 3 weeks ago
- 剥离的模块,用于查看Spark SQL生成的语法树☆91Updated 6 years ago
- Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.☆262Updated last year
- 汇总Apache Iceberg相关的最新文章、资料以及Demo等☆32Updated 4 years ago
- Apache Paimon Website☆17Updated this week
- 汇总Apache Hudi相关资料☆560Updated last week
- Shuttle:High Available, High Performance Remote Shuffle Service☆157Updated 2 years ago
- ☆23Updated 7 years ago
- Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.☆285Updated last month
- Gluten: Plugin to Boost Trino's Performance☆76Updated 2 years ago
- ☆106Updated 2 years ago
- Trino Connector for Apache Paimon.☆40Updated 2 weeks ago
- Spark ClickHouse Connector build on DataSourceV2 API☆210Updated last week
- ☆21Updated 2 years ago
- Benchmarks for Apache Flink☆182Updated last week
- spark 字段血缘 spark field lineage☆32Updated 3 years ago
- Smart Storage Management for Big Data, a comprehensive hot/cold data optimized solution☆140Updated 3 years ago
- 基于antlr4的sql解析,实现格式化,元数据,血源等自定义解析,包括hive☆111Updated 3 years ago
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆877Updated last month
- Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.☆1,095Updated this week