A New Way of Data Lake
☆47Dec 29, 2021Updated 4 years ago
Alternatives and similar repositories for StarLake
Users that are interested in StarLake are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Spark* Shuffle plugin for support shuffling through remote persistent memory over fabrics, which leverages the RDMA network and remote pe…☆14Sep 18, 2023Updated 2 years ago
- Flink Hadoop Compatibility + Elasticsearch for Apache Hadoop = Flink Connector Elasticsearch Source Table。结合flink+hadoop+es 实现的es table s…☆20Jun 28, 2021Updated 4 years ago
- Demonstration of a Hive Input Format for Iceberg☆26Mar 12, 2021Updated 5 years ago
- A tool to get better debug info on spark's memory usage☆42Aug 21, 2019Updated 6 years ago
- database☆11Aug 31, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shu…☆257Apr 7, 2023Updated 3 years ago
- 优化flink的多流操作(例如join),优化点不限于数据丢失问题,以及性能问题☆11Apr 8, 2019Updated 7 years ago
- Alerting and monitoring tool for Apache Spark☆23May 20, 2022Updated 4 years ago
- Hive-JDBC-Proxy是一个高性能的HiveServer2和Spark ThriftServer的代理服务,具备负载均衡、基于规则转发Hive JDBC Client的请求给到HiveServer2和Spark ThriftServer的能力。☆32Apr 12, 2022Updated 4 years ago
- Important experiments on memory management, file access, network transfer, job scheduler, and so on.☆15Apr 27, 2022Updated 4 years ago
- ☆29Aug 2, 2018Updated 7 years ago
- Remote Shuffle Service for Flink☆191Jan 6, 2023Updated 3 years ago
- Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.☆18May 30, 2024Updated 2 years ago
- hudi 中文文档☆37Jan 9, 2020Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Apache Spark ETL Utilities☆39Oct 23, 2024Updated last year
- A simple demo about Flink Upsert-kafka☆16Mar 11, 2021Updated 5 years ago
- 考试系统--毕业设计☆13Jan 29, 2018Updated 8 years ago
- Log driver plugin for docker explained. The boilerplate code here can also be used to write your own driver if you are feeling adventurou…☆13Mar 13, 2019Updated 7 years ago
- 3D globe data visualization tool with customizable filters and data-to-view mapping☆14Jan 10, 2019Updated 7 years ago
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,569Updated this week
- 最简单的 spark sql on kubernetes 生产环境部署方案☆19Jun 12, 2023Updated 3 years ago
- SpringCloud练习代码 包含EUREKA FEIGN ZULL HYSTRIX ZULL SPRINGCONFIG TUBINE HYSTRIXDASHBORD 以及通过sidecar 异构非java语言的微服务☆16Apr 10, 2019Updated 7 years ago
- Alluxio源码分析、学习☆14Jan 22, 2017Updated 9 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- 汇总Apache Hudi中的一些Demo,便于快速上手Apache Hudi(Apache Hudi Demos to help beginners know about Hudi)☆74Sep 13, 2020Updated 5 years ago
- ☆14Aug 7, 2021Updated 4 years ago
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.☆256Feb 21, 2023Updated 3 years ago
- A simple golang job queue☆13Jan 19, 2023Updated 3 years ago
- This project provides a reverse proxy for Spark UI on Kubernetes☆16Oct 12, 2023Updated 2 years ago
- My MSc on Data Science final project. This is a library for Data Pre-processing Algorithms for Streaming in Flink (DPASF)☆18Jul 1, 2019Updated 6 years ago
- A distributed in-memory key-value storage for billions of small objects.☆27Aug 23, 2019Updated 6 years ago
- Golang library for using persistent memory☆29Oct 7, 2022Updated 3 years ago
- Spark* shuffle plugin for support shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local-dis…☆21Mar 15, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Parse medium.com posts to markdown☆19Sep 11, 2018Updated 7 years ago
- Exposes Redis stream through the command line☆12Jun 28, 2022Updated 3 years ago
- 这是一个基于 TiDB MySQL 语法解析器的一个工具集,支持1. SQL 指纹(sql fingerprint);2. 数据库库表对比(sql diff): 对比两个数据库的库表差异,并生成源库到目标库对应的差异( DDL) 语句。☆26Jul 13, 2022Updated 3 years ago
- 一个集分布式爬虫,分布式存储,分布式计算统计分析一体的统计分析数据挖掘项目☆14Feb 6, 2018Updated 8 years ago
- An exploration of Flink and change-data-capture via flink-cdc-connectors☆11Jul 7, 2021Updated 4 years ago
- A tool automatically improving the performance of large-scale systems by finding better configuration settings☆60Feb 23, 2022Updated 4 years ago
- Databooks, the book of data.☆21Jan 4, 2023Updated 3 years ago