SparkSQL自定义Hint优化器解决热点数据导致JOIN数据倾斜问题
☆48Jan 4, 2019Updated 7 years ago
Alternatives and similar repositories for spark-skewed-join-hint
Users that are interested in spark-skewed-join-hint are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆17Mar 19, 2024Updated 2 years ago
- Testing Sandbox for Hadoop Ecosystem Components☆45Jun 16, 2026Updated last week
- A re-implementation of Hadoop DistCP in Apache Spark☆47Dec 20, 2023Updated 2 years ago
- An Extensible Data Skipping Framework☆50Jul 15, 2025Updated 11 months ago
- Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.☆260May 12, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Sep 8, 2022Updated 3 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆63Sep 4, 2023Updated 2 years ago
- fast spark local mode☆35Aug 20, 2018Updated 7 years ago
- fastdfs spring boot start☆10Mar 1, 2020Updated 6 years ago
- A Full RPC Framework Based on Netty.☆14May 19, 2018Updated 8 years ago
- flink sql☆11Jun 21, 2022Updated 4 years ago
- Apache Flink自述理解与代码☆13Mar 9, 2019Updated 7 years ago
- A playground for experimenting ideas that may apply to Spark SQL/Catalyst☆143Jul 5, 2018Updated 7 years ago
- 基于TBSchedule开发的一个分布式任务调度框架,可以解析任务间的依赖,并执行任务(执行Shell、bat脚本)☆12Aug 5, 2016Updated 9 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 基于 Flink 的 sqlSubmit 程序☆144Jun 18, 2026Updated last week
- init☆11Sep 30, 2017Updated 8 years ago
- Alerting and monitoring tool for Apache Spark☆23May 20, 2022Updated 4 years ago
- 基于 MobileNet 模型, 使用 Tensorflow 的 Java API 进行图片的分类以及图形内物体识别。☆14Nov 12, 2018Updated 7 years ago
- Use maven-assembly-plugin to package a spring boot project into a non-fat jar☆10Jul 24, 2017Updated 8 years ago
- On the fly, translation of Spark programs to run natively on your Oracle DB. Your Spark programs require no changes.☆35Apr 15, 2025Updated last year
- 优化flink的多流操作(例如join),优化点不限于数据丢失问题,以及性能问题☆11Apr 8, 2019Updated 7 years ago
- ☆10May 25, 2017Updated 9 years ago
- A simple project used to submit a Flink SQL script☆373Sep 2, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 已经合入(apache/incubator-kyuubi) ACL Management for Apache Spark SQL with Apache Ranger.☆58Nov 11, 2021Updated 4 years ago
- Stream computing platform for bigdata☆406Apr 24, 2024Updated 2 years ago
- Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.☆1,053Updated this week
- ☆233Sep 15, 2022Updated 3 years ago
- Simple Go 1.8 plugin test for https://jeremywho.com/go-1.8---plugins/☆10Feb 28, 2017Updated 9 years ago
- flink rest api的spring-boot-starter☆17Jun 14, 2023Updated 3 years ago
- HDFS based on Java implementation as a remote ObjectStore for DataFusion☆10Feb 13, 2024Updated 2 years ago
- Paimon-cpp is a high-performance C++ implementation of Apache Paimon.☆122Updated this week
- 算法工程师 从零到一 https://dongfengchi.github.io/algo_engineer_zero_to_one/☆16Mar 9, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆116Apr 21, 2023Updated 3 years ago
- Cloudera CDP SDK for Java☆17Jun 8, 2026Updated 2 weeks ago
- RocketMQ Java 实现各种消息方式☆11Sep 15, 2017Updated 8 years ago
- Hive Ha server for multi hive backend,support SQL detect hive server alive☆15Aug 20, 2013Updated 12 years ago
- hive sql parser☆11Aug 27, 2014Updated 11 years ago
- Deformable Convolutional Networks v2 with Keras and Tensorflow1.x☆19Dec 3, 2020Updated 5 years ago
- Template for a DuckDB-based, Codespace-oriented sandbox project that is also dbt Cloud compatible, and includes code-first BI tooling via…☆17Apr 7, 2023Updated 3 years ago