Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
☆16May 15, 2026Updated 3 weeks ago
Alternatives and similar repositories for orc-format
Users that are interested in orc-format are comparing it to the libraries listed below.
- A JupyterHub authenticator using Kerberos☆12Jun 2, 2026Updated last week
- HiveQL Jupyter Kernel☆10Aug 5, 2022Updated 3 years ago
- Rust implementation of Apache ORC☆29Apr 29, 2026Updated last month
- Tracks field-level lineage in Spark SQL☆21Nov 11, 2024Updated last year
- TPC-H Benchmark on Cloudera Impala☆19Apr 25, 2013Updated 13 years ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆16May 22, 2026Updated 2 weeks ago
- Spark Shuffle Optimization with RDMA+AEP☆30May 23, 2023Updated 3 years ago
- Apache Kyuubi Site☆13May 30, 2026Updated last week
- PostgreSQL and GreenPlum Data Source for Apache Spark☆35May 6, 2026Updated last month
- ACL Management for Apache Spark SQL with Apache Ranger☆17Jun 18, 2020Updated 5 years ago
- SFTP server that runs on top of HDFS. It is based on Apache sshd and accesses and operates on HDFS through the SFTP protocol☆15Aug 18, 2023Updated 2 years ago
- general collection of notes☆10Oct 8, 2018Updated 7 years ago
- Apache Iceberg C++☆207Jun 1, 2026Updated last week
- NetEase Spark Courses☆15Sep 4, 2018Updated 7 years ago
- ML-aided Query Optimizer☆17May 31, 2024Updated 2 years ago
- Apache CarbonData source code reading☆61Feb 12, 2020Updated 6 years ago
- Spark* shuffle plugin that supports shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local-dis…☆21Mar 15, 2024Updated 2 years ago
- Alerting and monitoring tool for Apache Spark☆23May 20, 2022Updated 4 years ago
- Performance optimization for Spark running on Kubernetes☆88Aug 18, 2020Updated 5 years ago
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | The project has been migrated to Apa…☆184Apr 6, 2022Updated 4 years ago
- RaptorJIT: a dynamic system programming language (manuscript)☆17Jun 4, 2019Updated 7 years ago
- Cloyster HPC is a turnkey HPC cluster solution with a user-friendly installer☆10Apr 16, 2026Updated last month
- Lustre Repository with MS patches☆17Updated this week
- Auto detection of apt proxies in the LAN, caching and checking status☆10Feb 13, 2025Updated last year
- Source code for SIMD benchmarks and experiments in Java☆32Jun 30, 2017Updated 8 years ago
- Lightweight Protobuf codegen for TypeScript and JavaScript.☆13Jun 1, 2026Updated last week
- Scripts for managing Debian and RPM package repositories☆14Jan 14, 2026Updated 4 months ago
- All the things about TPC-DS in Apache Spark☆112Jun 15, 2023Updated 2 years ago
- Fluentd output plugin to Yandex ClickHouse.☆11Nov 8, 2017Updated 8 years ago
- Modern file sync tool with delta transfers, 40-79% faster than rsync☆25Apr 20, 2026Updated last month
- Spark ClickHouse Connector built on the DataSourceV2 API☆219Updated this week
- Fast I/O plugins for Spark☆41Dec 14, 2020Updated 5 years ago
- Python wrappers for the FirecREST API☆12May 19, 2026Updated 3 weeks ago
- CLI tools for Slurm clusters☆13Apr 24, 2026Updated last month
- disk usage for IBM Storage Scale file systems☆12Jun 2, 2026Updated last week
- Running TPC-H on Apache Hive☆41Jul 15, 2019Updated 6 years ago
- Slurm job script archival☆12Apr 16, 2026Updated last month
- Script for doing Slurm Calculations☆12Mar 21, 2025Updated last year
- Utilities to support interacting with multiple HPC clusters☆11Nov 21, 2025Updated 6 months ago