A cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
☆305Oct 8, 2025Updated 6 months ago
Alternatives and similar repositories for bigdata-file-viewer
Users that are interested in bigdata-file-viewer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- easy install parquet-tools☆183Jul 9, 2024Updated last year
- ☆11Nov 16, 2022Updated 3 years ago
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.☆37Jan 3, 2023Updated 3 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Sep 8, 2022Updated 3 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark☆62Sep 4, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A small library of hive UDFS using Macros to process and manipulate complex types☆15Oct 2, 2025Updated 6 months ago
- JGeocoder is a free, open source geocoder implemented in Java. It assigns geocoordinates to postal addresses using the Federal Census Bu…☆18Dec 6, 2022Updated 3 years ago
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆10Feb 10, 2023Updated 3 years ago
- Optimized Spark package to accelerate machine learning algorithms in Apache Spark MLlib.☆22Mar 24, 2026Updated 3 weeks ago
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆897Updated this week
- Apache Parquet Java☆3,051Updated this week
- Experimental support for serializing DataFusion plans using substrait☆46Jan 13, 2023Updated 3 years ago
- TensorDB: In-Database Tensor Manipulation with Tensor-Relational Query Plans☆21Jul 25, 2014Updated 11 years ago
- SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark.☆137Mar 6, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,547Updated this week
- Upserts, Deletes And Incremental Processing on Big Data.☆6,139Updated this week
- phoenix☆12Oct 4, 2022Updated 3 years ago
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆85Apr 12, 2025Updated last year
- A custom ContentRepository implementation for NiFi to persist data to MinIO Object Storage☆35Jul 15, 2022Updated 3 years ago
- A small project to allow publishing data to Apache Kafka, Apache Pulsar or any other target system☆15Sep 21, 2020Updated 5 years ago
- The solution is can help reduce AWS operational costs for both development and production environments.☆11Oct 1, 2017Updated 8 years ago
- Spark metrics related custom classes and sinks (e.g. Prometheus)☆188Aug 2, 2022Updated 3 years ago
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆8,746Updated this week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 汇总Apache Hudi中的一些Demo,便于快速上手Apache Hudi(Apache Hudi Demos to help beginners know about Hudi)☆74Sep 13, 2020Updated 5 years ago
- A complete custom processor project, for your reference.☆17Sep 29, 2015Updated 10 years ago
- Example using Grafana with Druid☆11Mar 27, 2015Updated 11 years ago
- ☆108Jul 5, 2023Updated 2 years ago
- Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.☆1,122Updated this week
- https://blog.csdn.net/QXC1281/article/details/89070285☆551Mar 18, 2023Updated 3 years ago
- cfid: R package for identifying counterfactuals.☆11Dec 11, 2025Updated 4 months ago
- Apache Flink connector for ElasticSearch☆92Mar 30, 2026Updated 2 weeks ago
- Query and transform data with PRQL☆137Sep 23, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Apple OpenSource download tool☆13Apr 17, 2020Updated 5 years ago
- Apache Iceberg☆8,700Apr 7, 2026Updated last week
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆306Oct 30, 2025Updated 5 months ago
- examples for apache calcite☆13Sep 16, 2022Updated 3 years ago
- ☆62Feb 10, 2026Updated 2 months ago
- Command line (CLI) tool to inspect Apache Parquet files on the go☆200Nov 9, 2023Updated 2 years ago
- Includes notes on using Apache Spark, with drill down on Spark for Physics, how to run TPCDS on PySpark, how to create histograms with S…☆462Dec 15, 2025Updated 3 months ago