A cross-platform (Windows, macOS, Linux) desktop application for viewing common big data binary formats such as Parquet, ORC, Avro, etc. Supports the local file system, HDFS, AWS S3, Azure Blob Storage, etc.
☆305 · Oct 8, 2025 · Updated 6 months ago
Alternatives and similar repositories for bigdata-file-viewer
Users that are interested in bigdata-file-viewer are comparing it to the libraries listed below.
- ☆11 · Nov 16, 2022 · Updated 3 years ago
- Tools for building, packaging, and OAP public cloud integrations such as AWS EMR, Google Dataproc and K8S. ☆18 · Mar 27, 2024 · Updated 2 years ago
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer. ☆37 · Jan 3, 2023 · Updated 3 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol ☆34 · Sep 8, 2022 · Updated 3 years ago
- A Scala library for Firestore in Datastore mode ☆13 · Jun 11, 2024 · Updated last year
- A library that brings useful functions from various modern database management systems to Apache Spark ☆62 · Sep 4, 2023 · Updated 2 years ago
- A small library of Hive UDFs using macros to process and manipulate complex types ☆15 · Oct 2, 2025 · Updated 7 months ago
- JGeocoder is a free, open source geocoder implemented in Java. It assigns geocoordinates to postal addresses using the Federal Census Bu… ☆18 · Dec 6, 2022 · Updated 3 years ago
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr… ☆10 · Feb 10, 2023 · Updated 3 years ago
- Optimized Spark package to accelerate machine learning algorithms in Apache Spark MLlib. ☆22 · Mar 24, 2026 · Updated last month
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages. ☆898 · Apr 27, 2026 · Updated last week
- Apache Parquet Java ☆3,055 · Updated this week
- [student project] UI to run SQL on Delta Lake tables and visualize the variations of the result among table versions ☆12 · Apr 21, 2020 · Updated 6 years ago
- Experimental support for serializing DataFusion plans using substrait ☆46 · Jan 13, 2023 · Updated 3 years ago
- SparkCube is an open-source project for extremely fast OLAP data analysis. SparkCube is an extension of Apache Spark. ☆136 · Mar 6, 2023 · Updated 3 years ago
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines. ☆1,558 · Updated this week
- Upserts, Deletes And Incremental Processing on Big Data. ☆6,150 · Updated this week
- phoenix ☆12 · Oct 4, 2022 · Updated 3 years ago
- CLI tool to bulk migrate the tables from one catalog to another without a data copy ☆85 · Apr 12, 2025 · Updated last year
- A custom ContentRepository implementation for NiFi to persist data to MinIO Object Storage ☆35 · Jul 15, 2022 · Updated 3 years ago
- A small project to allow publishing data to Apache Kafka, Apache Pulsar or any other target system ☆15 · Sep 21, 2020 · Updated 5 years ago
- This solution can help reduce AWS operational costs for both development and production environments. ☆11 · Oct 1, 2017 · Updated 8 years ago
- Spark metrics related custom classes and sinks (e.g. Prometheus) ☆188 · Aug 2, 2022 · Updated 3 years ago
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr… ☆8,767 · Updated this week
- Go wrapper for Apache Arrow C++ ☆15 · Aug 24, 2021 · Updated 4 years ago
- A collection of Apache Hudi demos to help beginners get started with Apache Hudi quickly ☆74 · Sep 13, 2020 · Updated 5 years ago
- A complete custom processor project, for your reference. ☆17 · Sep 29, 2015 · Updated 10 years ago
- Example using Grafana with Druid ☆11 · Mar 27, 2015 · Updated 11 years ago
- IoT Trucking App with Flink (with Table API & SQL) ☆14 · Jul 4, 2018 · Updated 7 years ago
- ☆109 · Jul 5, 2023 · Updated 2 years ago
- Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats. ☆1,127 · Apr 23, 2026 · Updated last week
- https://blog.csdn.net/QXC1281/article/details/89070285 ☆550 · Mar 18, 2023 · Updated 3 years ago
- cfid: R package for identifying counterfactuals. ☆11 · Dec 11, 2025 · Updated 4 months ago
- A library for querying Druid data sources with Apache Spark ☆23 · Oct 28, 2020 · Updated 5 years ago
- Query and transform data with PRQL ☆136 · Sep 23, 2023 · Updated 2 years ago
- Apache Flink connector for Elasticsearch ☆94 · Mar 30, 2026 · Updated last month
- Apple OpenSource download tool ☆13 · Apr 17, 2020 · Updated 6 years ago
- Apache Iceberg ☆8,792 · Updated this week
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap… ☆306 · Oct 30, 2025 · Updated 6 months ago