Java binding to Apache DataFusion
☆85Apr 14, 2025Updated 11 months ago
Alternatives and similar repositories for datafusion-java
Users that are interested in datafusion-java are comparing it to the libraries listed below
Sorting:
- ☆49Feb 14, 2022Updated 4 years ago
- Client libraries of end users of Apache Kyuubi☆11Jan 10, 2023Updated 3 years ago
- A MVP implementation of distributed query engine cut from datafusion-ballista codebase for learning purpose.☆12Jan 10, 2025Updated last year
- Apache DataFusion Ballista Distributed Query Engine☆1,993Updated this week
- ☆100Updated this week
- A JUnit Proxy/DNS rule for connecting to dockerised applications with standard hostnames and ports☆14Mar 13, 2026Updated last week
- ☆70Jan 3, 2025Updated last year
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.☆257Feb 21, 2023Updated 3 years ago
- ☆33May 9, 2025Updated 10 months ago
- Official Java implementation of Apache Arrow☆83Updated this week
- Community Java bindings for https://github.com/facebookincubator/velox☆40Updated this week
- Code repository for SNARF☆13Apr 27, 2023Updated 2 years ago
- Java API for libaio☆15Jan 10, 2022Updated 4 years ago
- End-to-end SQL fuzz testing for DataFusion using SQLancer☆12Feb 9, 2026Updated last month
- Benchmarks for Apache Flink☆184Jan 4, 2026Updated 2 months ago
- ☆37Jun 5, 2024Updated last year
- Apache DataFusion Comet Spark Accelerator☆1,153Mar 13, 2026Updated last week
- a hyper-optimized single-node(local) version of spark sql engine, which's fundamental data structure is scala Iterator rather than RDD.☆13Jun 13, 2023Updated 2 years ago
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆305Oct 30, 2025Updated 4 months ago
- Http Connector for Apache Flink. Provides sources and sinks for Datastream , Table and SQL APIs.☆202Feb 27, 2026Updated 3 weeks ago
- Demo for service oriented application hosted on Hadoop YARN cluster for HA and scheduling☆23Apr 2, 2018Updated 7 years ago
- ☆37Jul 26, 2022Updated 3 years ago
- SWIM Protocol in Java☆10Apr 1, 2020Updated 5 years ago
- GraphqlCRUDJava - Out of the box GraphQL CRUD for your database☆10Sep 16, 2022Updated 3 years ago
- Data Infra 研究社☆28Jun 17, 2025Updated 9 months ago
- A composable framework for fast and scalable data analytics☆57Dec 12, 2022Updated 3 years ago
- 同步数据的小工具☆17Feb 27, 2026Updated 3 weeks ago
- A command-line tool that automates several common multi-step operations in the IoT and related services.☆17Jan 14, 2026Updated 2 months ago
- Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.☆1,039Mar 11, 2026Updated last week
- Distributed consensus system with Map interface based on Apache Ratis☆27Nov 27, 2023Updated 2 years ago
- Gentics Mesh UI☆24Oct 4, 2023Updated 2 years ago
- SigTest is the tool for checking incompatibilities between different versions of the same API.☆10Feb 21, 2026Updated 3 weeks ago
- This is OpenMLDB's Spark Distribution, which is particularly optimized for feature extraction. It includes a few novel techniques, such a…☆12Jul 30, 2024Updated last year
- Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.☆1,116Mar 12, 2026Updated last week
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.☆280Sep 25, 2024Updated last year
- A software library of stochastic streaming algorithms, a.k.a. sketches.☆108Jan 20, 2026Updated last month
- The go to place for official fluvio connectors☆22Apr 26, 2023Updated 2 years ago
- ☆23Jan 23, 2022Updated 4 years ago
- A JVM-embeddable Distributed Database☆326Sep 1, 2025Updated 6 months ago