This library is an ongoing effort towards bringing the data exchanging ability between Java/Scala and Python. PyJava introduces Apache Arrow as the exchanging data format.
☆49Apr 21, 2023Updated 2 years ago
Alternatives and similar repositories for pyjava
Users that are interested in pyjava are comparing it to the libraries listed below
Sorting:
- ☆13Jun 17, 2022Updated 3 years ago
- A library based on delta for Spark and MLSQL☆60Dec 24, 2020Updated 5 years ago
- ☆19Jun 16, 2021Updated 4 years ago
- This is a library for SQL optimizing/rewriting including Materialized View rewrite☆69Jun 21, 2022Updated 3 years ago
- ☆22Jun 21, 2022Updated 3 years ago
- Example Application to consume a twitter stream with Neo4j☆11Mar 16, 2017Updated 9 years ago
- Big data smart alarm by sql☆12May 11, 2021Updated 4 years ago
- A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).☆152Apr 21, 2023Updated 2 years ago
- A HBase datasource implementation for Spark and [MLSQL](http://www.mlsql.tech).☆15Sep 29, 2023Updated 2 years ago
- Byzer (former MLSQL): A low-code open-source programming language for data pipeline, analytics and AI.☆1,843May 29, 2024Updated last year
- ☆48Sep 11, 2023Updated 2 years ago
- ServiceFramework 示例项目☆10Apr 2, 2016Updated 9 years ago
- ☆12May 30, 2024Updated last year
- Demo for service oriented application hosted on Hadoop YARN cluster for HA and scheduling☆23Apr 2, 2018Updated 7 years ago
- an open source dataworks platform☆21Jun 4, 2021Updated 4 years ago
- Unified SQL Analytics Engine Based on SparkSQL☆211Dec 5, 2022Updated 3 years ago
- Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream …☆22Feb 6, 2017Updated 9 years ago
- A distributed generic query layer for Apache Kafka Interactive Queries☆26Nov 8, 2017Updated 8 years ago
- Scala Tutorial☆15Dec 19, 2018Updated 7 years ago
- ☆12Mar 26, 2015Updated 10 years ago
- sparrow-passport-ddd☆11Sep 10, 2025Updated 6 months ago
- Alerting and monitoring tool for Apache Spark☆23May 20, 2022Updated 3 years ago
- ☆23Sep 25, 2024Updated last year
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources☆18Jun 28, 2021Updated 4 years ago
- Package Objects☆12Jun 5, 2025Updated 9 months ago
- A sink to save Spark Structured Streaming DataFrame into Hive table☆30Apr 16, 2018Updated 7 years ago
- ☆13Jan 1, 2021Updated 5 years ago
- 深入ElasticSearch☆17Mar 8, 2016Updated 10 years ago
- Spark Structured Streaming Kafka 0.8 Source Implementation☆35Apr 27, 2017Updated 8 years ago
- a tailored Apache Calcite for Apache Kylin, more details at http://mail-archives.apache.org/mod_mbox/kylin-dev/201704.mbox/%3CCAF7etT=wEB…☆14Nov 7, 2025Updated 4 months ago
- ☆20May 31, 2023Updated 2 years ago
- Apache Spark 2x Machine Learning Cookbook, published by Packt☆33Jul 23, 2025Updated 7 months ago
- Wormhole is a SPaaS (Stream Processing as a Service) Platform☆976Nov 16, 2022Updated 3 years ago
- Spark CEP is an extension of Spark Streaming to support SQL-based query processing☆57Apr 12, 2017Updated 8 years ago
- Processing videos on Apache Spark☆12Feb 14, 2022Updated 4 years ago
- Alternative caching backends for `{memoise}` & `{shiny}`.☆13Mar 27, 2023Updated 2 years ago
- Advent of Code solutions using dbt, duckdb, dbt-duckdb☆13Feb 5, 2023Updated 3 years ago
- mlsql-web based on vue 提供一个sql编辑器☆25Aug 26, 2018Updated 7 years ago
- R interface for Google Pub/Sub☆10Mar 3, 2023Updated 3 years ago