Alluxio / alluxio-py
Alluxio Python client - Access Any Data Source with Python
☆29Updated last week
Alternatives and similar repositories for alluxio-py
Users who are interested in alluxio-py compare it to the libraries listed below.
- ☆14Updated 3 years ago
- Verify Hive SQL without actually running it; just check the syntax before running.☆24Updated 12 years ago
- Dremio Flight connector. Access Dremio using Arrow Flight☆40Updated 4 years ago
- ☆39Updated 6 years ago
- General Metadata Architecture☆129Updated last week
- Continuous scalable web crawler built on top of Flink and crawler-commons☆52Updated 6 years ago
- ☆11Updated 9 years ago
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources☆17Updated 4 years ago
- This library is an ongoing effort towards bringing the data exchanging ability between Java/Scala and Python. PyJava introduces Apache A…☆49Updated 2 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 3 years ago
- Spark CEP is an extension of Spark Streaming to support SQL-based query processing☆57Updated 8 years ago
- Python and Scala APIs for enhanced Spark analytics☆12Updated 8 years ago
- SQLFlow client library for Python☆29Updated 2 years ago
- CDAP UI☆20Updated 3 weeks ago
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆63Updated 3 weeks ago
- Apache StreamPipes - A self-service (Industrial) IoT toolbox to enable non-technical users to connect, analyze and explore IoT data strea…☆26Updated 2 years ago
- Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies…☆22Updated 6 years ago
- ☆45Updated last year
- ☆48Updated 2 years ago
- A parser for SQL, which gives back identifiers and a hierarchical model for lineage tracking☆20Updated 7 years ago
- Mirror of Apache Chukwa☆85Updated 6 years ago
- A library for Spark DataFrame using MinIO Select API☆99Updated 5 years ago
- SQLFlow is a bridge that connects a SQL engine, e.g. MySQL, Hive, SparkSQL or SQL Server, with TensorFlow and other machine learning tool…☆76Updated 6 years ago
- Kafka, Spark Streaming, Kudu integration examples☆17Updated 7 years ago
- Python client for Spark Jobserver Rest API☆40Updated 5 years ago
- Documentation and resources for deploying JupyterHub on Hadoop☆19Updated 6 years ago
- ☆30Updated 8 years ago
- spark-drools tutorials☆16Updated last year
- Flink image for Kubernetes that fixes a JobManager connection issue☆26Updated 7 years ago
- PMML scoring library for Scala☆65Updated last month