Alluxio / alluxio-py
Alluxio Python client - Access Any Data Source with Python
☆29 Updated last week
Alternatives and similar repositories for alluxio-py
Users who are interested in alluxio-py are comparing it to the libraries listed below.
- ☆14 Updated 3 years ago
- ☆39 Updated 6 years ago
- Apache Phoenix Query Server ☆51 Updated 3 weeks ago
- Dremio Flight connector. Access Dremio using Arrow Flight ☆40 Updated 4 years ago
- This library is an ongoing effort towards bringing the data exchanging ability between Java/Scala and Python. PyJava introduces Apache A… ☆49 Updated 2 years ago
- Verify Hive SQL without actually running it. Just check the syntax before you run. ☆24 Updated 12 years ago
- Spark CEP is an extension of Spark Streaming to support SQL-based query processing ☆57 Updated 8 years ago
- Instant access to the Spark cluster from anywhere ☆16 Updated 4 years ago
- A plugin for Apache Airflow that allows you to manage the users that can log in ☆14 Updated 5 years ago
- Python client for Spark Jobserver REST API ☆39 Updated 5 years ago
- PMML scoring library for Scala ☆65 Updated last week
- CDAP UI ☆20 Updated this week
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating) ☆63 Updated this week
- ☆48 Updated last year
- Apache StreamPipes - A self-service (Industrial) IoT toolbox to enable non-technical users to connect, analyze and explore IoT data strea… ☆26 Updated 2 years ago
- General Metadata Architecture ☆129 Updated last week
- Flink image for Kubernetes that fixes a JobManager connection issue ☆26 Updated 7 years ago
- ☆11 Updated 9 years ago
- StreamLine - Streaming Analytics ☆165 Updated 2 years ago
- ☆30 Updated 8 years ago
- A parser for SQL, which gives back identifiers and a hierarchical model for lineage tracking ☆20 Updated 7 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol ☆34 Updated 2 years ago
- Apache Arrow Flight example ☆11 Updated 4 years ago
- SQLFlow is a bridge that connects a SQL engine, e.g. MySQL, Hive, SparkSQL or SQL Server, with TensorFlow and other machine learning tool… ☆77 Updated 6 years ago
- Kylin running in a Docker cluster ☆46 Updated 9 years ago
- Data Pipeline Clientlib provides an interface to tail and publish to data pipeline topics. ☆110 Updated 3 years ago
- Docker image for Apache Hive running on Tez ☆25 Updated 10 years ago
- Stratosphere is now Apache Flink. ☆198 Updated last year
- A workflow scheduler that understands both your data and metadata. ☆28 Updated 2 years ago
- A library for Spark DataFrame using MinIO Select API ☆98 Updated 5 years ago