Alluxio / alluxio-py
Alluxio Python client - Access Any Data Source with Python
☆29Updated last month
Alternatives and similar repositories for alluxio-py
Users who are interested in alluxio-py are comparing it to the libraries listed below.
- ☆14Updated 3 years ago
- ☆39Updated 6 years ago
- Python client for Spark Jobserver Rest API☆40Updated 5 years ago
- Flink image for Kubernetes that fixes JobManager connection issue☆26Updated 7 years ago
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- ☆30Updated 8 years ago
- Verify Hive SQL without actually running it. Just check the syntax before running.☆24Updated 13 years ago
- Spark CEP is an extension of Spark Streaming to support SQL-based query processing☆57Updated 8 years ago
- ☆11Updated 10 years ago
- This library is an ongoing effort towards bringing the data exchanging ability between Java/Scala and Python. PyJava introduces Apache A…☆49Updated 2 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 3 years ago
- A workflow scheduler that understands both your data and metadata.☆28Updated 2 years ago
- ☆48Updated 2 years ago
- Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops☆118Updated 2 years ago
- ☆41Updated 10 years ago
- Apache Phoenix Query Server☆51Updated last week
- General Metadata Architecture☆133Updated this week
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆63Updated last week
- Apache Arrow Flight example☆11Updated 5 years ago
- SQLFlow is a bridge that connects a SQL engine, e.g. MySQL, Hive, SparkSQL or SQL Server, with TensorFlow and other machine learning tool…☆76Updated 6 years ago
- ☆45Updated last year
- Data Pipeline Clientlib provides an interface to tail and publish to data pipeline topics.☆110Updated 3 years ago
- Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, …☆36Updated 11 months ago
- PMML scoring library for Scala☆66Updated 3 weeks ago
- SQLFlow client library for Python☆29Updated 2 years ago
- The Accelerator is a tool for fast and reproducible processing of large amounts of data.☆149Updated 3 years ago
- CDAP UI☆20Updated this week
- Mirror of Apache Chukwa☆84Updated 6 years ago
- Here is my git repo for my Docker files related to Cloudera Hadoop CDH, to start, best is to check the documentation on https://github.co…☆56Updated 7 years ago
- Kylin running in a Docker cluster☆46Updated 9 years ago