Alluxio / alluxio-pyLinks
Alluxio Python client - Access Any Data Source with Python
☆29Updated 3 weeks ago
Alternatives and similar repositories for alluxio-py
Users that are interested in alluxio-py are comparing it to the libraries listed below
Sorting:
- ☆14Updated 3 years ago
- CDAP UI☆20Updated this week
- Python client for Spark Jobserver Rest API☆39Updated 5 years ago
- Verify Hive SQL without running the sql exactly. Just check the syntax before run.☆24Updated 12 years ago
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- A plugin for Apache Airflow that allows you to manage the users that can login☆14Updated 5 years ago
- SQLFlow is a bridge that connects a SQL engine, e.g. MySQL, Hive, SparkSQL or SQL Server, with TensorFlow and other machine learning tool…☆76Updated 6 years ago
- This library is an ongoing effort towards bringing the data exchanging ability between Java/Scala and Python. PyJava introduces Apache A…☆48Updated 2 years ago
- Flink image for Kubernetes that fixes Jobmanage connection issue☆26Updated 7 years ago
- General Metadata Architecture☆127Updated this week
- ☆31Updated 8 years ago
- ☆39Updated 6 years ago
- Continuous scalable web crawler built on top of Flink and crawler-commons☆52Updated 6 years ago
- ☆11Updated 9 years ago
- SQLFlow client library for Python☆29Updated 2 years ago
- Mirror of Apache Chukwa☆85Updated 6 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 2 years ago
- sql interface for solr cloud☆40Updated 2 years ago
- A command line interface for your tiny smart workers.☆15Updated last month
- ☆48Updated last year
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆61Updated 8 months ago
- Spark CEP is an extension of Spark Streaming to support SQL-based query processing☆56Updated 8 years ago
- Yarn on Docker - Managing Hadoop Yarn cluster with Docker Swarm.☆37Updated 3 years ago
- Kylin running in a Docker cluster☆46Updated 9 years ago
- Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies…☆22Updated 6 years ago
- The Accelerator is a tool for fast and reproducible processing of large amounts of data.☆150Updated 3 years ago
- Spark SQL UDF examples☆56Updated 7 years ago
- Python and Scala APIs for enhanced Spark analytics☆12Updated 8 years ago
- Teradata SQL Extension for Jupyter☆27Updated 3 months ago
- Apache Phoenix Query Server☆50Updated 2 months ago