Alluxio / alluxio-pyLinks
Alluxio Python client - Access Any Data Source with Python
☆29Updated last week
Alternatives and similar repositories for alluxio-py
Users that are interested in alluxio-py are comparing it to the libraries listed below
Sorting:
- Dremio Flight connector. Access Dremio using Arrow flight☆40Updated 4 years ago
- Apache Phoenix Query Server☆50Updated 2 months ago
- Llama - Low Latency Application MAster☆34Updated 3 years ago
- The Accelerator is a tool for fast and reproducible processing of large amounts of data.☆150Updated 3 years ago
- Apache Kibble - a tool to collect, aggregate and visualize data about any software project☆60Updated 7 months ago
- The Internals of Apache Beam☆12Updated 5 years ago
- Repository for building Apache Ozone Docker images☆15Updated last month
- Apache StreamPipes - A self-service (Industrial) IoT toolbox to enable non-technical users to connect, analyze and explore IoT data strea…☆26Updated 2 years ago
- The canonical source of StreamNative Hub.☆17Updated this week
- Run TPCH Benchmark on Apache Kylin☆22Updated 3 years ago
- A shim for using Cassandra as a backend for OpenTSDB. Not to be used as a general Cassandra client.☆7Updated 6 years ago
- ☆28Updated 7 years ago
- Repository for the Spark-Vector connector☆20Updated 4 years ago
- General Metadata Architecture☆127Updated this week
- Cask Hydrator Plugins Repository☆68Updated this week
- Ambari stack service for easily installing and managing Solr on HDP cluster☆19Updated 6 years ago
- Cubes OLAP Examples☆74Updated 7 years ago
- Python client for Spark Jobserver Rest API☆39Updated 5 years ago
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆61Updated 7 months ago
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources☆17Updated 4 years ago
- Spark CEP is an extension of Spark Streaming to support SQL-based query processing☆56Updated 8 years ago
- Mirror of Apache Tephra (Incubating)☆32Updated 2 years ago
- Docker image for Apache Hive running on Tez☆25Updated 10 years ago
- A schema store service that tracks and manages all the schemas used in the Data Pipeline☆87Updated 4 years ago
- TPC-H Benchmark on Cloudera Impala☆19Updated 12 years ago
- Change Data Capture (CDC) toolkit for keeping system layers in sync with the database☆23Updated 8 years ago
- [EOL] Image build contents for Kubernetes applications.☆48Updated 7 years ago
- Flowmix is a flexible event processing engine for Apache Storm. It supports complex correlations of events via sliding/tumbling windows. …☆58Updated 9 years ago
- Data Pipeline Clientlib provides an interface to tail and publish to data pipeline topics.☆110Updated 2 years ago
- ARCHIVED: Run Debezium/KafkaConnect CDC components in Kubernetes☆24Updated 6 years ago