mtth / hdfs
API and command line interface for HDFS
☆272Updated 5 months ago
Alternatives and similar repositories for hdfs:
Users that are interested in hdfs are comparing it to the libraries listed below
- A Python MapReduce and HDFS API for Hadoop☆238Updated last month
- A developer-friendly Python library to interact with Apache HBase☆607Updated 7 months ago
- ☆209Updated 8 years ago
- Python DB API 2.0 client for Impala and Hive (HiveServer2 protocol)☆734Updated last week
- A wrapper for libhdfs3 to interact with HDFS from Python☆136Updated 4 years ago
- Python client for Hadoop® YARN API☆109Updated 2 years ago
- Python DB-API client for Presto☆238Updated last year
- A Python connector for Druid☆513Updated 7 months ago
- A pure python HDFS client☆857Updated 2 years ago
- Pure Python wrapper for the Hadoop WebHDFS Rest API☆52Updated 4 years ago
- Python HDFS client☆93Updated last week
- Lightweight Azkaban client☆77Updated 5 years ago
- python implementation of the parquet columnar file format.☆349Updated 3 years ago
- A collection of examples using flinks new python API☆244Updated 6 years ago
- Python interface to Hive and Presto. 🐝☆1,678Updated 7 months ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,005Updated 2 years ago
- A tool and library for easily deploying applications on Apache YARN☆143Updated last year
- Apache (Py)Spark type annotations (stub files).☆116Updated 2 years ago
- DEPRECATED - HBase Stargate (REST API) client wrapper for Python.☆53Updated 6 years ago
- A pure Python implementation of Apache Spark's RDD and DStream interfaces.☆268Updated 6 months ago
- A tool for monitoring and tuning Spark jobs for efficiency.☆357Updated 2 years ago
- Mirror of Apache Toree (Incubating)☆741Updated last month
- Spark package for checking data quality☆221Updated 5 years ago
- Cloudera Manager API Client☆306Updated last year
- A process that runs in unison with Apache Airflow to control the Scheduler process to ensure High Availability☆233Updated 2 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆283Updated 6 years ago
- ☆519Updated 3 years ago
- The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.☆553Updated 3 years ago
- Monitor Apache Spark from Jupyter Notebook☆172Updated 2 years ago
- Hive UDFs for funnel analysis☆83Updated 2 years ago