spotify / snakebite
A pure Python HDFS client
☆855 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for snakebite
- API and command line interface for HDFS ☆270 · Updated last month
- Python DB API 2.0 client for Impala and Hive (HiveServer2 protocol) ☆731 · Updated 2 weeks ago
- A Python MapReduce and HDFS API for Hadoop ☆237 · Updated 10 months ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere ☆1,010 · Updated 2 years ago
- ☆208 · Updated 8 years ago
- Python interface to Hive and Presto. 🐝 ☆1,671 · Updated 3 months ago
- A developer-friendly Python library to interact with Apache HBase ☆612 · Updated 3 months ago
- A Python connector for Druid ☆511 · Updated 3 months ago
- Python DB-API client for Presto ☆239 · Updated 11 months ago
- A wrapper for libhdfs3 to interact with HDFS from Python ☆136 · Updated 3 years ago
- Mirror of Apache Toree (Incubating) ☆740 · Updated last week
- LinkedIn's previous generation Kafka to HDFS pipeline. ☆881 · Updated 4 years ago
- Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover… ☆517 · Updated 4 years ago
- Apache Kafka client for Python; high-level & low-level consumer/producer, with great performance. ☆1,119 · Updated 3 years ago
- ☆511 · Updated 2 years ago
- Python implementation of the Parquet columnar file format. ☆341 · Updated 3 years ago
- Iceberg is a table format for large, slow-moving tabular data ☆479 · Updated last year
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark ☆1,354 · Updated last year
- Read - Write JSON SerDe for Apache Hive. ☆733 · Updated 11 months ago
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code. ☆1,138 · Updated last year
- A connector for Spark that allows reading and writing to/from Redis cluster ☆940 · Updated 3 weeks ago
- ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses. ☆280 · Updated 5 years ago
- A tool for monitoring and tuning Spark jobs for efficiency. ☆357 · Updated 2 years ago
- DataStax Connector for Apache Spark to Apache Cassandra ☆1,943 · Updated 2 months ago
- Kite SDK ☆394 · Updated 2 years ago
- Examples for High Performance Spark ☆503 · Updated 2 weeks ago