wgzhao / easybase
Developer-friendly Python library to interact with Apache HBase, supports time range scan and multi-versions
☆18Updated last year
Related projects ⓘ
Alternatives and complementary repositories for easybase
- Python client for Hadoop® YARN API☆109Updated 2 years ago
- ☆208Updated 8 years ago
- ☆14Updated 2 years ago
- Material of Clickhouse Meetup in China☆159Updated 5 years ago
- Guardian of Waterdrop and Spark☆30Updated last year
- Lightweight Azkaban client☆77Updated 4 years ago
- A UDF for Cloudera Impala ( hive get_json_object equivalent )☆32Updated 3 years ago
- Yarn on Docker - Managing Hadoop Yarn cluster with Docker Swarm.☆37Updated 2 years ago
- Scalable NameNode RPC Proxy for HDFS Federation☆84Updated 8 years ago
- Python HDFS client☆90Updated last month
- Apache Kylin Python Client Library☆63Updated last year
- A Spark SQL HBase connector☆29Updated 9 years ago
- This library is an ongoing effort towards bringing the data exchanging ability between Java/Scala and Python. PyJava introduces Apache A…☆46Updated last year
- Plugin for Presto to allow addition of user functions easily☆116Updated 3 years ago
- Hive UDFs for funnel analysis☆84Updated last year
- kudu学习的一些资料,以及和spark/impala的集成使用☆33Updated 7 years ago
- A library for querying Binlog with Apache Spark structure streaming, for Spark SQL , DataFrames and [MLSQL](https://www.mlsql.tech).☆154Updated last year
- NameNodeAnalytics is a self-help utility for scouting and maintaining the namespace of an HDFS instance.☆113Updated 3 months ago
- python library for reusable client connections☆27Updated 5 years ago
- redis realtime sync tool, like mysql canal☆17Updated 7 years ago
- ☆27Updated 3 years ago
- loading hdfs data to clickhouse☆73Updated 2 years ago
- A plugin to the Kafka Connect framework that replicates data from MySQL to Kafka☆97Updated 8 years ago
- A sample of Flink TiDB Realtime Datawarehouse.☆83Updated 3 years ago
- ☆77Updated 6 years ago
- Scripts for building Cloudera Manager parcel and CSD for Livy Spark Server☆21Updated 7 years ago
- Data Pipeline Clientlib provides an interface to tail and publish to data pipeline topics.☆109Updated 2 years ago
- Flink: Stateful Computations over Data Streams☆15Updated 6 years ago
- GeoIP Functions for hive☆48Updated 4 years ago