☆208Apr 28, 2016Updated 9 years ago
Alternatives and similar repositories for pyhs2
Users that are interested in pyhs2 are comparing it to the libraries listed below.
- Python DB API 2.0 client for Impala and Hive (HiveServer2 protocol)☆740Jul 31, 2025Updated 7 months ago
- Python interface to Hive and Presto. 🐝☆1,693Aug 7, 2024Updated last year
- GeoIP Functions for hive☆48Oct 13, 2020Updated 5 years ago
- C++ native client for Impala and Hive, with Python / pandas bindings☆72Aug 15, 2018Updated 7 years ago
- Multicorn based PostgreSQL Foreign Data Wrapper for Treasure Data☆12Jan 1, 2017Updated 9 years ago
- Python client for Hadoop® YARN API☆109Sep 26, 2022Updated 3 years ago
- Spark Example using Phoenix to interact with HBase☆16Nov 2, 2016Updated 9 years ago
- Oozie Samples☆51Jan 11, 2014Updated 12 years ago
- ACL Management for Apache Spark SQL with Apache Ranger☆17Jun 18, 2020Updated 5 years ago
- A pure python HDFS client☆858Apr 19, 2022Updated 3 years ago
- API and command line interface for HDFS☆276Sep 24, 2024Updated last year
- Utilities to use Avro files from Hadoop Map/Reduce jobs and Streaming☆26Sep 10, 2013Updated 12 years ago
- [UNMAINTAINED] A developer-friendly Python library to interact with Apache HBase☆611Mar 16, 2026Updated last week
- ☆20Sep 25, 2023Updated 2 years ago
- A super simple utility for testing Apache Hive scripts locally for non-Java developers.☆73Feb 11, 2017Updated 9 years ago
- Open source SQL Query Assistant service for Databases/Warehouses☆1,449Updated this week
- Demonstrates how to develop an Oozie workflow application and aims to showcase Oozie's features.☆32Apr 12, 2022Updated 3 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Apr 18, 2017Updated 8 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,007Oct 5, 2022Updated 3 years ago
- Helpful user defined functions / table generating functions for Hive☆102May 2, 2016Updated 9 years ago
- Decoding Raymarine's ARCHIVE.FSH files, Garmin's IMG/ADM archives and the TRK subfiles.☆10Oct 22, 2019Updated 6 years ago
- Scripts for building Cloudera Manager parcel and CSD for Livy Spark Server☆21Oct 18, 2017Updated 8 years ago
- Demonstrates how to submit a job to Spark on HDP directly via YARN's REST API from any workstation☆23Apr 18, 2016Updated 9 years ago
- Jupyter notebook containing code from text preprocessing blog post☆10Nov 29, 2016Updated 9 years ago
- Set of hadoop input/output formats for use in combination with hadoop streaming☆32Jul 28, 2017Updated 8 years ago
- Python HDFS client☆96Jan 31, 2026Updated last month
- Spark Application : Spark Summit 2018 : Streaming Trend Discovery☆11Jun 7, 2018Updated 7 years ago
- Kafka Graphite Metrics Reporter☆94Oct 7, 2021Updated 4 years ago
- Puppet module to deploy Cloudera Manager and Cloudera's Distribution, including Apache Hadoop (CDH).☆34Nov 5, 2021Updated 4 years ago
- Luigi Workflow Engine integration for Treasure Data☆16May 14, 2018Updated 7 years ago
- Read - Write JSON SerDe for Apache Hive.☆738Nov 28, 2023Updated 2 years ago
- Scalable machine learning library for Apache Hive/Spark/Pig☆501Dec 2, 2016Updated 9 years ago
- Apache Kylin☆3,766Mar 13, 2026Updated last week
- A Java implementation of SpamSum / SSDeep☆14Jan 9, 2017Updated 9 years ago
- Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.☆6,607Updated this week
- A simple RELP library for Go☆11Apr 7, 2020Updated 5 years ago
- Python DBAPI Driver and Sqlalchemy Dialect for Apache Kylin, the "Extreme OLAP Engine for Big Data"☆50Dec 20, 2017Updated 8 years ago
- Apache Hive☆6,014Updated this week
- PostgreSQL extension which visualizes a plan tree using Graphviz☆13Aug 11, 2020Updated 5 years ago