C++ native client for Impala and Hive, with Python / pandas bindings
☆72Aug 15, 2018Updated 7 years ago
Alternatives and similar repositories for hs2client
Users that are interested in hs2client are comparing it to the libraries listed below
Sorting:
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆53Jul 3, 2018Updated 7 years ago
- Python DB API 2.0 client for Impala and Hive (HiveServer2 protocol)☆740Jul 31, 2025Updated 7 months ago
- Fork of Cloudera Impala separated from Hadoop☆42Jul 13, 2016Updated 9 years ago
- Thin REST-API for Impala with Redis caching☆15Aug 4, 2016Updated 9 years ago
- Python bindings for the NVML. Non-volatile memory for Python.☆12May 23, 2016Updated 9 years ago
- Sample UDF and UDAs for Impala.☆63Sep 19, 2025Updated 6 months ago
- Cython based wrapper for libavro☆25Sep 14, 2020Updated 5 years ago
- Mirror of Apache Tephra (Incubating)☆32Apr 17, 2023Updated 2 years ago
- Real-time query spark and visualise it as graph.☆24Oct 4, 2017Updated 8 years ago
- Demo notebook of Ibis for "Spark + Python + Dita science Festival"☆12Jul 28, 2016Updated 9 years ago
- Simple spill-to-disk dictionary☆18May 24, 2016Updated 9 years ago
- Track public endpoints and connections across AWS accounts using VPC Flow Logs☆12Jun 14, 2016Updated 9 years ago
- CUDA kernel and JNI code which is called by Apache Spark's MLlib.☆19Jun 18, 2016Updated 9 years ago
- This is the example code repository for Getting Started with Impala by John Russell (O'Reilly Media)☆22Aug 20, 2017Updated 8 years ago
- A d3.js library to produce flame graphs.☆12Sep 24, 2018Updated 7 years ago
- Interactive performance benchmarking in Jupyter☆33Dec 2, 2024Updated last year
- Apache Parquet format for Rust, hosting the Thrift definition file and the generated .rs file☆18Jul 6, 2022Updated 3 years ago
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with …☆655Mar 1, 2026Updated 3 weeks ago
- Machine learning evaluation database☆24Feb 7, 2018Updated 8 years ago
- ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.☆282Feb 27, 2019Updated 7 years ago
- An Ansible module for managing Python packages via Conda☆55Mar 4, 2024Updated 2 years ago
- Paper: A Zero-rename committer for object stores☆20Nov 7, 2025Updated 4 months ago
- Cache File System optimized for columnar formats and object stores☆187Aug 11, 2022Updated 3 years ago
- SparkTDA is a package for Apache Spark providing Topological Data Analysis Functionalities.☆46Jul 8, 2018Updated 7 years ago
- XR-style Interface to Python (from "Extending R")☆18Apr 14, 2024Updated last year
- ☆18Jan 17, 2025Updated last year
- IOManager tries to bridge the gap in existing async framework to build full async networked database/storage/keyvalue storage☆11Feb 7, 2026Updated last month
- ☆15Jan 31, 2018Updated 8 years ago
- Make a quick reaction game in Python with some simple electronics and your Raspberry Pi☆16Jul 9, 2025Updated 8 months ago
- A compiler and runtime for Google's Sawzall language, optimized for Hadoop☆41Apr 26, 2013Updated 12 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Jul 13, 2016Updated 9 years ago
- Analyzing NBA Data☆11Feb 19, 2015Updated 11 years ago
- Common C++ client that accesses HBase cluster through HBase ThriftServer.☆20Aug 29, 2012Updated 13 years ago
- A pure python HDFS client☆858Apr 19, 2022Updated 3 years ago
- An Python object protocol for projects to interchange data frame-like data without forcing pandas.DataFrame as the intermediary☆15Apr 9, 2020Updated 5 years ago
- DuckDB extension for MySQL☆15Mar 17, 2024Updated 2 years ago
- Introduction to Functional programming talk from Pittsburgh TechFest☆39Jun 4, 2013Updated 12 years ago
- Apache Kylin Python Client Library☆63Apr 21, 2023Updated 2 years ago
- Task scheduling and blocked algorithms for parallel processing☆17Jan 5, 2026Updated 2 months ago