Python DB API 2.0 client for Impala and Hive (HiveServer2 protocol)
☆742 · Jul 31, 2025 · Updated 7 months ago
Alternatives and similar repositories for impyla
Users who are interested in impyla are comparing it to the libraries listed below.
- Python interface to Hive and Presto. 🐍 ☆1,695 · Aug 7, 2024 · Updated last year
- ☆208 · Apr 28, 2016 · Updated 9 years ago
- C++ native client for Impala and Hive, with Python / pandas bindings ☆72 · Aug 15, 2018 · Updated 7 years ago
- Real-time Query for Hadoop; mirror of Apache Impala ☆34 · Dec 27, 2022 · Updated 3 years ago
- Open source SQL Query Assistant service for Databases/Warehouses ☆1,447 · Updated this week
- API and command line interface for HDFS ☆276 · Sep 24, 2024 · Updated last year
- [UNMAINTAINED] A developer-friendly Python library to interact with Apache HBase ☆612 · Feb 23, 2026 · Updated last week
- the portable Python dataframe library ☆6,417 · Updated this week
- Python client for Apache Kafka ☆5,890 · Feb 25, 2026 · Updated last week
- Mirror of Apache Kudu ☆1,898 · Updated this week
- Livy is an open source REST interface for interacting with Apache Spark from anywhere ☆1,007 · Oct 5, 2022 · Updated 3 years ago
- Apache Airflow - A platform to programmatically author, schedule, and monitor workflows ☆44,430 · Updated this week
- A pure python HDFS client ☆859 · Apr 19, 2022 · Updated 3 years ago
- Jupyter magics and kernels for working with remote Spark clusters ☆1,362 · Sep 9, 2025 · Updated 5 months ago
- A non-validating SQL parser module for Python ☆3,995 · Dec 19, 2025 · Updated 2 months ago
- Mirror of Apache Toree (Incubating) ☆749 · Feb 21, 2026 · Updated last week
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga… ☆2,260 · Feb 19, 2026 · Updated last week
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark ☆1,371 · Aug 22, 2023 · Updated 2 years ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses. ☆2,308 · Updated this week
- File compaction tool that runs on top of the Spark framework. ☆59 · Apr 17, 2019 · Updated 6 years ago
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis… ☆18,681 · Feb 25, 2026 · Updated last week
- Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more. ☆6,605 · Updated this week
- A Maven-based example of using Cloudera Impala's JDBC driver ☆118 · May 10, 2016 · Updated 9 years ago
- Elasticsearch real-time search and analytics natively integrated with Hadoop ☆2,038 · Updated this week
- Mirror of Apache Sqoop ☆979 · Apr 8, 2021 · Updated 4 years ago
- Apache Hive ☆6,002 · Updated this week
- Apache Druid: a high performance real-time analytics database. ☆13,942 · Updated this week
- Parallel computing with task scheduling ☆13,754 · Updated this week
- Apache Kafka client for Python; high-level & low-level consumer/producer, with great performance. ☆1,118 · Jan 27, 2021 · Updated 5 years ago
- Already merged into apache/incubator-kyuubi. ACL Management for Apache Spark SQL with Apache Ranger. ☆58 · Nov 11, 2021 · Updated 4 years ago
- Apache Impala ☆1,267 · Updated this week
- LinkedIn's previous generation Kafka to HDFS pipeline. ☆884 · Aug 27, 2020 · Updated 5 years ago
- A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo is contributed to Apache Kyuubi | The project has migrated to Apa… ☆183 · Apr 6, 2022 · Updated 3 years ago
- CMAK is a tool for managing Apache Kafka clusters ☆11,948 · Aug 2, 2023 · Updated 2 years ago
- An open source indexing subsystem that brings index-based query acceleration to Apache Spark™ and big data workloads. ☆431 · Jan 14, 2022 · Updated 4 years ago
- Apache Spark - A unified analytics engine for large-scale data processing ☆42,898 · Updated this week
- Apache Superset is a Data Visualization and Data Exploration Platform ☆70,755 · Updated this week
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s… ☆8,475 · Feb 5, 2026 · Updated 3 weeks ago
- The official home of the Presto distributed SQL query engine for big data ☆16,662 · Feb 24, 2026 · Updated last week