hopshadoop / hdfscontents
A HDFS-backed ContentsManager implementation for IPython
☆24Updated 6 months ago
Alternatives and similar repositories for hdfscontents
Users that are interested in hdfscontents are comparing it to the libraries listed below
Sorting:
- An HDFS backed ContentsManager implementation for Jupyter☆12Updated last year
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 2 years ago
- An Integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes☆83Updated 5 years ago
- Example for experimenting with how JupyterHub can be configured to work with Kerberos☆33Updated 7 years ago
- An extension for Jupyter Lab & Jupyter Notebook to monitor Apache Spark (pyspark) from notebooks☆50Updated this week
- A library for Spark DataFrame using MinIO Select API☆98Updated 5 years ago
- A tool and library for easily deploying applications on Apache YARN☆143Updated last year
- Jupyter extensions for SWAN☆58Updated last month
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 5 years ago
- Schema Registry integration for Apache Spark☆40Updated 2 years ago
- Monitor Apache Spark from Jupyter Notebook☆172Updated 3 years ago
- A Spark metrics sink that pushes to InfluxDb☆51Updated 4 years ago
- ☆72Updated 4 years ago
- JupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook☆92Updated 2 years ago
- Deploy dask on YARN clusters☆69Updated 9 months ago
- ☆15Updated 3 months ago
- ☆39Updated 6 years ago
- Point-in-Time optimizations for Apache Spark☆30Updated last year
- Example project showing how to use Hive UDFs in Apache Spark☆55Updated 6 years ago
- PMML scoring library for Scala☆64Updated 3 months ago
- Utilities to work with Scala/Java code with py4j☆40Updated last year
- Python client for Hadoop® YARN API☆109Updated 2 years ago
- Dockerized setup for testing code on realistic hadoop clusters☆27Updated 4 years ago
- Docker image for main Apache Hadoop components (Yarn/Hdfs)☆56Updated 2 years ago
- Spark SQL index for Parquet tables☆134Updated 4 years ago
- HopsWorks - Hadoop for Humans☆117Updated 6 years ago
- Presto and Minio on Docker Infrastructure☆42Updated 6 years ago
- Quark is a data virtualization engine over analytic databases.☆97Updated 7 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆88Updated last year
- Docker images used internally by various Teradata projects for automation, testing, etc☆40Updated 7 years ago