A wrapper for libhdfs3 to interact with HDFS from Python
☆137Feb 9, 2021Updated 5 years ago
Alternatives and similar repositories for hdfs3
Users that are interested in hdfs3 are comparing it to the libraries listed below
Sorting:
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆53Jul 3, 2018Updated 7 years ago
- API and command line interface for HDFS☆276Sep 24, 2024Updated last year
- libhdfs++ is a modern implementation of HDFS client in C++11 that is designed for the Massive Parallel Processing (MPP) applications.☆28Jul 6, 2015Updated 10 years ago
- Deploy Dask on Marathon☆10Feb 6, 2017Updated 9 years ago
- Cython based wrapper for libavro☆25Sep 14, 2020Updated 5 years ago
- Submit and execute distributed computations. A dask.distributed scheduler and Dispatcher.jl integration.☆14Dec 4, 2020Updated 5 years ago
- ☆10Apr 5, 2017Updated 8 years ago
- Go bindings for the Jupyter protocol☆10Jan 8, 2024Updated 2 years ago
- python implementation of the parquet columnar file format.☆890Updated this week
- OpenSSL convenience scripts☆12Feb 6, 2015Updated 11 years ago
- Minimalistic utility library to manage conda environments for pyspark jobs on yarn clusters☆10Dec 26, 2022Updated 3 years ago
- Backend implementation for running MLFlow projects on Hadoop/YARN.☆11Dec 27, 2022Updated 3 years ago
- Apache Mesos backend for Dask scheduling library☆28Oct 19, 2017Updated 8 years ago
- Partitioned storage system based on blosc. **No longer actively maintained.**☆156Nov 21, 2016Updated 9 years ago
- Start a cluster in EC2 for dask.distributed☆105Nov 3, 2020Updated 5 years ago
- Literate Computing for Reproducible Infrastructure - Hadoop Practice☆11Mar 5, 2026Updated 2 weeks ago
- Collection of dask example notebooks☆57Feb 14, 2018Updated 8 years ago
- Pure Python wrapper for the Hadoop WebHDFS Rest API☆52Aug 10, 2020Updated 5 years ago
- A distributed task scheduler for Dask☆1,665Updated this week
- A pure python HDFS client☆858Apr 19, 2022Updated 3 years ago
- Extensible Python Framework for Apache Mesos☆33Oct 19, 2017Updated 8 years ago
- ☆37Feb 20, 2017Updated 9 years ago
- Ledger repo for Legacy Near app (refer to https://github.com/LedgerHQ/app-near)☆11Jul 10, 2024Updated last year
- Python bindings to librabbitmq using CFFI☆10Jun 7, 2017Updated 8 years ago
- JupyterHub proxy implementation with traefik☆59Mar 2, 2026Updated 2 weeks ago
- Data Migration for the Blaze Project☆1,004Jul 15, 2022Updated 3 years ago
- ☆19Jul 15, 2018Updated 7 years ago
- Collection of HDP Tuning Tricks & Tips (unofficial guide)☆17Sep 26, 2017Updated 8 years ago
- HDFS client library for C☆45Mar 19, 2024Updated 2 years ago
- Deploy an interactive data science environment with JupyterHub on Docker Swarm☆21May 30, 2016Updated 9 years ago
- cloud platform monitoring system☆32Sep 11, 2012Updated 13 years ago
- Conda-driven Concourse CI for package building☆13Nov 3, 2023Updated 2 years ago
- Interactive programming for Atom☆13Jul 1, 2016Updated 9 years ago
- A Python MapReduce and HDFS API for Hadoop☆241Jan 19, 2026Updated 2 months ago
- reproducible executable environments☆444Oct 27, 2017Updated 8 years ago
- A tool and library for easily deploying applications on Apache YARN☆146Mar 12, 2024Updated 2 years ago
- This a mirror of the subversion repository on COIN-OR.☆10Feb 23, 2019Updated 7 years ago
- commandline manipulation of genomic variants and NGS reads☆19Sep 6, 2024Updated last year
- User Fault Objects: making vectors lazy and forgetful.☆13Sep 17, 2021Updated 4 years ago