A wrapper for libhdfs3 to interact with HDFS from Python
☆137 · Feb 9, 2021 · Updated 5 years ago
Alternatives and similar repositories for hdfs3
Users who are interested in hdfs3 are comparing it to the libraries listed below.
- Deprecated; please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead ☆53 · Jul 3, 2018 · Updated 7 years ago
- API and command line interface for HDFS ☆276 · Sep 24, 2024 · Updated last year
- Deploy Dask on Marathon ☆10 · Feb 6, 2017 · Updated 9 years ago
- libhdfs++ is a modern implementation of an HDFS client in C++11, designed for Massively Parallel Processing (MPP) applications. ☆28 · Jul 6, 2015 · Updated 10 years ago
- OpenSSL convenience scripts ☆12 · Feb 6, 2015 · Updated 11 years ago
- Python implementation of the Parquet columnar file format. ☆889 · Jan 6, 2026 · Updated last month
- Python client for the WebHDFS REST API ☆43 · May 8, 2015 · Updated 10 years ago
- Submit and execute distributed computations. A dask.distributed scheduler and Dispatcher.jl integration. ☆14 · Dec 4, 2020 · Updated 5 years ago
- Collection of Dask example notebooks ☆57 · Feb 14, 2018 · Updated 8 years ago
- A distributed task scheduler for Dask ☆1,666 · Updated this week
- A Python MapReduce and HDFS API for Hadoop ☆242 · Jan 19, 2026 · Updated last month
- Apache Mesos backend for the Dask scheduling library ☆28 · Oct 19, 2017 · Updated 8 years ago
- Collection of HDP Tuning Tricks & Tips (unofficial guide) ☆17 · Sep 26, 2017 · Updated 8 years ago
- A pure Python HDFS client ☆859 · Apr 19, 2022 · Updated 3 years ago
- Concurrent appendable key-value storage ☆107 · Jul 15, 2024 · Updated last year
- Partitioned storage system based on blosc. **No longer actively maintained.** ☆156 · Nov 21, 2016 · Updated 9 years ago
- This project contains the code to translate between Apache Spark and SFrame. ☆20 · Jul 13, 2016 · Updated 9 years ago
- Deploy an interactive data science environment with JupyterHub on Docker Swarm ☆21 · May 30, 2016 · Updated 9 years ago
- Pure Python wrapper for the Hadoop WebHDFS REST API ☆52 · Aug 10, 2020 · Updated 5 years ago
- JupyterHub Playbook for the Computational Models class at Berkeley ☆82 · Sep 27, 2016 · Updated 9 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling ☆23 · Jul 23, 2020 · Updated 5 years ago
- Minimalistic utility library to manage conda environments for PySpark jobs on YARN clusters ☆10 · Dec 26, 2022 · Updated 3 years ago
- Python bindings to librabbitmq using CFFI ☆10 · Jun 7, 2017 · Updated 8 years ago
- ☆10 · Mar 18, 2021 · Updated 4 years ago
- xlvector's solution of github contest ☆33 · Aug 30, 2009 · Updated 16 years ago
- Run-length compressed BWT with LZ77 sampled suffix array ☆10 · Apr 25, 2022 · Updated 3 years ago
- EXPERIMENTAL implementation of side graph ☆10 · Apr 16, 2015 · Updated 10 years ago
- Ledger repo for Legacy Near app (refer to https://github.com/LedgerHQ/app-near) ☆11 · Jul 10, 2024 · Updated last year
- Experimental, high-performance GPU-accelerated rasterizer for common Web content ☆11 · Jul 16, 2015 · Updated 10 years ago
- My personal tutorials to dive into Python in an hour or so ☆10 · Jul 15, 2016 · Updated 9 years ago
- C++ native client for Impala and Hive, with Python / pandas bindings ☆72 · Aug 15, 2018 · Updated 7 years ago
- PyBoiler is a simple Python 2.7 script to create a project template in a given directory. ☆10 · Jun 15, 2016 · Updated 9 years ago
- Phosphor-based Jupyter notebook ☆11 · Aug 8, 2015 · Updated 10 years ago
- Python code examples for working with the Slack API. 2.x and 3.x compatible code. ☆13 · May 19, 2016 · Updated 9 years ago
- ☆19 · Aug 30, 2013 · Updated 12 years ago
- Backend implementation for running MLflow projects on Hadoop/YARN. ☆11 · Dec 27, 2022 · Updated 3 years ago
- Reusable code for Hive ☆16 · Aug 19, 2014 · Updated 11 years ago
- Extensible Python Framework for Apache Mesos ☆33 · Oct 19, 2017 · Updated 8 years ago
- %conda magic for IPython ☆28 · Mar 16, 2017 · Updated 8 years ago