dask/hdfs3

A wrapper for libhdfs3 to interact with HDFS from Python

☆137

Alternatives and similar repositories for hdfs3

Users interested in hdfs3 are comparing it to the libraries listed below.

dask / knit
Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead
☆54 · Jul 3, 2018 · Updated 8 years ago
ContinuumIO / libhdfs3-downstream
a native c/c++ hdfs client (downstream fork from apache-hawq)
☆40 · Jun 25, 2026 · Updated 3 weeks ago
haohui / libhdfspp
libhdfs++ is a modern implementation of HDFS client in C++11 that is designed for the Massive Parallel Processing (MPP) applications.
☆28 · Jul 6, 2015 · Updated 11 years ago
mrocklin / dask-marathon
Deploy Dask on Marathon
☆10 · Feb 6, 2017 · Updated 9 years ago
invenia / DaskDistributedDispatcher.jl
Submit and execute distributed computations. A dask.distributed scheduler and Dispatcher.jl integration.
☆14 · Dec 4, 2020 · Updated 5 years ago
jreback / PandasTalks
☆10 · Apr 5, 2017 · Updated 9 years ago
moutai / sparkonda
Minimalistic utility library to manage conda environments for pyspark jobs on yarn clusters
☆10 · Dec 26, 2022 · Updated 3 years ago
dask / fastparquet
python implementation of the parquet columnar file format.
☆900 · Jun 29, 2026 · Updated 3 weeks ago
rgbkrk / juno
Go bindings for the Jupyter protocol
☆10 · Jan 8, 2024 · Updated 2 years ago
solvuu / phat
Strongly typed file path and file system operations.
☆26 · Jun 11, 2026 · Updated last month
cloudpipe / keymaster
OpenSSL convenience scripts
☆12 · Feb 6, 2015 · Updated 11 years ago
criteo / mlflow-yarn
Backend implementation for running MLFlow projects on Hadoop/YARN.
☆11 · Dec 27, 2022 · Updated 3 years ago
daskos / daskos
Apache Mesos backend for Dask scheduling library
☆28 · Oct 19, 2017 · Updated 8 years ago
blaze / castra
Partitioned storage system based on blosc. **No longer actively maintained.**
☆157 · Nov 21, 2016 · Updated 9 years ago
alexsmith1612 / hadoofus
HDFS client library for C
☆45 · Mar 19, 2024 · Updated 2 years ago
dask / dask-ec2
Start a cluster in EC2 for dask.distributed
☆105 · Nov 3, 2020 · Updated 5 years ago
dask / old-dask-examples
Collection of dask example notebooks
☆57 · Feb 14, 2018 · Updated 8 years ago
NII-cloud-operation / Literate-computing-Hadoop
Literate Computing for Reproducible Infrastructure - Hadoop Practice
☆11 · Mar 5, 2026 · Updated 4 months ago
pywebhdfs / pywebhdfs
Pure Python wrapper for the Hadoop WebHDFS Rest API
☆52 · Aug 10, 2020 · Updated 5 years ago
dask / distributed
A distributed task scheduler for Dask
☆1,675 · Updated this week
spotify / snakebite
A pure python HDFS client
☆857 · Apr 19, 2022 · Updated 4 years ago
daskos / mentor
Extensible Python Framework for Apache Mesos
☆33 · Oct 19, 2017 · Updated 8 years ago
minrk / release-page
☆10 · Mar 18, 2021 · Updated 5 years ago
LedgerHQ / app-near-legacy
Ledger repo for Legacy Near app (refer to https://github.com/LedgerHQ/app-near)
☆11 · Jul 10, 2024 · Updated 2 years ago
jupyterhub / traefik-proxy
JupyterHub proxy implementation with traefik
☆59 · Jul 1, 2026 · Updated 2 weeks ago
jbg / rabbitmq
Python bindings to librabbitmq using CFFI
☆10 · Jun 7, 2017 · Updated 9 years ago
conda-archive / kapsel
☆37 · Feb 20, 2017 · Updated 9 years ago
blaze / odo
Data Migration for the Blaze Project
☆1,006 · Jul 15, 2022 · Updated 4 years ago
zaratsian / HDP_Tuning_Unofficial
Collection of HDP Tuning Tricks & Tips (unofficial guide)
☆17 · Sep 26, 2017 · Updated 8 years ago
lh3 / sdg
EXPERIMENTAL implementation of side graph
☆10 · Apr 16, 2015 · Updated 11 years ago
jakevdp / OpenVisConf2014
My Talk for the 2014 OpenVisConf, April 24-25 in Boston, MA
☆16 · Apr 26, 2014 · Updated 12 years ago
getcarina / jupyterhub-tutorial
Deploy an interactive data science environment with JupyterHub on Docker Swarm
☆21 · May 30, 2016 · Updated 10 years ago
RTBHOUSE / chromium-fledge-tests
☆13 · Jul 2, 2026 · Updated 2 weeks ago
gnestor / magic-console
Interactive programming for Atom
☆13 · Jul 1, 2016 · Updated 10 years ago
iskandr / parakeet
Runtime compiler for numerical Python
☆235 · Jan 31, 2025 · Updated last year
nicolaprezza / lz-rlbwt
Run-length compressed BWT with LZ77 sampled suffix array
☆10 · Apr 25, 2022 · Updated 4 years ago
minrk / jskernel
☆24 · Mar 16, 2015 · Updated 11 years ago
jcrist / skein
A tool and library for easily deploying applications on Apache YARN
☆145 · Mar 12, 2024 · Updated 2 years ago
binder-project / binder
reproducible executable environments
☆444 · Oct 27, 2017 · Updated 8 years ago