Deprecated. Please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead.
☆53 · Jul 3, 2018 · Updated 7 years ago
Alternatives and similar repositories for knit
Users who are interested in knit compare it to the libraries listed below.
- A wrapper for libhdfs3 to interact with HDFS from Python · ☆137 · Feb 9, 2021 · Updated 5 years ago
- Cython based wrapper for libavro · ☆25 · Sep 14, 2020 · Updated 5 years ago
- Share & re-use Jupyter Notebooks · ☆12 · Sep 4, 2024 · Updated last year
- C++ native client for Impala and Hive, with Python / pandas bindings · ☆72 · Aug 15, 2018 · Updated 7 years ago
- Minimalistic utility library to manage conda environments for pyspark jobs on yarn clusters · ☆10 · Dec 26, 2022 · Updated 3 years ago
- Dask and Spark interactions · ☆21 · Mar 13, 2017 · Updated 9 years ago
- ☆19 · Jul 15, 2018 · Updated 7 years ago
- A multi-tenant server for securely deploying and managing Dask clusters. · ☆143 · Mar 2, 2026 · Updated 2 weeks ago
- Add conda activation to an IPython kernel spec · ☆10 · Mar 12, 2019 · Updated 7 years ago
- Provides a unified interface to dealing with Conda environments. · ☆94 · May 15, 2017 · Updated 8 years ago
- Unified interface for local and distributed ndarrays · ☆157 · Oct 13, 2018 · Updated 7 years ago
- ☆37 · Feb 20, 2017 · Updated 9 years ago
- These are the IPython notebook files for the CSC 432 Spring '13 course. · ☆23 · Mar 31, 2015 · Updated 10 years ago
- Extensible Python Framework for Apache Mesos · ☆33 · Oct 19, 2017 · Updated 8 years ago
- Resolve data table conflicts · ☆17 · Jun 11, 2015 · Updated 10 years ago
- API for converting JVM objects to representations by MIME type, for the Jupyter ecosystem. · ☆25 · Jan 16, 2020 · Updated 6 years ago
- Convert pure-Python wheels to conda packages (experimental) · ☆11 · May 11, 2018 · Updated 7 years ago
- 🚧⛔ Tools help with continuous integration on services such as travis-ci and appveyor. (mostly replaced by conda-smithy) · ☆14 · Jun 20, 2018 · Updated 7 years ago
- Useful Mutable Mappings · ☆72 · Oct 31, 2023 · Updated 2 years ago
- dask-searchcv is now part of dask-ml: https://github.com/dask/dask-ml · ☆241 · Oct 13, 2018 · Updated 7 years ago
- Dockerfile for Apache Zeppelin · ☆17 · Dec 9, 2015 · Updated 10 years ago
- Source Material for using Python and Hadoop together · ☆13 · Mar 14, 2017 · Updated 9 years ago
- Start a cluster in EC2 for dask.distributed · ☆105 · Nov 3, 2020 · Updated 5 years ago
- Partitioned storage system based on blosc. **No longer actively maintained.** · ☆156 · Nov 21, 2016 · Updated 9 years ago
- Python library for configuring a package including defaults, env variable loading, and yaml loading. · ☆42 · Mar 3, 2026 · Updated 2 weeks ago
- General purpose, language-agnostic Continuous Benchmarking (CB) framework · ☆35 · Apr 15, 2020 · Updated 5 years ago
- ☆19 · Jan 6, 2019 · Updated 7 years ago
- This repository is deprecated. · ☆12 · Aug 26, 2017 · Updated 8 years ago
- A Python library for dealing with splittable files · ☆42 · Dec 10, 2019 · Updated 6 years ago
- Topics of conferences · ☆12 · Jul 12, 2016 · Updated 9 years ago
- Automatically generate Cython pxd files from C headers · ☆47 · Mar 24, 2018 · Updated 7 years ago
- VariantSpark is a framework for applying Spark-based Machine Learning methods to whole-genome variant information · ☆33 · Sep 28, 2017 · Updated 8 years ago
- The fundamental package for scientific computing with Python. · ☆22 · Dec 23, 2023 · Updated 2 years ago
- 🔥 binders · ☆10 · Mar 4, 2018 · Updated 8 years ago
- Chef cookbook for Continuum Analytic's Anaconda: "completely free Python distribution for large-scale data processing, predictive analyti… · ☆15 · Nov 21, 2018 · Updated 7 years ago
- Dockerized setup for testing code on realistic hadoop clusters · ☆26 · Jul 20, 2020 · Updated 5 years ago
- Machines and people collaborating together through Jupyter notebooks. · ☆18 · Aug 24, 2017 · Updated 8 years ago
- ☆31 · May 23, 2018 · Updated 7 years ago
- A general scraper/exporter for matplotlib plots · ☆49 · Jan 2, 2026 · Updated 2 months ago