4QuantOSS / sge_sparkLinks
The configuration and scripts for using spark within a Sun Grid Engine environment
☆24Updated 11 years ago
Alternatives and similar repositories for sge_spark
Users that are interested in sge_spark are comparing it to the libraries listed below
Sorting:
- Tools to manage jobs on supercomputer☆41Updated 10 years ago
- Deploy Dask on DRMAA clusters☆40Updated 4 years ago
- Tool to easily start up an IPython cluster on different schedulers.☆146Updated 5 years ago
- Interactive notebooks for trying analyses and exploring datasets☆32Updated 10 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Updated 9 years ago
- VisTrails is an open-source data analysis and visualization tool. It provides a comprehensive provenance infrastructure that maintains de…☆104Updated 8 years ago
- ggplot2 syntax in python. Actually wrapper around Wickham's ggplot2 in R☆71Updated 4 years ago
- Caleydo - Visualization for Molecular Biology☆60Updated 9 years ago
- StarCluster is a utility for creating and managing computing clusters hosted on Amazon's Elastic Compute Cloud (EC2).☆36Updated 5 years ago
- Easily map Python functions onto a cluster using a DRMAA-compatible grid engine like Sun Grid Engine (SGE).☆84Updated 2 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 11 years ago
- High-performance Non-negative Matrix Factorizations (NMF) - Python/C++☆49Updated 7 years ago
- Example Cloud Datalab iPython Notebooks for genomics use cases.☆25Updated 8 years ago
- A Cython interface to FLANN☆24Updated 5 years ago
- Everware is about re-useable science, it allows people to jump right in to your research code.☆115Updated 7 years ago
- permutation tests and confidence sets☆58Updated 3 months ago
- Slides and materials for workshop at University of Michigan MICDE☆19Updated 9 years ago
- PyRDM is a Python-based library for research data management (RDM). It facilitates the automated publication of scientific software and a…☆32Updated 4 years ago
- Ready-to-go Parquet-formatted public 'omics datasets☆30Updated 10 years ago
- Structured Machine Learning in Python☆46Updated 2 years ago
- Hive Plots in using Python & matplotlib!☆71Updated 7 years ago
- Latent dirichlet allocation (LDA) for datamicroscopes☆41Updated 10 years ago
- Low-level primitives for collapsed Gibbs sampling in python and C++☆33Updated last year
- ☆92Updated 5 years ago
- Library for GPU-related statistical functions☆84Updated 13 years ago
- CGAT-ruffus is a lightweight python module for running computational pipelines☆175Updated 4 years ago
- ClusterJob: An automated system for painless and reproducible massive computational experiments☆20Updated last year
- A demo and tutorial for how to use git and singularity to make your research more portable and reproducible☆26Updated 9 years ago
- Tool for running/managing ad hoc spark clusters on a Slurm cluster☆25Updated 3 years ago
- Bayesian Factorization with Side Information in C++ with Python wrapper☆41Updated 4 years ago