4QuantOSS / sge_spark
The configuration and scripts for using Spark within a Sun Grid Engine environment
☆24Updated 11 years ago
Alternatives and similar repositories for sge_spark
Users that are interested in sge_spark are comparing it to the libraries listed below
- Tools to manage jobs on supercomputer☆40Updated 9 years ago
- Interactive notebooks for trying analyses and exploring datasets☆32Updated 9 years ago
- Deploy Dask on DRMAA clusters☆40Updated 4 years ago
- Tool to easily start up an IPython cluster on different schedulers.☆147Updated 5 years ago
- High-performance Non-negative Matrix Factorizations (NMF) - Python/C++☆49Updated 7 years ago
- permutation tests and confidence sets☆58Updated last year
- Ready-to-go Parquet-formatted public 'omics datasets☆30Updated 9 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Updated 8 years ago
- Computational reproducibility using Continuous Integration to produce verifiable end-to-end runs of scientific analysis.☆82Updated 4 years ago
- Generation of CWL programmatically. Available types: CommandLineTool and DockerRequirement☆29Updated 5 years ago
- Everware is about reusable science; it allows people to jump right into your research code.☆116Updated 7 years ago
- ggplot2 syntax in Python. Actually a wrapper around Wickham's ggplot2 in R☆71Updated 3 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 11 years ago
- DEPRECATED. Please visit our new repository (cytoscape/cyREST)☆27Updated 8 years ago
- StarCluster is a utility for creating and managing computing clusters hosted on Amazon's Elastic Compute Cloud (EC2).☆36Updated 5 years ago
- VisTrails is an open-source data analysis and visualization tool. It provides a comprehensive provenance infrastructure that maintains de…☆104Updated 7 years ago
- A demo and tutorial for how to use git and singularity to make your research more portable and reproducible☆26Updated 8 years ago
- Example Cloud Datalab iPython Notebooks for genomics use cases.☆25Updated 7 years ago
- An accelerated framework for manipulating and interpreting high-throughput sequencing data☆26Updated 12 years ago
- ☆16Updated 8 years ago
- ClusterJob: An automated system for painless and reproducible massive computational experiments☆20Updated last year
- Low-level primitives for collapsed Gibbs sampling in python and C++☆33Updated last year
- Efficient storage of same-type, uneven-size arrays☆12Updated 6 years ago
- A framework (command line tool + libraries) for creating flexible compute pipelines☆56Updated 4 years ago
- Tree hidden Markov model for learning epigenetic states in multiple cell types☆28Updated 12 years ago
- FDA High-performance Integrated Virtual Environment (HIVE)☆38Updated 4 years ago
- GURLS: a Least Squares Library for Supervised Learning☆62Updated 9 years ago
- Default Repo description from terraform module☆5Updated 5 years ago
- Easily map Python functions onto a cluster using a DRMAA-compatible grid engine like Sun Grid Engine (SGE).☆84Updated 2 years ago
- a machine learning platform for teams☆19Updated 7 years ago