4QuantOSS / sge_sparkLinks
The configuration and scripts for using spark within a Sun Grid Engine environment
☆24Updated 11 years ago
Alternatives and similar repositories for sge_spark
Users that are interested in sge_spark are comparing it to the libraries listed below
Sorting:
- Tools to manage jobs on supercomputer☆40Updated 9 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Updated 9 years ago
- Deploy Dask on DRMAA clusters☆40Updated 4 years ago
- Tool to easily start up an IPython cluster on different schedulers.☆146Updated 5 years ago
- VisTrails is an open-source data analysis and visualization tool. It provides a comprehensive provenance infrastructure that maintains de…☆104Updated 7 years ago
- Interactive notebooks for trying analyses and exploring datasets☆32Updated 10 years ago
- ggplot2 syntax in python. Actually wrapper around Wickham's ggplot2 in R☆71Updated 4 years ago
- ClusterJob: An automated system for painless and reproducible massive computational experiments☆20Updated last year
- Low-level primitives for collapsed Gibbs sampling in python and C++☆33Updated last year
- Slides and materials for workshop at University of Michigan MICDE☆19Updated 9 years ago
- permutation tests and confidence sets☆58Updated last year
- Everware is about re-useable science, it allows people to jump right in to your research code.☆116Updated 7 years ago
- Ready-to-go Parquet-formatted public 'omics datasets☆30Updated 9 years ago
- Materials and Jekyll website for the Wednesday software working group.☆10Updated 8 years ago
- A demo and tutorial for how to use git and singularity to make your research more portable and reproducible☆26Updated 8 years ago
- Default Repo description from terraform module☆5Updated 5 years ago
- Python edition of ActivePapers☆41Updated last year
- A Cython interface to FLANN☆24Updated 4 years ago
- Latent dirichlet allocation (LDA) for datamicroscopes☆41Updated 9 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 11 years ago
- A framework (comand line tool + libraries) for creating flexible compute pipelines☆56Updated 4 years ago
- ggplot2-inspired d3 app to make instant interactive visualizations☆55Updated 13 years ago
- Library for GPU-related statistical functions☆84Updated 12 years ago
- 💻 Material for a course on applied machine-learning for scientists. Taught at EPFL in spring 2017☆23Updated 8 years ago
- StarCluster is a utility for creating and managing computing clusters hosted on Amazon's Elastic Compute Cloud (EC2).☆36Updated 5 years ago
- Caleydo - Visualization for Molecular Biology☆59Updated 8 years ago
- DEPRECATED. Please visit our new repository (cytoscape/cyREST)☆27Updated 8 years ago
- scientific filesystem: a filesystem organization for scientific software and metadata☆34Updated last year
- Documented examples of Jupyterhub deployment in HPC settings☆36Updated last year
- %conda magic for IPython☆28Updated 8 years ago