4QuantOSS / sge_sparkLinks
The configuration and scripts for using spark within a Sun Grid Engine environment
☆24Updated 11 years ago
Alternatives and similar repositories for sge_spark
Users that are interested in sge_spark are comparing it to the libraries listed below
Sorting:
- Tools to manage jobs on supercomputer☆40Updated 10 years ago
- Tool to easily start up an IPython cluster on different schedulers.☆147Updated 5 years ago
- StarCluster is a utility for creating and managing computing clusters hosted on Amazon's Elastic Compute Cloud (EC2).☆36Updated 5 years ago
- VisTrails is an open-source data analysis and visualization tool. It provides a comprehensive provenance infrastructure that maintains de…☆104Updated 7 years ago
- ggplot2 syntax in python. Actually wrapper around Wickham's ggplot2 in R☆71Updated 4 years ago
- Slides and materials for workshop at University of Michigan MICDE☆19Updated 9 years ago
- scikit-learn addon to operate on set/"group"-based features☆41Updated 9 years ago
- ClusterJob: An automated system for painless and reproducible massive computational experiments☆20Updated last year
- High-performance Non-negative Matrix Factorizations (NMF) - Python/C++☆49Updated 7 years ago
- Deploy Dask on DRMAA clusters☆40Updated 4 years ago
- Interactive notebooks for trying analyses and exploring datasets☆32Updated 10 years ago
- Magpie contains a number of scripts for running Big Data software in HPC environments, including Hadoop and Spark. There is support for L…☆198Updated 8 months ago
- Hive Plots in using Python & matplotlib!☆71Updated 7 years ago
- Ready-to-go Parquet-formatted public 'omics datasets☆30Updated 9 years ago
- %conda magic for IPython☆28Updated 8 years ago
- CGAT-ruffus is a lightweight python module for running computational pipelines☆174Updated 4 years ago
- permutation tests and confidence sets☆58Updated last month
- A demo and tutorial for how to use git and singularity to make your research more portable and reproducible☆26Updated 9 years ago
- scientific filesystem: a filesystem organization for scientific software and metadata☆34Updated last year
- Everware is about re-useable science, it allows people to jump right in to your research code.☆116Updated 7 years ago
- ☆28Updated 8 years ago
- Tool for running/managing ad hoc spark clusters on a Slurm cluster☆25Updated 3 years ago
- Library of common tools for machine learning research.☆41Updated 7 years ago
- StarCluster is an open source cluster-computing toolkit for Amazon's Elastic Compute Cloud (EC2).☆582Updated 3 years ago
- ☆92Updated 5 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 11 years ago
- Library of composable generative population models which serve as the modeling and inference backend of BayesDB.☆25Updated last year
- Caleydo - Visualization for Molecular Biology☆60Updated 8 years ago
- Resources for a talk about streaming data analysis in Python☆15Updated 10 years ago
- Python edition of ActivePapers☆41Updated last year