modelop / augustus
Augustus is an open source system for building and scoring statistical models designed to work with data sets that are too large to fit into memory
☆43Updated 11 years ago
Alternatives and similar repositories for augustus:
Users that are interested in augustus are comparing it to the libraries listed below
- An example project for doing grid search in MLlib☆13Updated 10 years ago
- ☆24Updated 9 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆21Updated 8 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/)☆11Updated 10 years ago
- Heterogeneity-incorporating Workflow ApplicationMaster for YARN☆26Updated 7 years ago
- Apache Zeppelin on Kubernetes.☆28Updated 5 years ago
- Integrate the GA4GH schemas and probably a scala impl of the service.☆14Updated 8 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 6 years ago
- GPU Acceleration for Apache Spark☆34Updated 9 years ago
- Create and manage instances for data science☆20Updated 8 years ago
- Code to allow running BIDMach on Spark including HDFS integration and lightweight sparse model updates (Kylix).☆15Updated 4 years ago
- MaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion.☆14Updated 2 years ago
- self organizing map and variations implemented in Spark☆9Updated 8 years ago
- ☆31Updated 4 years ago
- Alchemist: an Apache Spark<->MPI interface☆26Updated 6 years ago
- The presentation at Spark Summit 2014 showing how 4Quant does production scale image processing and analysis using Spark☆17Updated 10 years ago
- pythonic access to fastbit☆26Updated 6 years ago
- Chef Cookbook for Hopsworks☆12Updated 2 months ago
- S3 backed ContentsManager for jupyter notebooks☆13Updated 8 years ago
- A scalable, distributed Time Series Database.☆28Updated 10 years ago
- A Pachyderm deep learning tutorial for conference workshops☆19Updated 7 years ago
- Templates for projects based on top of H2O.☆37Updated 2 months ago
- analytics tool kit☆43Updated 7 years ago
- [DEPRECATED] For read-only reference of the ALOJA Big Data Benchmarking platform: includes tools to define and deploy clusters, orchestr…☆23Updated 3 years ago
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.☆10Updated 9 years ago
- Infrastructure setup.☆11Updated 5 years ago
- Automates Spark standalone cluster tasks with Puppet and Fabric.☆43Updated 10 years ago
- An implementation of Nextflow.io with Language Workbench Technology. The project helps create computational pipelines that run with the N…☆22Updated 8 years ago