Spirals-Team / hadoop-benchmarkLinks
Docker containers to build an Hadoop infrastructure and experiment feedback control loops atop of it.
☆9Updated 7 years ago
Alternatives and similar repositories for hadoop-benchmark
Users that are interested in hadoop-benchmark are comparing it to the libraries listed below
Sorting:
- Integrate the GA4GH schemas and probably a scala impl of the service.☆14Updated 9 years ago
- GPU Acceleration for Apache Spark☆34Updated 9 years ago
- The presentation at Spark Summit 2014 showing how 4Quant does production scale image processing and analysis using Spark☆17Updated 10 years ago
- ☆41Updated 8 years ago
- ☆24Updated 10 years ago
- ☆12Updated 9 years ago
- Trending on Accumulo☆40Updated 12 years ago
- Mirror of Apache MRQL (Incubating)☆17Updated 7 years ago
- Large-scale ML & graph analytics on Giraph☆78Updated 9 years ago
- ODPi specifications, developed by ODPi Runtime and ODPi Operations projects. Currently in Emeritus status☆35Updated 6 years ago
- HIPI: Hadoop Image Processing Interface☆133Updated 8 years ago
- Set up tools for running a few DL libraries on CDH and CDSW☆17Updated 4 years ago
- Additional useful algorithms that can be used with spark.☆24Updated 10 years ago
- Portable Format for Analytics☆27Updated 8 years ago
- ☆27Updated 10 years ago
- Real-time dashboard for Twitter Sentiment analysis using Spark Streaming and Watson Tone Analyzer☆31Updated 6 years ago
- ☆15Updated 7 years ago
- Spark exploration☆19Updated 10 years ago
- Code to allow running BIDMach on Spark including HDFS integration and lightweight sparse model updates (Kylix).☆15Updated 4 years ago
- Weka on Spark☆32Updated 7 years ago
- A Java Toolbox for Scalable Probabilistic Machine Learning☆121Updated last year
- analytics tool kit☆43Updated 8 years ago
- Benchmarks of BLAS libraries with Scala interface☆30Updated 9 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- Reproducing Distributed Systems and Experiments on Cloud☆40Updated last year
- An open source ML system for the end-to-end data science lifecycle☆37Updated 4 years ago
- CS294 RISE Course Material☆32Updated 6 years ago
- SAMOA (Scalable Advanced Massive Online Analysis) is an open-source platform for mining big data streams.☆425Updated 9 years ago
- Google Cloud Dataflow pipelines such as Identity-By-State as well as useful utility classes.☆37Updated last year
- Automates Spark standalone cluster tasks with Puppet and Fabric.☆43Updated 10 years ago