bigdatagenomics / adam
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
☆996Updated 3 weeks ago
Related projects: ⓘ
- Simplifying robust end-to-end machine learning on Apache Spark.☆468Updated 7 years ago
- Mirror of Apache Toree (Incubating)☆737Updated 2 weeks ago
- CSV Data Source for Apache Spark 1.x☆1,053Updated 5 years ago
- Sparkling Water provides H2O functionality inside Spark cluster☆961Updated 2 months ago
- ☆399Updated this week
- A library for time series analysis on Apache Spark☆1,191Updated 3 years ago
- Avro Data Source for Apache Spark☆539Updated 5 years ago
- [DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark☆749Updated last month
- Performance tests for Apache Spark☆379Updated 6 years ago
- Scripts used to setup a Spark cluster on EC2☆392Updated 6 years ago
- Stanford CoreNLP wrapper for Apache Spark☆422Updated 5 years ago
- Distributed Neural Networks for Spark☆603Updated 4 years ago
- Interactive Scala REPL in a browser☆742Updated 2 years ago
- A Scala kernel for Jupyter☆1,591Updated last month
- A scalable machine learning library on Apache Spark☆793Updated 3 years ago
- The missing MatPlotLib for Scala + Spark☆730Updated 2 years ago
- Interactive and Reactive Data Science using Scala and Spark.☆3,152Updated last year
- A tool for monitoring and tuning Spark jobs for efficiency.☆357Updated last year
- Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code☆290Updated last month
- Spark Knowledge Base☆334Updated 3 years ago
- Streaming MapReduce with Scalding and Storm☆2,139Updated 2 years ago
- Spark reference applications☆656Updated 7 months ago
- A Scala API for Cascading☆3,495Updated last year
- ☆334Updated this week
- Distributed decision tree ensemble learning in Scala☆391Updated 5 years ago
- A Scala API for Apache Beam and Google Cloud Dataflow.☆2,550Updated this week
- ☆400Updated this week
- Redshift data source for Apache Spark☆605Updated last year
- A free tutorial for Apache Spark.☆979Updated 3 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,008Updated last year
- A Scala feature transformation library for data science and machine learning☆466Updated 2 weeks ago