mitdbg / bigdata
MIT Big Data Challenge
☆14Updated 10 years ago
Related projects: ⓘ
- A library of machine learning algorithms implemented using principles of functional programming.☆22Updated 7 years ago
- ☆27Updated this week
- Sparse feature extraction with Spark☆29Updated 6 years ago
- Real-time query spark and visualise it as graph.☆24Updated 6 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆21Updated 8 years ago
- ☆16Updated this week
- dllib is a distributed deep learning library running on Apache Spark☆32Updated 6 years ago
- Spark In MapReduce (SIMR) - launching Spark applications on existing Hadoop MapReduce infrastructure☆45Updated 2 years ago
- VoltDB Click Stream Processing Example.☆16Updated 6 years ago
- Reactive Outlier Detection Engine☆12Updated 9 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 7 years ago
- ☆21Updated this week
- A system and a Java API for large-scale graph processing based on Google's Pregel☆63Updated 11 years ago
- This is an introduction of Apache Spark DataFrames.☆41Updated 9 years ago
- Reduce your data. A unix filter for algebird-powered aggregation.☆138Updated 7 years ago
- A collection of Scala graph libraries and adapters for graph databases.☆14Updated 7 years ago
- GPU Acceleration for Apache Spark☆34Updated 9 years ago
- A chef cookbook for deploying spark☆30Updated 11 years ago
- approximate streaming quantiles☆31Updated 10 years ago
- Machine Learning Open Source Software☆23Updated 6 years ago
- Code and Data Samples for Big Data Warehousing.☆10Updated 9 years ago
- DEPRECATED! Use https://github.com/h2oai/sparkling-water repository! H2O and Spark interoperability based on Tachyon.☆44Updated 9 years ago
- Data Science in Scala - Conf. Talk Repo☆15Updated 8 years ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package☆10Updated 9 years ago
- An Akka Extension for easy integration of spark and cassandra in Akka micro services.☆25Updated 9 years ago
- A collection of efficient utilities for a data scientist.☆40Updated 9 years ago
- Set of Hadoop, Spark and Storm based tools for web and customer analytic☆34Updated 3 years ago
- ☆31Updated this week
- Embedded Kafka for testing and quick prototyping.☆14Updated 8 years ago
- ☆40Updated this week