mitdbg / bigdata
MIT Big Data Challenge
☆14Updated 10 years ago
Alternatives and similar repositories for bigdata:
Users that are interested in bigdata are comparing it to the libraries listed below
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 7 years ago
- Reactive Outlier Detection Engine☆11Updated 9 years ago
- A library of machine learning algorithms implemented using principles of functional programming.☆23Updated 8 years ago
- A chef cookbook for deploying spark☆30Updated 11 years ago
- A system and a Java API for large-scale graph processing based on Google's Pregel☆64Updated 12 years ago
- An Akka Extension for easy integration of spark and cassandra in Akka micro services.☆25Updated 10 years ago
- Sparse feature extraction with Spark☆29Updated 6 years ago
- Trivial Spark app that counts Titan vertices☆10Updated 9 years ago
- Real-time query spark and visualise it as graph.☆24Updated 7 years ago
- Sparking Using Java8☆17Updated 9 years ago
- VoltDB Click Stream Processing Example.☆16Updated 7 years ago
- Stand-alone ANSI SQL for Cascading on Apache Hadoop☆48Updated 6 years ago
- PredictionIO word2vec engine template (Scala-based parallelized engine)☆12Updated 9 years ago
- A collection of Scala graph libraries and adapters for graph databases.☆14Updated 7 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- Thin reactive framework to provide and consume REST services☆48Updated 9 years ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package☆10Updated 9 years ago
- Machine Learning Open Source Software☆23Updated 6 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- Code base for DSLs In Action (http://www.manning.com/ghosh)☆43Updated 14 years ago
- Cascading and Scalding wrapper for HBase with advanced read features☆54Updated 4 years ago
- Machine Learning over Twitter's stream. Using Apache Spark, Web Server and Lightning Graph server.☆27Updated 8 years ago
- Alenka JDBC is a library for accessing and manipulating data with the open-source GPU database Alenka.☆19Updated 10 years ago
- Embedded Kafka for testing and quick prototyping.☆14Updated 8 years ago
- chef cookbook to install Apache Spark☆10Updated 9 years ago