mitdbg / bigdata
MIT Big Data Challenge
☆14 Updated 11 years ago
Alternatives and similar repositories for bigdata
Users who are interested in bigdata are comparing it to the repositories listed below
- Exploration Library in Java ☆12 Updated last year
- VoltDB Click Stream Processing Example. ☆16 Updated 7 years ago
- An Akka Extension for easy integration of spark and cassandra in Akka micro services. ☆25 Updated 10 years ago
- Embedded Kafka for testing and quick prototyping. ☆14 Updated 9 years ago
- A chef cookbook for deploying spark ☆30 Updated 12 years ago
- dllib is a distributed deep learning library running on Apache Spark ☆32 Updated 7 years ago
- Sparse feature extraction with Spark ☆30 Updated 6 years ago
- This project contains the code to translate between Apache Spark and SFrame. ☆20 Updated 8 years ago
- Code for the course Principles Of Reactive Programming, Spring 2015 session ☆23 Updated 8 years ago
- Examples for Fast Data Processing with Spark ☆59 Updated 11 years ago
- Real-time query spark and visualise it as graph. ☆24 Updated 7 years ago
- A library of machine learning algorithms implemented using principles of functional programming. ☆23 Updated 8 years ago
- Example code for building your own MemSQL Streamliner Pipelines ☆23 Updated 8 years ago
- Starter project for building MemSQL Streamliner Pipelines ☆32 Updated 8 years ago
- Use Cascading Taps and Scalding DSL with Spark ☆49 Updated 8 years ago
- ☆24 Updated 10 years ago
- scalding powered machine learning ☆109 Updated 10 years ago
- A collection of efficient utilities for a data scientist. ☆41 Updated 10 years ago
- ReactiveLDA is a fast, lightweight implementation of the Latent Dirichlet Allocation (LDA) algorithm, using a parallel vanilla Gibbs samp… ☆61 Updated 9 years ago
- Sample custom Nifi processor to process tcpdump ☆18 Updated 9 years ago
- REST API server with built in auth, interface to ScyllaDB/Cassandra ☆24 Updated 7 years ago
- Tweet Analysis with Spark ☆15 Updated 7 years ago
- Cascading and Scalding wrapper for HBase with advanced read features ☆54 Updated 5 years ago
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015. ☆10 Updated 10 years ago
- Spark exploration ☆19 Updated 10 years ago
- phData Pulse application log aggregation and monitoring ☆13 Updated 5 years ago
- Reduce your data. A unix filter for algebird-powered aggregation. ☆138 Updated 8 years ago
- VW, Liblinear and StreamSVM compared on webspam ☆14 Updated 10 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines ☆29 Updated 9 years ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package ☆10 Updated 9 years ago