mitdbg / bigdata
MIT Big Data Challenge
☆14 Updated 11 years ago
Alternatives and similar repositories for bigdata
Users who are interested in bigdata are comparing it to the libraries listed below
- dllib is a distributed deep learning library running on Apache Spark ☆32 Updated 7 years ago
- Sparse feature extraction with Spark ☆30 Updated 6 years ago
- A library of machine learning algorithms implemented using principles of functional programming. ☆23 Updated 8 years ago
- Exploration Library in Java ☆12 Updated last year
- Example code for building your own MemSQL Streamliner Pipelines ☆23 Updated 8 years ago
- Cascading and Scalding wrapper for HBase with advanced read features ☆54 Updated 5 years ago
- Machine Learning over Twitter's stream. Using Apache Spark, Web Server and Lightning Graph server. ☆27 Updated 8 years ago
- Repository for SF QConf 2015 Workshop ☆16 Updated 7 months ago
- scalding powered machine learning ☆109 Updated 10 years ago
- Starter project for building MemSQL Streamliner Pipelines ☆32 Updated 8 years ago
- An Akka Extension for easy integration of spark and cassandra in Akka micro services. ☆25 Updated 10 years ago
- A chef cookbook for deploying spark ☆30 Updated 12 years ago
- A collection of useful utility classes and functions. ☆9 Updated 4 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP ☆15 Updated 8 years ago
- Provided Guidance on Creating End to End Solutions for Common SILK Use Cases ☆13 Updated 9 years ago
- Data Science in Scala - Conf. Talk Repo ☆15 Updated 9 years ago
- This is an introduction of Apache Spark DataFrames. ☆41 Updated 10 years ago
- Sample custom Nifi processor to process tcpdump ☆18 Updated 9 years ago
- Example project which simulates an interesting analytics use case using MemSQL Pipelines. ☆14 Updated 8 years ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package ☆10 Updated 9 years ago
- Akka Cluster for Value-at-Risk calculation ☆14 Updated 11 years ago
- This project contains the code to translate between Apache Spark and SFrame. ☆20 Updated 8 years ago
- Text similarity based on Word2Vec vectors. ☆11 Updated 8 years ago
- ☆20 Updated 8 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines ☆29 Updated 9 years ago
- Benchmarks of artificial neural network library for Spark MLlib ☆11 Updated 9 years ago
- Project for the talk on NLP using LSTM implementation from DL4J on Spark ☆20 Updated 9 years ago
- ☆23 Updated 7 years ago
- Scriptable scheduler for periodical Hadoop workflows ☆22 Updated 7 years ago
- Use Cascading Taps and Scalding DSL with Spark ☆49 Updated 8 years ago