mitdbg / bigdata
MIT Big Data Challenge
☆14Updated 10 years ago
Alternatives and similar repositories for bigdata:
Users that are interested in bigdata are comparing it to the libraries listed below
- VoltDB Click Stream Processing Example.☆16Updated 7 years ago
- A library of machine learning algorithms implemented using principles of functional programming.☆23Updated 8 years ago
- An Akka Extension for easy integration of spark and cassandra in Akka micro services.☆25Updated 10 years ago
- dllib is a distributed deep learning library running on Apache Spark☆32Updated 7 years ago
- Repository for SF QConf 2015 Workshop☆16Updated 3 months ago
- Templates for projects based on top of H2O.☆37Updated 3 months ago
- A collection of Scala graph libraries and adapters for graph databases.☆14Updated 8 years ago
- Provided Guidance on Creating End to End Solutions for Common SILK Use Cases☆13Updated 9 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 7 years ago
- scalding powered machine learning☆109Updated 10 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆21Updated 8 years ago
- Use Cascading Taps and Scalding DSL with Spark☆49Updated 8 years ago
- A chef cookbook for deploying spark☆30Updated 11 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ.☆32Updated 2 years ago
- ☆24Updated 9 years ago
- Sample custom Nifi processor to process tcpdump☆18Updated 9 years ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package☆10Updated 9 years ago
- Alenka JDBC is a library for accessing and manipulating data with the open-source GPU database Alenka.☆19Updated 10 years ago
- Sparse feature extraction with Spark☆30Updated 6 years ago
- Text similarity based on Word2Vec vectors.☆11Updated 8 years ago
- Exploration Library in Java☆12Updated last year
- A collection of efficient utilities for a data scientist.☆41Updated 9 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- Quick starts for Teiid WildFly☆25Updated 5 years ago
- Reactive Outlier Detection Engine☆11Updated 9 years ago
- Big GeoSpatial Data Points Visualization Tool☆19Updated 8 years ago
- Distributed SQL query engine for big data☆10Updated last year
- An example project for doing grid search in MLlib☆13Updated 10 years ago
- Playground for instrumenting `scalac` using AspectJ.☆43Updated 10 years ago
- Data-ish exploration through SQL+Uncertainty☆27Updated 2 years ago