big-data-research / in-memory-data-pipeline
The code for the in-memory data pipeline that was presented at Berlin Buzzwords 2015.
☆10 · Jun 1, 2015 · Updated 10 years ago
Alternatives and similar repositories for in-memory-data-pipeline
Users who are interested in in-memory-data-pipeline are comparing it to the repositories listed below.
- Data science repo to help others ☆12 · Feb 10, 2016 · Updated 10 years ago
- Omnivore Optimizer and Distributed CcT ☆13 · Jun 17, 2016 · Updated 9 years ago
- ☆15 · Dec 6, 2016 · Updated 9 years ago
- Docker files for the example code in Big Data for Chimps ☆20 · May 19, 2015 · Updated 10 years ago
- Distributed Streaming Quantiles (for PySpark) ☆38 · Jan 30, 2014 · Updated 12 years ago
- A Neural network implementation with Scala ☆20 · Jul 17, 2016 · Updated 9 years ago
- install Cloudera's distribution of Hadoop including Cloudera Manager and Cloudera Search (Beta) ☆31 · Aug 16, 2013 · Updated 12 years ago
- FRED simulator and associated paper ☆26 · Jan 15, 2016 · Updated 10 years ago
- Additional useful algorithms that can be used with spark. ☆24 · Dec 24, 2014 · Updated 11 years ago
- ⛅ Run OpenVSCode Server in Google Cloud Shell ☆11 · Dec 22, 2023 · Updated 2 years ago
- Implementation of an algorithm computing the nearest "N" neighbours to a vector, using a collection of hyperplane hashers. ☆30 · Jul 17, 2015 · Updated 10 years ago
- A Hivemall wrapper for Spark ☆31 · Apr 21, 2016 · Updated 9 years ago
- Simple Spark example of generating table stats for use of data quality checks ☆28 · Apr 28, 2017 · Updated 8 years ago
- ADMM on Apache Spark ☆31 · Jul 21, 2015 · Updated 10 years ago
- Submission for CS50x online course by Harvard University ☆11 · Aug 7, 2014 · Updated 11 years ago
- An R package implementing Large-Scale Evidence Generation and Evaluation in a Network of Databases (LEGEND). ☆10 · Dec 18, 2020 · Updated 5 years ago
- This repository contains my slides and references for a presentation to the UW eScience Institute on using Docker for reproducible resear… ☆10 · Feb 11, 2015 · Updated 11 years ago
- An HA deployment of Kubernetes, using Kubespray and as few cloud components as possible! ☆33 · May 17, 2018 · Updated 7 years ago
- ☆33 · Jan 9, 2016 · Updated 10 years ago
- Benchmarks of artificial neural network library for Spark MLlib ☆11 · Dec 3, 2015 · Updated 10 years ago
- ☆13 · Feb 6, 2026 · Updated last week
- Examples of Selenium in Python ☆11 · Jun 11, 2018 · Updated 7 years ago
- code repository for Deep learning for NLP using Python (v), Published by Packt ☆11 · Jan 15, 2021 · Updated 5 years ago
- An R package for interfacing with a WebAPI instance ☆10 · Sep 5, 2025 · Updated 5 months ago
- Data Standards Hackathon for NGS based typing. ☆14 · Feb 2, 2026 · Updated last week
- FTRL-Proximal Online Learning Algorithm ☆15 · May 22, 2017 · Updated 8 years ago
- Single node Cloudera environment in docker ☆10 · Jan 16, 2016 · Updated 10 years ago
- A primal-dual framework for distributed L1-regularized optimization ☆36 · Apr 18, 2016 · Updated 9 years ago
- SQLAlchemy models and DDL and ERD generation from chop-dbhi/data-models style JSON endpoints. ☆11 · May 22, 2023 · Updated 2 years ago
- Factorization Machines for Julia ☆11 · Aug 26, 2016 · Updated 9 years ago
- Proximal Asynchronous SAGA ☆13 · Nov 30, 2017 · Updated 8 years ago
- A polystore database from researchers of the Intel Science and Technology Center for Big Data ☆39 · Oct 4, 2022 · Updated 3 years ago
- ☆40 · Feb 1, 2017 · Updated 9 years ago
- A connector for SingleStore and Spark ☆162 · Sep 24, 2025 · Updated 4 months ago
- elPrep reimplementations in C++ and Java, only for benchmark comparisons ☆10 · Feb 13, 2023 · Updated 3 years ago
- Checks if all your cluster members are receiving published messages over the bus. ☆14 · Jan 15, 2026 · Updated 3 weeks ago
- Project Design Review Checklist ☆13 · Sep 22, 2018 · Updated 7 years ago
- Django site with user registration functionality powered by Userena. ☆17 · Sep 22, 2013 · Updated 12 years ago
- Code for reproducing key results in the paper "Improving the Neural GPU Architecture for Algorithm Learning" by Karlis Freivalds, Renars … ☆13 · Jul 4, 2018 · Updated 7 years ago