LinkedInAttic / datacl
A collection of efficient utilities for a data scientist.
☆41Updated 9 years ago
Alternatives and similar repositories for datacl:
Users that are interested in datacl are comparing it to the libraries listed below
- Scalable Machine Learning in Scalding☆360Updated 7 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆51Updated 7 years ago
- Raw Benchmark Data for Popular Machine Learning Frameworks☆56Updated 7 years ago
- demo clients☆20Updated 7 years ago
- Graph Analytics Engine☆260Updated 10 years ago
- Easy distributed TensorFlow on Hadoop (moved to: hops-tensorflow)☆9Updated 7 years ago
- Muppet☆126Updated 3 years ago
- ☆20Updated 8 years ago
- A fault tolerant, protocol-agnostic RPC system☆12Updated 7 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- Templates for projects based on top of H2O.☆38Updated 3 weeks ago
- A Seriously Fun guide to Big Data Analytics in Practice☆169Updated 9 years ago
- Reduce your data. A unix filter for algebird-powered aggregation.☆138Updated 7 years ago
- Antelope Realtime Events framework for feature engineering in agile machine learning environments.☆26Updated 9 years ago
- Neural Network engine for Veles distributed machine learning platform☆26Updated 8 years ago
- Repository for SF QConf 2015 Workshop☆16Updated 5 months ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- Deprecated - Check out MemSQL Pipelines instead!☆8Updated 7 years ago
- r³ is a map-reduce engine written in python using redis as a backend☆343Updated 12 years ago
- faceted search engine☆41Updated 10 years ago
- training material☆47Updated 5 months ago
- scalding powered machine learning☆109Updated 10 years ago
- Github mirror of "analytics/kafkatee" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access…☆21Updated last year
- Demo code contrasting Google Dataflow (Apache Beam) with Apache Spark☆14Updated 8 years ago
- Deprecated. Formerly: scripts to make it easier to set up and manipulate clusters at Amazon EC2☆110Updated 12 years ago
- A nozzle to spray a kafka topic at an HTTP endpoint. This project is deprecated and not maintained.☆49Updated 5 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc.☆58Updated 4 years ago
- Benchmarks of artificial neural network library for Spark MLlib☆11Updated 9 years ago
- VoltDB Click Stream Processing Example.☆16Updated 7 years ago
- A system and a Java API for large-scale graph processing based on Google's Pregel☆64Updated 12 years ago