☆92Nov 15, 2015Updated 10 years ago
Alternatives and similar repositories for sampleclean-async
Users that are interested in sampleclean-async are comparing it to the libraries listed below
Sorting:
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆52Jul 2, 2017Updated 8 years ago
- ☆110Apr 17, 2017Updated 8 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Apr 18, 2017Updated 8 years ago
- ☆28Dec 3, 2016Updated 9 years ago
- Add-on gem for creating graphs from AASM state machine definitions☆10Sep 29, 2021Updated 4 years ago
- Scala non-blocking Aerospike client (archived as unmaintained)☆20Jan 25, 2019Updated 7 years ago
- Regularized latent variable mixed membership modeling☆13Aug 12, 2013Updated 12 years ago
- An efficient updatable key-value store for Apache Spark☆254Mar 11, 2017Updated 8 years ago
- Distributed lbfgs on Apache Spark☆11Sep 25, 2020Updated 5 years ago
- ☆11Jun 15, 2015Updated 10 years ago
- Data-ish exploration through SQL+Uncertainty☆27Oct 31, 2022Updated 3 years ago
- machine learning library & code generator☆24Dec 17, 2014Updated 11 years ago
- ☆11Nov 20, 2020Updated 5 years ago
- Additional useful algorithms that can be used with spark.☆24Dec 24, 2014Updated 11 years ago
- Smart Meter Data Analysis System☆11Jun 3, 2019Updated 6 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆146Jan 26, 2016Updated 10 years ago
- A Trie data structure that allows for fuzzy string matching☆11May 24, 2015Updated 10 years ago
- Sample Play 2.1/2.2 application to demonstrate web sockets usage☆40Dec 29, 2014Updated 11 years ago
- Zipkin Mesos Framework☆31Feb 24, 2016Updated 10 years ago
- BlinkDB: Sub-Second Approximate Queries on Very Large Data.☆659Feb 6, 2014Updated 12 years ago
- ☆61Mar 8, 2012Updated 13 years ago
- Docker container capable of running an iPython notebook server, for "Just Enough Math"☆15Mar 31, 2023Updated 2 years ago
- Sample custom Nifi processor to process tcpdump☆18Nov 19, 2015Updated 10 years ago
- Dependency and data pipeline management framework for Spark and Scala☆15Apr 8, 2017Updated 8 years ago
- Open Source Log, Exception, Metrics management.☆15Aug 19, 2017Updated 8 years ago
- Utilities for converting to and from JSON from Avro records via Hadoop streaming or Hive.☆29Oct 13, 2020Updated 5 years ago
- ☆16Feb 8, 2020Updated 6 years ago
- Provides interfaces, functions and codecs that can be used to encode/decode data to/from various formats.☆32Apr 3, 2017Updated 8 years ago
- A scala based DSL and framework for writing and executing bioinformatics pipelines as Directed Acyclic GRaphs☆69May 27, 2022Updated 3 years ago
- A we analytics and event tracking sleuth JavaScript library☆39Mar 29, 2017Updated 8 years ago
- Activator Template for BigPipe with Play, RxJava, and Hystrix☆20May 20, 2015Updated 10 years ago
- Instance agent for collecting metrics and logs☆16Aug 15, 2017Updated 8 years ago
- ☆40Aug 31, 2016Updated 9 years ago
- Scalable Java Disque client☆35Jul 2, 2016Updated 9 years ago
- A Tree Search Library for Data Cleaning☆22Feb 15, 2022Updated 4 years ago
- Project overview and links to various resources☆21Nov 6, 2021Updated 4 years ago
- X-Trace is a tool that provides fine-grained visibility into large, complex distributed systems. It can be used by application developers…☆28Jun 9, 2014Updated 11 years ago
- Pushmanager is a web application to manage source code deployments.☆38Mar 25, 2015Updated 10 years ago
- C++ APIs for Alluxio (formerly Tachyon)☆18Nov 29, 2016Updated 9 years ago