☆92Nov 15, 2015Updated 10 years ago
Alternatives and similar repositories for sampleclean-async
Users that are interested in sampleclean-async are comparing it to the libraries listed below
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆52Jul 2, 2017Updated 8 years ago
- ☆110Apr 17, 2017Updated 8 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Apr 18, 2017Updated 8 years ago
- Edit-distance-based similar string joiner and clusterer☆18Jul 2, 2015Updated 10 years ago
- Docker container capable of running an iPython notebook server, for "Just Enough Math"☆15Mar 31, 2023Updated 2 years ago
- BlinkDB: Sub-Second Approximate Queries on Very Large Data.☆660Feb 6, 2014Updated 12 years ago
- Distributed lbfgs on Apache Spark☆11Sep 25, 2020Updated 5 years ago
- CS294 RISE Course Material☆32Jan 23, 2019Updated 7 years ago
- An experimental distributed execution engine☆23Jul 23, 2020Updated 5 years ago
- A Tree Search Library for Data Cleaning☆22Feb 15, 2022Updated 4 years ago
- ☆11Jun 15, 2015Updated 10 years ago
- Add-on gem for creating graphs from AASM state machine definitions☆10Sep 29, 2021Updated 4 years ago
- An efficient updatable key-value store for Apache Spark☆254Mar 11, 2017Updated 9 years ago
- A demo of how to use PageRank with Hadoop and SociaLite to identify anomalies in Healthcare Data☆47Nov 25, 2015Updated 10 years ago
- Fine-Grained Distributed Computing☆11Feb 15, 2016Updated 10 years ago
- Project overview and links to various resources☆21Nov 6, 2021Updated 4 years ago
- An approXimate DB that supports online aggregation queries☆61Apr 16, 2024Updated last year
- Zipkin Mesos Framework☆31Feb 24, 2016Updated 10 years ago
- ☆25Jul 12, 2017Updated 8 years ago
- Smart Meter Data Analysis System☆11Jun 3, 2019Updated 6 years ago
- repository for R library "sbrlmod"☆26May 5, 2024Updated last year
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆146Jan 26, 2016Updated 10 years ago
- Regularized latent variable mixed membership modeling☆13Aug 12, 2013Updated 12 years ago
- An Elasticsearch (ES) word-segmentation plugin based on the HanLP toolkit☆10Mar 20, 2018Updated 8 years ago
- Scala library for converting Spark rows to case classes☆11Mar 14, 2017Updated 9 years ago
- ☆14Apr 8, 2017Updated 8 years ago
- A framework for systematically quality controlling big data.☆40Mar 13, 2023Updated 3 years ago
- xlvector's solution of github contest☆33Aug 30, 2009Updated 16 years ago
- An open-source, vendor-neutral data context service.☆160Mar 6, 2018Updated 8 years ago
- Activator Template for BigPipe with Play, RxJava, and Hystrix☆20May 20, 2015Updated 10 years ago
- A web analytics and event tracking sleuth JavaScript library☆39Mar 29, 2017Updated 8 years ago
- ☆11Jul 12, 2021Updated 4 years ago
- A challenge to investigate the security of the InstaHide protocol.☆12Dec 7, 2020Updated 5 years ago
- A Trie data structure that allows for fuzzy string matching☆11May 24, 2015Updated 10 years ago
- ☆14Aug 23, 2015Updated 10 years ago
- Reranking for Multi-objective Optimized Recommender Systems☆11Aug 3, 2023Updated 2 years ago
- Sample custom Nifi processor to process tcpdump☆18Nov 19, 2015Updated 10 years ago
- ☆30Aug 13, 2013Updated 12 years ago
- Benchmarking for factorized processing☆11Jul 5, 2021Updated 4 years ago