☆92Nov 15, 2015Updated 10 years ago
Alternatives and similar repositories for sampleclean-async
Users that are interested in sampleclean-async are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆52Jul 2, 2017Updated 8 years ago
- ☆110Apr 17, 2017Updated 9 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆473Apr 18, 2017Updated 9 years ago
- ☆29Dec 3, 2016Updated 9 years ago
- Docker container capable of running an iPython notebook server, for "Just Enough Math"☆16Mar 31, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- BlinkDB: Sub-Second Approximate Queries on Very Large Data.☆661Feb 6, 2014Updated 12 years ago
- Distributed lbfgs on Apache Spark☆10Sep 25, 2020Updated 5 years ago
- CS294 RISE Course Material☆32Jan 23, 2019Updated 7 years ago
- An experimental distributed execution engine☆23Jul 23, 2020Updated 5 years ago
- machine learning library & code generator☆24Dec 17, 2014Updated 11 years ago
- A Tree Search Library for Data Cleaning☆22Feb 15, 2022Updated 4 years ago
- A Generalized Data Cleaning System☆51Apr 28, 2016Updated 10 years ago
- Additional useful algorithms that can be used with spark.☆24Dec 24, 2014Updated 11 years ago
- ☆11Jun 15, 2015Updated 10 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Add-on gem for creating graphs from AASM state machine definitions☆10Sep 29, 2021Updated 4 years ago
- Data-ish exploration through SQL+Uncertainty☆28Oct 31, 2022Updated 3 years ago
- An efficient updatable key-value store for Apache Spark☆255Mar 11, 2017Updated 9 years ago
- A demo of how to use PageRank with Hadoop and SociaLite to identify anomalies in Healthcare Data☆47Nov 25, 2015Updated 10 years ago
- An approXimate DB that supports online aggregation queries☆61Apr 16, 2024Updated 2 years ago
- Zipkin Mesos Framework☆31Feb 24, 2016Updated 10 years ago
- ☆25Jul 12, 2017Updated 8 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆146Jan 26, 2016Updated 10 years ago
- Regularized latent variable mixed membership modeling☆13Aug 12, 2013Updated 12 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 基于hanlp工具包的es分词插件☆10Mar 20, 2018Updated 8 years ago
- ☆14Apr 8, 2017Updated 9 years ago
- A framework for systematically quality controlling big data.☆41Mar 13, 2023Updated 3 years ago
- xlvector's solution of github contest☆33Aug 30, 2009Updated 16 years ago
- An open-source, vendor-neutral data context service.☆161Mar 6, 2018Updated 8 years ago
- Activator Template for BigPipe with Play, RxJava, and Hystrix☆20May 20, 2015Updated 11 years ago
- A we analytics and event tracking sleuth JavaScript library☆39Mar 29, 2017Updated 9 years ago
- A comprehensive benchmark for data cleaning methods and their impact of ML models☆16Jul 24, 2024Updated last year
- A Trie data structure that allows for fuzzy string matching☆11May 24, 2015Updated 11 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Reranking for Multi-objective Optimized Recommender Systems☆11Aug 3, 2023Updated 2 years ago
- Utilities for converting to and from JSON from Avro records via Hadoop streaming or Hive.☆30Oct 13, 2020Updated 5 years ago
- python client library☆10Feb 15, 2017Updated 9 years ago
- Cookiecutter for community-maintained Jupyter Docker images☆17May 4, 2026Updated 2 weeks ago
- Better code block highlighting with Prism☆12Apr 2, 2026Updated last month
- Provides interfaces, functions and codecs that can be used to encode/decode data to/from various formats.☆32Apr 3, 2017Updated 9 years ago
- A Datalog API for Spark☆25Sep 7, 2016Updated 9 years ago