sjyk/sampleclean-async

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sjyk/sampleclean-async)

sjyk / sampleclean-async

☆92

Alternatives and similar repositories for sampleclean-async

Users that are interested in sampleclean-async are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

amplab / ampcrowd
View on GitHub
A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.
☆52Jul 2, 2017Updated 9 years ago
sjyk / sampleclean
View on GitHub
SampleClean+BlinkDB
☆18May 21, 2014Updated 12 years ago
amplab / velox-modelserver
View on GitHub
☆110Apr 17, 2017Updated 9 years ago
amplab / keystone
View on GitHub
Simplifying robust end-to-end machine learning on Apache Spark.
☆473Apr 18, 2017Updated 9 years ago
h2oai / h2o-sparkling
View on GitHub
DEPRECATED! Use https://github.com/h2oai/sparkling-water repository! H2O and Spark interoperability based on Tachyon.
☆44Nov 25, 2014Updated 11 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
sameeragarwal / blinkdb
View on GitHub
BlinkDB: Sub-Second Approximate Queries on Very Large Data.
☆660Feb 6, 2014Updated 12 years ago
LHAC / dac
View on GitHub
Distributed lbfgs on Apache Spark
☆10Sep 25, 2020Updated 5 years ago
AtlasPilotPuppy / SparkAlgorithms
View on GitHub
Additional useful algorithms that can be used with spark.
☆24Dec 24, 2014Updated 11 years ago
amplab / spark-indexedrdd
View on GitHub
An efficient updatable key-value store for Apache Spark
☆255Mar 11, 2017Updated 9 years ago
rbalasub / jigsaw
View on GitHub
Regularized latent variable mixed membership modeling
☆13Aug 12, 2013Updated 12 years ago
amplab / orchestra
View on GitHub
Fine-Grained Distributed Computing
☆11Feb 15, 2016Updated 10 years ago
aasm / aasm_graph
View on GitHub
Add-on gem for creating graphs from AASM state machine definitions
☆10Sep 29, 2021Updated 4 years ago
sparksummit / 2015
View on GitHub
☆11Jun 15, 2015Updated 11 years ago
Yeye-He / Self-Service-Data-Preparation
View on GitHub
Project overview and links to various resources
☆21Nov 6, 2021Updated 4 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
DeepSE / DeepCodingBaselines
View on GitHub
☆24Jul 12, 2017Updated 9 years ago
elodina / zipkin-mesos-framework
View on GitHub
Zipkin Mesos Framework
☆31Feb 24, 2016Updated 10 years ago
mediative / sparrow
View on GitHub
Scala library for converting Spark rows to case classes
☆11Mar 14, 2017Updated 9 years ago
gitchennan / elasticsearch-analysis-lc
View on GitHub
基于hanlp工具包的es分词插件
☆10Mar 20, 2018Updated 8 years ago
morlay / gitbook-plugin-mermaid-2
View on GitHub
☆14Apr 8, 2017Updated 9 years ago
ground-context / ground
View on GitHub
An open-source, vendor-neutral data context service.
☆163Mar 6, 2018Updated 8 years ago
qcri / NADEEF
View on GitHub
A Generalized Data Cleaning System
☆52Apr 28, 2016Updated 10 years ago
dvasilen / Hive-XML-SerDe
View on GitHub
XML Serializer/Deserializer for Apache Hive
☆41Sep 25, 2019Updated 6 years ago
jamra / LevenshteinTrie
View on GitHub
A Trie data structure that allows for fuzzy string matching
☆11May 24, 2015Updated 11 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
collectivemedia / spark-ext
View on GitHub
Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark
☆145Jan 26, 2016Updated 10 years ago
GeoscienceAustralia / sira
View on GitHub
Systemic Infrastructure Resilience Analysis
☆13Jun 24, 2026Updated last month
findopendata / findopendata
View on GitHub
A search engine for Open Data
☆60Mar 15, 2023Updated 3 years ago
mighdoll / sparkle
View on GitHub
visualization server
☆138Oct 13, 2015Updated 10 years ago
Tapad / scaerospike
View on GitHub
Scala non-blocking Aerospike client (archived as unmaintained)
☆20Jan 25, 2019Updated 7 years ago
ykifle / audiograph
View on GitHub
A GUI for messing with the Web Audio API
☆21Oct 17, 2012Updated 13 years ago
stratosphere / stratosphere
View on GitHub
Stratosphere is now Apache Flink.
☆201Dec 16, 2023Updated 2 years ago
jupyter / cookiecutter-docker-stacks
View on GitHub
Cookiecutter for community-maintained Jupyter Docker images
☆18Jul 7, 2026Updated 2 weeks ago
yahoo / storm-yarn
View on GitHub
Storm-yarn enables Storm clusters to be deployed into machines managed by Hadoop YARN.
☆418Jul 21, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
mandubian / zpark-ztream
View on GitHub
Driving Spark stream with Scalaz-Stream
☆26Mar 18, 2014Updated 12 years ago
blackboxnlp / 2020
View on GitHub
☆11Nov 20, 2020Updated 5 years ago
concord / concord-py
View on GitHub
python client library
☆10Feb 15, 2017Updated 9 years ago
Tapad / gulp-angular-builder
View on GitHub
Gulp plugin to filter and include only necessary AngularJS files. (archived as unmaintained)
☆14Dec 22, 2015Updated 10 years ago
ianozsvald / learning_text_transformer_demo
View on GitHub
Demo code for learning_text_transformer
☆25Feb 22, 2015Updated 11 years ago
Aetf / hexo-prism-plus
View on GitHub
Better code block highlighting with Prism
☆12Apr 2, 2026Updated 3 months ago
josephxsxn / moya
View on GitHub
Memcached on YARN
☆19Jun 2, 2014Updated 12 years ago