ankurdave/kmeans-spark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ankurdave/kmeans-spark)

ankurdave / kmeans-spark

A simple implementation of k-means clustering on the Spark cluster computing framework. See http://cs.berkeley.edu/~matei/spark.

☆26

Alternatives and similar repositories for kmeans-spark

Users that are interested in kmeans-spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MLWhiz / Spark_Projects
View on GitHub
Spark Projects for the Berkeley Data Science Course
☆13Aug 12, 2015Updated 10 years ago
muricoca / recommendation-lectures
View on GitHub
Guide to Recommender Systems
☆14Feb 24, 2012Updated 14 years ago
kijiproject / kiji-express
View on GitHub
☆16Sep 26, 2014Updated 11 years ago
adamliesko / bigdata-spark
View on GitHub
BerkeleyX: CS100.1x, Introduction to Big Data with Apache Spark
☆10Jul 27, 2015Updated 11 years ago
abhishek-ch / Awesome_Algorithm
View on GitHub
Collection of Interesting Algorithms
☆16Oct 13, 2020Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
pwendell / spark-twitter-collection
View on GitHub
Spark example of collecting tweets and loading into HDFS/S3
☆42Oct 2, 2013Updated 12 years ago
mirandaio / mit-6.00.1x
View on GitHub
Introduction to Computer Science and Programming Using Python
☆19Nov 5, 2015Updated 10 years ago
memsql / streamliner-starter
View on GitHub
Starter project for building MemSQL Streamliner Pipelines
☆32Apr 18, 2017Updated 9 years ago
yenlung / Introduction-to-Tropical-Geometry-in-IPython
View on GitHub
Just a demo of using IPython to learn a subject, test some ideas, and make notes. The codes here are very very ugly and with no decent al…
☆11Mar 4, 2015Updated 11 years ago
TaiwanSparkUserGroup / docker-spark-hive-ipython
View on GitHub
Spark + Jupyer + Hive
☆12Sep 24, 2015Updated 10 years ago
maxdemarzi / neo_visual_search
View on GitHub
Neo4j POC to Integrate VisualSearch.js and Cypher
☆18May 31, 2016Updated 10 years ago
toddstavish / Cassandra-Graph-Extract
View on GitHub
Extracts A Social Network From Cassandra NoSQL Data-store To The InfiniteGraph Graph Database For Analysis
☆16Aug 26, 2010Updated 15 years ago
avulanov / ann-benchmark
View on GitHub
Benchmarks of artificial neural network library for Spark MLlib
☆11Dec 3, 2015Updated 10 years ago
hewigovens / weibo2citespace
View on GitHub
convert weibo(sina/tencent/netease) data source into an intermediate format supported by citespace
☆10Sep 27, 2011Updated 14 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
greenmoon55 / textclustering
View on GitHub
毕业设计。Keywords: 层次聚类、谱聚类、WordNet
☆10Jun 29, 2014Updated 12 years ago
ThinkBigAnalytics / scalding-workshop
View on GitHub
A half-day workshop on Scalding, the Scala API for Cascading
☆48Mar 21, 2016Updated 10 years ago
jhofman / icwsm2010_tutorial
View on GitHub
example code for "Large-scale social media analysis with Hadoop" tutorial presented at ICWSM 2010
☆42Jul 16, 2010Updated 16 years ago
dvryaboy / pig
View on GitHub
Mirror of Apache Pig
☆18Jul 9, 2013Updated 13 years ago
mkubala / typesafe-config-examples
View on GitHub
A few, straightforward examples which shows how to use Typesafe's Config library and HOCON.
☆10Oct 9, 2013Updated 12 years ago
BestActionNow / Slate_Aware_Ranking
View on GitHub
The implementation for our paper "Slate-Aware Ranking for Recommendation" accepted by WSDM.23
☆16Dec 13, 2022Updated 3 years ago
porcobosso / bert_java_serv
View on GitHub
a demo for how to execute bert_base_chinese based model in java
☆10Mar 8, 2019Updated 7 years ago
ccwang002 / 2015Talk-DeepLearn-CNN
View on GitHub
A short hands-on of CNN using Stanford CS231n online material
☆17Oct 23, 2017Updated 8 years ago
alphaonex86 / debug-devel
View on GitHub
Testing tools (binary/text) for RS232, QTcpSocket, QLocalSocket
☆13Dec 22, 2015Updated 10 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
krareT / ssdb
View on GitHub
SSDB - A fast NoSQL database, an alternative to Redis
☆12Feb 8, 2017Updated 9 years ago
saberma / gitday
View on GitHub
organize your GITHUB NEWS FEED every day
☆15Jul 22, 2012Updated 14 years ago
Lapis-Hong / FM
View on GitHub
using FM latent vectors as embedding features
☆14Sep 7, 2017Updated 8 years ago
MeninaChimp / Kmeans
View on GitHub
一个数据挖掘里的简单聚类算法，使用了JFreeChart用于对分类结果的展示。
☆11Feb 12, 2016Updated 10 years ago
tribbloid / ISpark
View on GitHub
An Apache Spark-shell backend for IPython
☆105Jul 2, 2021Updated 5 years ago
koooee / BigDataR_Examples
View on GitHub
Data Science and Machine Learning Examples for Data Science Linux
☆31Aug 29, 2012Updated 13 years ago
tdunning / pig-vector
View on GitHub
Mahout vector encoding for pig
☆53Nov 20, 2022Updated 3 years ago
elishowk / cablegate_semnet
View on GitHub
Mapping Wikileaks' Cablegate thematics using Python, MongoDB and Gephi
☆17Nov 9, 2018Updated 7 years ago
x-shadow-x / TextCluster
View on GitHub
常用文本聚类算法java实现
☆15Feb 3, 2015Updated 11 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
moses-smt / nplm
View on GitHub
Fork of http://nlg.isi.edu/software/nplm/ with some efficiency tweaks and adaptation for use in mosesdecoder.
☆13Sep 3, 2015Updated 10 years ago
anuragphadke / Flume-Hive
View on GitHub
☆15Dec 14, 2010Updated 15 years ago
emjotde / symgiza-pp
View on GitHub
Symmetrized word alignment models, based on mgizapp and GIZA++
☆14Jun 23, 2014Updated 12 years ago
aws-samples / dynamodb-elasticache-geospatial-workshop
View on GitHub
☆11Mar 28, 2022Updated 4 years ago
JRC1995 / Bi-GRU-CRF-NER
View on GitHub
Attempted implementation of a Bi-directional GRU followed by a linear-chain-CRF (from scratch) for Named Entity Recognition.
☆15Dec 5, 2017Updated 8 years ago
miracle-the-V / bigdecimal-math
View on GitHub
☆10Feb 12, 2015Updated 11 years ago
shashankg7 / Matrix-Factorization-GPU
View on GitHub
Large scale matrix factorization on GPU
☆19Jun 4, 2016Updated 10 years ago