Some Spark implementations of clustering algorithms.
☆19Nov 13, 2018Updated 7 years ago
Alternatives and similar repositories for spark-clustering
Users that are interested in spark-clustering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Spark ML implementation of SOM algorithm (Kohonen self-organizing map)☆20Feb 4, 2022Updated 4 years ago
- C4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.☆132Jan 26, 2021Updated 5 years ago
- Project defining the docker image that will support examples of algorithms created in this organization☆13Oct 22, 2017Updated 8 years ago
- This repository contains my MSc dissertation project. Iti s an implementation of a streaming GMM algorithm in Spark.☆11Aug 25, 2018Updated 7 years ago
- Sadnbox of Spark-notebook☆10Mar 19, 2016Updated 10 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- https://code.google.com/p/graph-theory-algorithms-book/☆21Oct 29, 2015Updated 10 years ago
- This package contains the code for executing clustering validity indices in Spark. The package includes BD-Silhouette, BD-Dunn, Davies-Bo…☆10Oct 29, 2018Updated 7 years ago
- An application to monitor and drive the Spark JobServer☆12Dec 12, 2014Updated 11 years ago
- Spark Time Series Set data analysis☆12Dec 14, 2020Updated 5 years ago
- Data Science with Apache Spark and Spark Notebook☆30Jul 24, 2017Updated 8 years ago
- Clustering stability analysis in Python with a scikit-learn compatible API.☆23May 17, 2023Updated 2 years ago
- Approximate cardinality estimation with HyperLogLog, as a Hive function☆42Dec 17, 2012Updated 13 years ago
- Scala/Spark implementation of Distributed Nearest Neighbours Mean Shift using LSH☆30May 2, 2019Updated 7 years ago
- 优化flink的多流操作(例如join),优化点不限于数据丢失问题,以及性能问题☆11Apr 8, 2019Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- My dotfiles.☆12Oct 10, 2025Updated 6 months ago
- Searching for an honest classifier☆17Jan 14, 2016Updated 10 years ago
- This project collects the map assets (Shapefiles and GeoJSON) that were used for the "Manifest Destiny" Visualization (http://michaelpora…☆24Oct 25, 2012Updated 13 years ago
- Docker containers with Apache Accumulo and Apache Spark environment.☆12Jan 22, 2016Updated 10 years ago
- SOMperf: Self-organizing maps performance metrics and quality indices☆39Jul 11, 2024Updated last year
- SSM框架构建商城+论坛☆15Jun 30, 2018Updated 7 years ago
- Keyword extraction package for Spark.☆12Jan 15, 2017Updated 9 years ago
- Integrate the GA4GH schemas and probably a scala impl of the service.☆14May 20, 2016Updated 9 years ago
- An R package providing datasets useful for testing clustering algorithms☆17Apr 11, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A web service for discovery of destinations matching your expected weather conditions (and hints on how to get there).☆32Apr 23, 2016Updated 10 years ago
- A primal-dual framework for distributed L1-regularized optimization☆37Apr 18, 2016Updated 10 years ago
- Simple and secure artifact signing for sbt.☆49Sep 3, 2024Updated last year
- ☆10Apr 27, 2019Updated 7 years ago
- Formulaire en ligne qui génère une attestation de déplacement dérogatoire☆10Mar 18, 2020Updated 6 years ago
- It consists of all code examples discussed as part of architectural patterns course taken at algorithmica☆12Nov 10, 2019Updated 6 years ago
- Data-Driven Spark allows quick data exploration based on Apache Spark.☆29Jan 6, 2017Updated 9 years ago
- Influence Maximization in Near-Linear Time: A Martingale Approach Scala implementation☆14Sep 3, 2018Updated 7 years ago
- Quick and simple data visualization tool.☆11Aug 10, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ODPi specifications, developed by ODPi Runtime and ODPi Operations projects. Currently in Emeritus status☆35Feb 12, 2019Updated 7 years ago
- Time series foreasting using Facebook's Prophet and Apache Spark☆14Dec 9, 2019Updated 6 years ago
- Backend for Location Tracker app.☆10May 7, 2018Updated 8 years ago
- ## Auto-archived due to inactivity. ## Simple JVM Profiler Using StatsD and Other Metrics Backends☆15Oct 3, 2023Updated 2 years ago
- To provide a list all regions of China, and includes province, city, district and parents relationship☆30Sep 10, 2009Updated 16 years ago
- From this paper: Density-based clustering for real-time stream data☆10Jan 7, 2017Updated 9 years ago
- Scala Driver for ArangoDB☆19Apr 27, 2024Updated 2 years ago