YiFraternity / OS-K-Means
现有聚类算法面向高维稀疏数据多未考虑类簇可重叠和离群点的存在,导致聚类效果不理想。针对此,提出一种可重叠子空间K-Means聚类算法(An Overlapping Subspace K-Means Clustering Algorithm, OS-K-Means)。给出类簇子空间计算策略,在聚类过程中动态更新每个类簇的属性子空间,并定义合理的约束函数指导聚类过程,从而实现类簇的可重叠性与寻找离群点的效果。具体地,定义合理的目标函数对传统的K-Means算法进行修正,利用熵权约束分别计算每个类簇中每个维度的权重,使用权重值来标识对不同类簇中维度的相对重要性,并加入对重叠程度和离群值数量控制的参数。
☆30Updated 5 years ago
Alternatives and similar repositories for OS-K-Means:
Users that are interested in OS-K-Means are comparing it to the libraries listed below
- 常用的特征选择方法☆68Updated 2 years ago
- 数据预处理之缺失值处理,特征选择☆21Updated 6 years ago
- 改进的k-prototypes聚类算法☆18Updated 4 years ago
- Optimizing k-means++ initialization using PSO☆17Updated 9 years ago
- 由时间空间成对组成的轨迹序列,通过循环神经网络lstm,自编码器auto-encode,时空密度聚类st-dbscan做异常检测☆71Updated 5 years ago
- AutoEncoder implements by keras. Including AE, DAE, DAE_CNN, VAE, VAE_CNN, CVAE, Sparse AE, Stacked DAE.☆41Updated 4 years ago
- 本项目开发了一个机器学习和深度学习的训练工具。该训练工具基于sklearn和pytorch,不仅支持常规训练、交叉验证训练,还支持贝叶斯搜索参数,并可随时自动保存训练模型和日志。☆12Updated last year
- Source code of our paper: An overlapping community detection algorithm based on density peaks☆13Updated 6 years ago
- 基于Python实现了K-Means、GMM、DBSCAN、AGNES等四种常见的聚类算法☆68Updated 6 years ago
- 基于遗传算法的特征选择☆127Updated 5 years ago
- 使用遗传算法结合决策树做特征选择/Using genetic algorithm for feature selection with decision tree☆24Updated 6 years ago
- 聚类算法k-means的简单实现☆37Updated 6 years ago
- 集成学习Stacking方法详解☆74Updated 5 years ago
- This is an implementation of the paper on "Improved K-means algorithm based on density Canopy".☆30Updated 5 years ago
- 使用 tensorflow2.0 实现图卷积神经网络GCN☆20Updated 4 years ago
- K-Means聚类算法及其改进☆31Updated 6 years ago
- There are some reproduced algorithms for learning from imbalanced data, including over-sampling,under-sampling and boosting☆12Updated last year
- Removal of information from co-association matrix for clustering ensemble☆17Updated 5 years ago
- 机器学习的特征工程,包括特征抽取、特征预处理、特征选择、特征降维。☆25Updated 6 years ago
- This algorithm is based on the paper 'K-Means clustering algorithm Based on Adapative Feature Weighted'☆29Updated 5 years ago
- 数据特征工程、各种机器学习回归模型、回归数据预处理☆41Updated 5 years ago
- 2018年研究生数学建模F组题☆14Updated 2 years ago
- Python Code For 'Clustering By Fast Search And Find Of Density Peaks' In Science 2014.(原算法地址:https://github.com/lanbing510/DensityPeakClu…☆15Updated 5 years ago
- Dynamic Graph-Based Label Propagation for Density Peaks Clustering☆42Updated 2 years ago
- Oversampling method based on relative density☆11Updated 4 years ago
- Affinity Propagation Clustering with DTW distance on temporal sequence classification☆19Updated 6 years ago
- Community detection on Hollywood actors using various models: Louvain, Clauset-Newman-Moore, GCN, GraphSage, and GAT.☆10Updated 5 years ago
- 项目基于论文《Fuzzy c-Means Algorithms for Very Large Data》,使用Python语言实现FCM算法及其扩展算法,包括FCM、spFCM、oFCM、kFCM、reskFCM、spkFCM、okFCM。☆63Updated 5 years ago
- 建立SARIMA-LSTM混合模型预测时间序列问题。以PM2.5值为例,使用UCI公开的自2013年1月17日至2015年12月31日五大城市PM2.5小时检测数据,将数据按时间段划分,使用SARIMA过滤其线性趋势,再对过滤后的残差使用LSTM进行预测,最后对预测结果进行…☆75Updated 6 years ago
- 机器学习集成模型之Stacking各类模型及工具源码☆116Updated 4 years ago