DMDarkness / Data-mining
Some algorithms to form frequent itemsets/association rules from datasets, where many techniques, such as FP-tree, Apriori, PSO, GA, Granular computing, Central limit theorem and so on, are applied
☆62Updated 7 years ago
Alternatives and similar repositories for Data-mining:
Users that are interested in Data-mining are comparing it to the libraries listed below
- Who Stole the Postage? Fraud Detection in Return-Freight Insurance Claims☆20Updated 6 years ago
- Frequent Pattern Mining☆36Updated 6 years ago
- 天池-印象盐城-汽车销量预测大赛☆36Updated 7 years ago
- 现有聚类算法面向高维稀疏数据多未考虑类簇可重叠和离群点的存在,导致聚类效果不理想。针对此,提出一种可重叠子空间K-Means聚类算法(An Overlapping Subspace K-Means Clustering Algorithm, OS-K-Means)。给出类簇…☆30Updated 5 years ago
- 通过科研人员论文项目等数据,训练识别导师/学生的分类器。代码包括特征选择基础、网格搜索确定特征选择方法参数、不平衡数据的处理(oversampling、undersampling)和pu-learning方法在此问题上的应用☆30Updated 5 years ago
- 常用的特征选择方法☆68Updated 2 years ago
- Implement a community detection algorithm using a divisive hierarchical clustering (Girvan-Newman algorithm)☆16Updated 9 years ago
- 基于kaggle上Titanic数据集实现的ID3、C4.5、CART和CART剪枝算法☆40Updated 6 years ago
- FP-growth codes in "Machine Learning in Action"☆53Updated 6 years ago
- ☆11Updated 7 years ago
- 🍊 Frequent itemsets and association rules mining for Orange 3.☆65Updated 7 months ago
- 机器学习的特征工程,包括特征抽取、特征预处理、特征选择、特征降维。☆25Updated 6 years ago
- 使用sklearn做特征工程☆172Updated 6 years ago
- 本目录主要包含知识图谱小组每周学习例会的资源,包含paper,slides,report等。我们可以通过查看文件更新来提前做好充分准备并了解最新例会学习内容。☆21Updated 8 years ago
- 基于gensim对BOW,TFIDF,LDA,LSI,W2V等传统的文本表示模型进行简单的封装,并添加了chi2,互信息等特征选择方法☆12Updated 5 years ago
- 一种时间序列突变检验算法☆17Updated 8 years ago
- Examples of how to do feature engineering and Xgboost parameter tuning☆46Updated 8 years ago
- 《机器学习之类别不平衡问题》文章代码☆48Updated 6 years ago
- JDD-2017 京东金融大数据竞赛-销量预测-15th/889队☆37Updated 7 years ago
- 在大量标注情感的英文语料与大量未标注情感的中文语料,对给定中文语料进行情感分析。☆30Updated 7 years ago
- 标签传播算法☆11Updated 6 years ago
- ☆29Updated 5 years ago
- 国内首个迁移学习赛题 中国平安前海征信“好信杯”迁移学习大数据算法大赛 FInSight团队作品(算法方案排名第三)☆87Updated 6 years ago
- Simple sklearn based python implementation of Positive-Unlabeled (PU) classification using bagging based ensembles☆91Updated 8 years ago
- A fast algorithm to find communities in large network☆27Updated 7 years ago
- [UNMAINTAINED] 基于PySpark与MySQL的复杂网络链路预测。☆22Updated 7 years ago
- 滴滴出行供需预测大赛--十强☆44Updated 8 years ago
- Time series prediction and text analysis using Keras LSTM, plus clustering, association rules mining☆33Updated 7 years ago
- ☆130Updated 7 years ago
- 数据预处理之缺失值处理,特征选择☆21Updated 6 years ago