比赛常用的特征工程、类别不平衡处理方法
☆17Aug 16, 2018Updated 7 years ago
Alternatives and similar repositories for data_process
Users that are interested in data_process are comparing it to the libraries listed below
Sorting:
- 整理所有特征工程用到的方法,为了复用☆11Jan 11, 2021Updated 5 years ago
- 数据特征工程、各种机器学习回归模型、回归数据预处理☆43Jan 14, 2020Updated 6 years ago
- 分类类别不平衡,解决办法:采样(SMOTE和算法集成技术等)、阈值移动、调整代价或权重,附带信用卡诈骗案例☆21Oct 8, 2019Updated 6 years ago
- 马上AI全球挑战赛-违约用户风险预测 top2-solution☆18Jun 21, 2018Updated 7 years ago
- 使用sklearn做特征工程☆178Jul 19, 2018Updated 7 years ago
- Data from Operational Research Seminar☆13Nov 18, 2018Updated 7 years ago
- 招商银行2021年Fintech数据赛道思路及代码☆13May 16, 2021Updated 4 years ago
- Welcome to the SOLETE platform. These scripts are meant to help you using the homonymous dataset [1] and to replicate the results from th…☆11Sep 14, 2023Updated 2 years ago
- 对截止至2017年7月17日的债券违约事件进行梳理归因,并寻找宏观流动性影响因素,组成数据集。运用Lasso回归进行特征提取后,输入带L2惩罚项LR、SVM、NN、GBDT、RF等机器学习模型进 行违约预测,得出GBDT预测效果最好以及特征工程对线性模型预测效果具有重要性的结…☆58Mar 7, 2019Updated 7 years ago
- dbn with smote 增强☆11Apr 12, 2019Updated 6 years ago
- China University Computer Contest-Big Data Challenge (A list: rank1, B list: rank2)☆40Feb 12, 2019Updated 7 years ago
- 讯飞移动广告反欺诈算法竞赛☆33Nov 1, 2019Updated 6 years ago
- 唯品会用户购买行为预测☆32Mar 16, 2018Updated 8 years ago
- 员工离职预测训练赛☆10Aug 25, 2017Updated 8 years ago
- ALI-IJCAI-AD☆18May 15, 2018Updated 7 years ago
- Mars-risk 高性能信贷风控工具箱☆41Mar 15, 2026Updated last week
- ☆10Feb 25, 2023Updated 3 years ago
- 风控算法,特征工程,模型工程,分布式,树模型☆17Oct 31, 2023Updated 2 years ago
- This repo is for the Linkedin Learning course: Predictive Customer Analytics☆11May 5, 2025Updated 10 months ago
- 14天完成数据分析实战项目☆10Sep 7, 2022Updated 3 years ago
- This repository aims to onboard new users into Modeling in SAP Data Warehouse Cloud in the most practical manner. For that you will build…☆17Feb 2, 2024Updated 2 years ago
- 鲁伟《机器学习公式推导与代码实现》。整体对算法的分类是亮点。算法原理和代码实现也相对简单,可以和《机器学习实战》对比起来看。☆11Oct 19, 2022Updated 3 years ago
- Skip-Thought Vectors implement by tensorflow☆11Mar 6, 2018Updated 8 years ago
- Feature selector is a tool for dimensionality reduction of machine learning datasets.☆19Jun 17, 2024Updated last year
- Adaptive Synthetic Sampling Approach for Imbalanced Learning☆13Jun 16, 2013Updated 12 years ago
- CHIP2018评测任务2,平安医疗科技智能患者健康咨询问句匹配大赛baseline,BiLSTM+特征工程计算相似性,10折交叉验证平均投票做bagging,F1值0.83左右,rank16。☆19Dec 4, 2018Updated 7 years ago
- 分别基于statsmodels 和scikit-learn实现两种可用于sklearn pipeline的 LogisticRegression,并输出相应的报告☆21May 21, 2023Updated 2 years ago
- 新版代码生成器☆10Apr 19, 2018Updated 7 years ago
- 《Python深度学习(第2版)》代码及笔记☆22Nov 24, 2022Updated 3 years ago
- Get 50 news from test data,find each of them the most similar 50 news in train data.☆17May 10, 2018Updated 7 years ago
- Data Analysis with Python - Customer Segmentation ( RFM Analysis) - Power BI Dashboard - Tableau Dashboard☆12Feb 16, 2021Updated 5 years ago
- 记录☆19Nov 29, 2025Updated 3 months ago
- 机器学习的特征工程,包括特征抽取、特征预处理、特征选择、特征降维。☆25Feb 25, 2019Updated 7 years ago
- Scalable in-database machine learning with PL/Python: Postgres Open SV 2017 talk☆13Feb 8, 2018Updated 8 years ago
- Plugin for IDA Pro to convert assembler to LLVM IR☆20Nov 15, 2016Updated 9 years ago
- Reinforcement Learning for Supply Chain Optimization☆14Feb 3, 2020Updated 6 years ago
- Moebius is a cross-chain oracle that bridges on-chain Ethereum data to Solana.☆10Nov 12, 2020Updated 5 years ago
- 同步Hive数据仓库数据到Elasticsearch的小工具☆21Feb 3, 2018Updated 8 years ago
- 该工程是利用python3.6进行“特征提取+分类器”来实现美团评论的文本二分类问题。在特征提取部分提取了6种特征,分类器选择了python里面的包xgboost和lightGBM分别实现提升树和GBDT(梯度提升决策树)。☆96Apr 8, 2019Updated 6 years ago