使用sklearn做特征工程
☆177Jul 19, 2018Updated 7 years ago
Alternatives and similar repositories for sklearn-feature-engineering
Users that are interested in sklearn-feature-engineering are comparing it to the libraries listed below
Sorting:
- 机器学习的特征工程,包括特征抽取、特征预处理、特征选择、特征降维。☆25Feb 25, 2019Updated 7 years ago
- 通过Featuretools自动特征工程预测贷款偿还☆20Jan 10, 2020Updated 6 years ago
- CCF2018 数据挖掘 机器学习 智能匹配 特征工程☆50Sep 27, 2019Updated 6 years ago
- 整理所有特征工程用到的方法,为了复用☆11Jan 11, 2021Updated 5 years ago
- 66 classic and common interview problems from 《剑指offer》 with multiple-method-CPP solutions, and common data structure summary, etc☆20Mar 10, 2021Updated 4 years ago
- [译] 面向机器学习的特征工程☆2,554Aug 25, 2023Updated 2 years ago
- 通过将对上市公司招股说明书情绪分析的结果与常用财务指标、企业科研指标等结合,综合使用多种分类模型:传统LR、随机森林、XGB、LGB集成学习模型对新上市公司破发情况进行学习和预测,筛选重要特征,并由此来得到一个新股破发分类器。☆14Aug 26, 2023Updated 2 years ago
- jdata2019-用户对品类店铺的购买预测Rank4☆10Jul 22, 2019Updated 6 years ago
- CCF BDCI 2022比赛 返乡发展人群预测赛题 Baseline 数据挖掘(特征工程+集成学习)队伍排名39/2297☆12Mar 15, 2024Updated last year
- 为天池数据竞赛写的自动化特征工程和训练工具,可以通过配置的方式从mysql数据库中生成特征。同时重新封装了数据,特征和模型,使其可以被自动化测试系统识别及调用。待完成的工作:自动化测试系统的调度关键技术。☆12Dec 6, 2015Updated 10 years ago
- Showing the relationship between ImageNet ID and labels and pytorch pre-trained model output ID and labels☆10Oct 11, 2020Updated 5 years ago
- This is a group project for E-commerce repeat buyers purchase prediction using machine learning while accounting for imbalance outcome fo…☆12Dec 29, 2020Updated 5 years ago
- The ts302_team final solution to the KDD CUP 2019 AutoML Track problem.☆15Jul 3, 2020Updated 5 years ago
- 员工离职预测训练赛☆10Aug 25, 2017Updated 8 years ago
- TalkingData AdTracking Fraud Detection Challenge on Kaggle Competition☆13Sep 24, 2018Updated 7 years ago
- 数据预处理过程(属性选择, 异常值处理, 归一化, 标准化等)☆65Jul 25, 2020Updated 5 years ago
- 一些个人学习笔记☆61Apr 6, 2021Updated 4 years ago
- 天池大数据竞赛 千里马大赛 风险识别与预测赛题 Top5☆14May 16, 2019Updated 6 years ago
- 常见的数据预处理,包括数据加载、缺失值&异常值处理、描述性变量转换为数值型、训练测试集划分、数据规范化☆48Sep 19, 2023Updated 2 years ago
- Spatial-Temporal Graph Convolutional Neural Network with LSTM layers☆17Aug 7, 2020Updated 5 years ago
- Bytedance ICME2019☆13Apr 12, 2019Updated 6 years ago
- 常用的特征选择方法☆67Jul 4, 2022Updated 3 years ago
- AutoML☆38Feb 14, 2019Updated 7 years ago
- 信用卡反欺诈消费预测模型☆10Aug 2, 2019Updated 6 years ago
- 在sklearn下,几种常用的特征选择方法☆41Jan 21, 2016Updated 10 years ago
- A Visual-semantic embedding model using word2vec and CNNs☆14Dec 31, 2017Updated 8 years ago
- 天池大数据竞赛平台上IJCAI-17口碑商家客流量预测比赛的解题思路和代码☆19Mar 17, 2017Updated 8 years ago
- GridAttackAnalyzer: Smart Grid Attack Analysis Framework☆21Feb 15, 2022Updated 4 years ago
- 京东JDATA2019-用户对品类下店铺的购买预测☆18Jun 10, 2019Updated 6 years ago
- 根据GBDT衍生变量,并对衍生后的变量进行应用☆21Mar 15, 2020Updated 5 years ago
- A code repository for my Tianchi big data competition.☆117Mar 12, 2018Updated 7 years ago
- 关于综合评价一个评分卡模型的方法总结(附代码)☆74Feb 17, 2019Updated 7 years ago
- 分别基于statsmodels和scikit-learn实现两种可用于sklearn pipeline的 LogisticRegression,并输出相应的报告☆21May 21, 2023Updated 2 years ago
- machinelearning的笔记,以及其他编程学习笔记☆22Nov 19, 2018Updated 7 years ago
- Comparison of XGBoost and LightGBM (speed, accuracy and complexity)☆21Dec 8, 2018Updated 7 years ago
- 七月算法 - 计算机视觉课程 课件☆20Dec 14, 2017Updated 8 years ago
- 通过对于现有开源分布式机器学习工具的整合(主要是基于参数服务器的logistic regression,xgboost,FFM,FM ),打造一个工业级的,可以线上使用的点击率预估流水线☆26Jun 6, 2017Updated 8 years ago
- First place solution of WSDM CUP 2020, pairwise-bert, lightgbm☆88Jan 22, 2020Updated 6 years ago
- 分类类别不平衡,解决办法:采样(SMOTE和算法集成技术等)、阈值移动、调整代价或权重,附带信用卡诈骗案例☆21Oct 8, 2019Updated 6 years ago