Feature engineering and class-imbalance handling methods commonly used in competitions
☆17 Aug 16, 2018 Updated 7 years ago
Alternatives and similar repositories for data_process
Users that are interested in data_process are comparing it to the libraries listed below.
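- A collection of all the methods used in feature engineering, organized for reuse ☆11 Jan 11, 2021 Updated 5 years ago
- Data feature engineering, various machine learning regression models, and data preprocessing for regression ☆44 Jan 14, 2020 Updated 6 years ago
- Predicting loan repayment with automated feature engineering using Featuretools ☆20 Jan 10, 2020 Updated 6 years ago
- Class imbalance in classification. Solutions: sampling (SMOTE, ensemble techniques, etc.), threshold moving, and adjusting costs or weights, with a credit card fraud case study ☆21 Oct 8, 2019 Updated 6 years ago
- Mashang AI Global Challenge (马上AI全球挑战赛): default-user risk prediction, top2-solution ☆18 Jun 21, 2018 Updated 7 years ago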
- Feature engineering with sklearn ☆178 Jul 19, 2018 Updated 7 years ago
- This repo has some proposed agenda for Azure Machine Learning related hands-on workshops. ☆11 Feb 2, 2021 Updated 5 years ago
- Welcome to the SOLETE platform. These scripts are meant to help you using the homonymous dataset [1] and to replicate the results from th… ☆12 Sep 14, 2023 Updated 2 years ago
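- Reviews and attributes the causes of bond default events up to July 17, 2017, and identifies macro liquidity factors to build a dataset. After feature extraction with Lasso regression, the features are fed into machine learning models such as L2-penalized LR, SVM, NN, GBDT, and RF for default prediction, concluding that GBDT predicts best and that feature engineering matters for the predictive performance of linear models… ☆58 Mar 7, 2019 Updated 7 years ago
- DBN with SMOTE augmentation ☆11 Apr 12, 2019 Updated 7 years ago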
- China University Computer Contest-Big Data Challenge (A list: rank1, B list: rank2) ☆37 Feb 12, 2019 Updated 7 years ago
- A Tensorflow Implementation of Dual-Stage Attention-Based Recurrent Neural Network for Time Series Prediction ☆14 Mar 9, 2020 Updated 6 years ago
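- Combines sentiment analysis of listed companies' prospectuses with common financial indicators, corporate R&D indicators, and similar data, then uses several classification models (traditional LR, random forest, and the XGB and LGB ensemble learning models) to learn and predict whether newly listed companies fall below their issue price, selects the important features, and from this obtains a classifier for new stocks breaking their issue price. ☆14 Aug 26, 2023 Updated 2 years ago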
- Application of NLP, word embedding, LSTM, PCA, TSNE. ☆13 Sep 1, 2021 Updated 4 years ago
- A multi-label approach of the SMOTE algorithm ☆12 Aug 6, 2024 Updated last year
- iFLYTEK mobile advertising anti-fraud algorithm competition ☆34 Nov 1, 2019 Updated 6 years ago
- My Implementation for the paper EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks using Tensor… ☆12 Mar 18, 2022 Updated 4 years ago
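- Text data augmentation ☆15 Apr 10, 2020 Updated 6 years ago
- A feature extraction and scoring project based on self-built functions (missing value handling, univariate correlation analysis, feature scoring, dimensionality reduction) ☆15 Jul 21, 2017 Updated 8 years ago
- Attention-based CNN text classification ☆15 Dec 8, 2022 Updated 3 years ago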
- ALI-IJCAI-AD ☆18 May 15, 2018 Updated 7 years ago
- Lantaiyuan (蓝泰源) big data foundation platform ☆17 Mar 7, 2018 Updated 8 years ago
- ☆10 Feb 25, 2023 Updated 3 years ago
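- Risk control algorithms, feature engineering, model engineering, distributed computing, tree models ☆17 Oct 31, 2023 Updated 2 years ago
- Code for the 2018 Tencent Advertising Algorithm Competition ☆13 Jul 7, 2018 Updated 7 years ago
- Complete a hands-on data analysis project in 14 days ☆10 Sep 7, 2022 Updated 3 years ago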
- This repository aims to onboard new users into Modeling in SAP Data Warehouse Cloud in the most practical manner. For that you will build… ☆17 Feb 2, 2024 Updated 2 years ago
- Feature selector is a tool for dimensionality reduction of machine learning datasets. ☆19 Jun 17, 2024 Updated last year
- Adaptive Synthetic Sampling Approach for Imbalanced Learning ☆13 Jun 16, 2013 Updated 12 years ago
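- CHIP2018 evaluation task 2: baseline for the Ping An Healthcare Technology intelligent patient health consultation question matching competition. BiLSTM plus feature engineering to compute similarity, bagging by averaged voting over 10-fold cross-validation, F1 around 0.83, rank16. ☆19 Dec 4, 2018 Updated 7 years ago
- Data preprocessing: missing value handling and feature selection ☆23 Apr 3, 2019 Updated 7 years ago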
- This repo is for the Linkedin Learning course: Predictive Customer Analytics ☆12 May 5, 2025 Updated 11 months ago
- New version of the code generator ☆10 Apr 19, 2018 Updated 8 years ago
- Code and notes for the book Deep Learning with Python, 2nd Edition (《Python深度学习(第2版)》) ☆23 Nov 24, 2022 Updated 3 years ago
- Seq2seq Model on Time-series Data: Training and Serving with TensorFlow ☆20 Apr 26, 2019 Updated 7 years ago
- Take 50 news items from the test data and, for each, find the 50 most similar news items in the training data. ☆16 May 10, 2018 Updated 7 years ago
- Kaggle: Two Sigma Connect: Rental Listing Inquiries, top1 ☆127 Oct 1, 2020 Updated 5 years ago
- Two LogisticRegression implementations usable in an sklearn pipeline, based on statsmodels and scikit-learn respectively, each producing a corresponding report ☆22 May 21, 2023 Updated 2 years ago
- Plugin for IDA Pro to convert assembler to LLVM IR ☆20 Nov 15, 2016 Updated 9 years ago