## 数据挖掘流程 **(一)数据读取** - 读取数据,并进行展示 - 统计数据各项指标 - 明确数据规模与要完成的任务 **(二)特征理解分析** - 单特征分析,逐个变量分析其对结果的影响 - 多变量统计分析,综合考虑多种情况影响 - 统计绘图得出结论 **(三)数据清洗与预处理** - 对缺失值进行填充 - 特征标准化/归一化 - 筛选有价值的特征 - 分析特征之间的相关性 **(四)建立模型** - 特征数据与标签准备 - 数据集切分 - 多种建模算法对比 - 集成策略等方案改进
☆10Mar 11, 2020Updated 5 years ago
Alternatives and similar repositories for Titanic-data-mining
Users that are interested in Titanic-data-mining are comparing it to the libraries listed below
Sorting:
- 机器学习的特征工程,包括特征抽取、特征预处理、特征选择、特征降维。☆25Feb 25, 2019Updated 7 years ago
- CP-ABE测试加解密操作和密钥生成操作的性能☆11Jun 24, 2020Updated 5 years ago
- ☆22Jun 20, 2024Updated last year
- An address component tagger based on statistical natural language processing techniques☆11Apr 17, 2014Updated 11 years ago
- Python基于OpenCV的图像去雾算法[完整源码&部署教程]☆11Nov 17, 2023Updated 2 years ago
- (包含完整代码和坑点记录)Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch☆27Jan 22, 2026Updated last month
- Source code for Watch Your Back: Identifying Cybercrime Financial Relationships in Bitcoin through Back-and-Forth Exploration☆12Sep 4, 2024Updated last year
- You work for a consumer finance company which specializes in lending various types of loans to urban customers. When the company receives…☆13Sep 13, 2021Updated 4 years ago
- Publish/Subscribe Style Event Emitter☆13Jun 3, 2024Updated last year
- 南京大学2016年《数据新闻》课程☆10Jun 16, 2017Updated 8 years ago
- ☆12Dec 12, 2023Updated 2 years ago
- [ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability☆13Mar 11, 2025Updated 11 months ago
- 暗通道去雾算法复现及改进☆14Aug 28, 2022Updated 3 years ago
- 哈工大机器学习作业一——多项式拟合曲线☆10Oct 19, 2016Updated 9 years ago
- ☆11Jun 21, 2022Updated 3 years ago
- 2024年舆情与新闻数据学的内容展示☆15Nov 22, 2024Updated last year
- ☆10Apr 22, 2018Updated 7 years ago
- 2020.06 亚马逊市场评论预测与情感分析模型——基于NLP☆10Jun 20, 2020Updated 5 years ago
- 表格印刷文字识别☆10Nov 24, 2018Updated 7 years ago
- 中国外交部模拟器网站首页 (People's Republic of China Oral Sex)☆10Apr 25, 2020Updated 5 years ago
- Text classification experiments using TextCNNs and Bi-attentive Classification Networks☆10Feb 18, 2019Updated 7 years ago
- Coinbase tag and coinbase output address based mining-pool identification for rust-bitcoin's bitcoin::{Block, Transaction}☆12Jan 22, 2026Updated last month
- 多模态情感分析模型实现☆11Jan 31, 2024Updated 2 years ago
- ☆13Nov 4, 2020Updated 5 years ago
- Learn about collaborative bitcoin transactions. Reclaim your privacy.☆15Apr 28, 2024Updated last year
- Open-source library for ORAM implementations☆11Apr 15, 2020Updated 5 years ago
- ☆12Dec 15, 2022Updated 3 years ago
- 国密版Fabric、Fabric-SDK-GO、Fabric-CA测试demo☆12Oct 25, 2019Updated 6 years ago
- prototypical android app for voice-based user identification on Loomo by @Segway-Robotics☆11Jun 2, 2019Updated 6 years ago
- My Implementation for the paper EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks using Tensor…☆12Mar 18, 2022Updated 3 years ago
- Python code that we used in our USENIX 2021 paper to evaluate query recovery attacks☆17Nov 11, 2020Updated 5 years ago
- ☆15Apr 6, 2022Updated 3 years ago
- Source code of a online medicine shop to buy medicines and locate nearby medicine stores. This is my final year college project .☆11Dec 8, 2022Updated 3 years ago
- MSTI☆16Mar 6, 2024Updated last year
- Hybrid UNet model for traffic prediction from traffic movies. The hybrid graph operation is a mixture of CNN and GNN operations to captur…☆14Dec 13, 2022Updated 3 years ago
- 开源GIS课程-云南大学☆25Dec 16, 2025Updated 2 months ago
- ☆12Nov 14, 2024Updated last year
- Built quantitative models to measure value at risk (VaR) and Expected Shortfall (ES).☆13Aug 30, 2018Updated 7 years ago
- Welcome to 6.86x Machine Learning with Python–From Linear Models to Deep Learning. Machine learning methods are commonly used across eng…☆13Nov 16, 2020Updated 5 years ago