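This project uses Python 3.6 to solve a binary text classification problem on Meituan reviews with a "feature extraction + classifier" approach. The feature extraction stage extracts six kinds of features, and the classifier stage uses the Python packages xgboost and lightGBM to implement boosted trees and GBDT (gradient boosted decision trees), respectively.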
☆96Apr 8, 2019Updated 6 years ago
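As a rough illustration of the "feature extraction + classifier" idea described above, the sketch below maps each review to a small numeric feature vector and trains a tiny gradient-boosted ensemble of decision stumps on it. This is only a sketch under stated assumptions: the cue-word lists, the three features, the toy reviews, and every function name are hypothetical, and the hand-written pure-Python booster stands in for xgboost and lightGBM rather than reproducing the project's code.

```python
import math

# Hypothetical cue words; the project's actual six features are not listed here.
POSITIVE = {"good", "great", "tasty", "fast"}
NEGATIVE = {"bad", "cold", "slow", "rude"}

def extract_features(text):
    """Map a review to [positive cue count, negative cue count, token count]."""
    tokens = text.lower().split()
    pos = sum(t in POSITIVE for t in tokens)
    neg = sum(t in NEGATIVE for t in tokens)
    return [pos, neg, len(tokens)]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_stump(X, residuals):
    """Return the (feature, threshold, left mean, right mean) split that
    minimizes squared error on the residuals, or None if no split exists."""
    best = None
    for j in range(len(X[0])):
        for thr in sorted({row[j] for row in X}):
            left = [r for row, r in zip(X, residuals) if row[j] <= thr]
            right = [r for row, r in zip(X, residuals) if row[j] > thr]
            if not left or not right:
                continue
            lm = sum(left) / len(left)
            rm = sum(right) / len(right)
            err = sum((r - lm) ** 2 for r in left) + sum((r - rm) ** 2 for r in right)
            if best is None or err < best[0]:
                best = (err, j, thr, lm, rm)
    return best[1:] if best else None

def train(X, y, rounds=10, lr=0.5):
    """Gradient boosting on log loss: each stump fits the residuals y - p."""
    model, scores = [], [0.0] * len(X)
    for _ in range(rounds):
        residuals = [yi - sigmoid(s) for yi, s in zip(y, scores)]
        stump = fit_stump(X, residuals)
        if stump is None:
            break
        j, thr, lm, rm = stump
        model.append((j, thr, lr * lm, lr * rm))  # store shrunken leaf values
        scores = [s + (lr * lm if row[j] <= thr else lr * rm)
                  for row, s in zip(X, scores)]
    return model

def predict(model, x):
    """Sum the stump outputs and threshold the raw score at zero."""
    score = sum(left if x[j] <= thr else right for j, thr, left, right in model)
    return 1 if score > 0 else 0

# Toy English reviews (1 = positive, 0 = negative), made up for illustration.
REVIEWS = [
    ("great food and fast delivery", 1),
    ("tasty and good", 1),
    ("good service great price", 1),
    ("cold food and slow delivery", 0),
    ("rude staff bad food", 0),
    ("slow and bad", 0),
]
X = [extract_features(text) for text, _ in REVIEWS]
y = [label for _, label in REVIEWS]
model = train(X, y)
```

Calling `predict(model, extract_features(some_review))` returns 1 or 0 for a new review. In a real setup the stump booster would presumably be replaced by an xgboost or lightGBM classifier, and Chinese reviews would need word segmentation before any token-based features are computed.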
Alternatives and similar repositories for TextClassification-MeiTuan-FeatureExtraction-Classifier
Users that are interested in TextClassification-MeiTuan-FeatureExtraction-Classifier are comparing it to the libraries listed below
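- Text feature extraction for novels, papers, argumentative essays, and similar texts; extracts word, sentence, and dependency-relation features. Developed in Python.☆42Jun 27, 2018Updated 7 years ago
- Text feature-value extraction: segments text with jieba, computes feature values with the TF-IDF algorithm, and includes a method for training the IDF word-frequency file☆20Feb 11, 2017Updated 9 years ago
- Code for the AI100 competition (http://competition.ai100.com.cn/html/game_det.html?id=24&tab=1), mainly for text classification; covers CHI feature-word selection, TF-IDF weighting, naive Bayes, decision trees, SVM, XGBoost, and other algorithms☆15Mar 27, 2019Updated 6 years ago
- JD.com review sentiment analysis model, mainly covering: 1. data acquisition and exploratory analysis; 2. text preprocessing, word segmentation, text vectorization, feature extraction,☆84Jun 4, 2019Updated 6 years ago
- fastText in practice for Chinese☆19Apr 27, 2018Updated 7 years ago
- Tool for converting Sogou cell dictionaries to plain text. Extracts vocabularies for data generation and dictionary features in deep learning☆26Dec 3, 2018Updated 7 years ago
- Feature engineering, various machine learning regression models, and preprocessing of regression data☆43Jan 14, 2020Updated 6 years ago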
- A DataCastle competition on text keyword extraction! Rank 6 / 622!☆16Jan 27, 2019Updated 7 years ago
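- Builds on clustering algorithms, the LDA topic model, and classifiers to mine location-based topic events from a Twitter corpus, and performs fine-grained emotion analysis on those events☆35Jul 29, 2018Updated 7 years ago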
- Conditional Similarity Networks (CSNs-Tensorflow)☆10Oct 29, 2018Updated 7 years ago
- ☆15Mar 19, 2017Updated 9 years ago
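- Graduation project: chatbot + emotion detection; offers a preliminary way to gauge the emotional state of users chatting with the bot. Tech stack: Seq2seq framework, LSTM, attention mechanism, Tensorflow2.0+Keras, Html+Vue, Ajax. Project overview: focuses on text preprocessing, model construction and training, and 网…☆28Jun 18, 2023Updated 2 years ago
- Sentiment polarity analysis of text based on a sentiment lexicon☆27May 24, 2017Updated 8 years ago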
- ☆11Oct 31, 2019Updated 6 years ago
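- Reviews bond default events up to July 17, 2017 and attributes their causes, identifies macro liquidity factors, and assembles a dataset. After feature extraction with Lasso regression, the features are fed to machine learning models including L2-penalized LR, SVM, NN, GBDT, and RF for default prediction, concluding that GBDT predicts best and that feature engineering matters for the predictive performance of linear models…☆58Mar 7, 2019Updated 7 years ago
- Common feature selection methods☆67Jul 4, 2022Updated 3 years ago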
- Human Facial Skin Defects Dataset☆11Aug 8, 2024Updated last year
- Self complemented text feature extraction using algorithms including CHI, DF, IG, MI for the experiment of text classification based on s…☆49Apr 18, 2018Updated 7 years ago
- Exploration of BERT-BiLSTM models with Layer Aggregation (attention-based and capsule-routing-based) and Hidden-State Aggregation (attent…☆25Mar 24, 2020Updated 5 years ago
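- A complete example of new-word discovery and lexicon construction☆20Mar 12, 2017Updated 9 years ago
- Sentiment analysis on the Twitter US Airline dataset, using Bert sentence encoding as features, with several classifiers implemented: SVM, XGBoost, and RandomForest.☆22Jan 19, 2020Updated 6 years ago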
- Visualizes search engine ranking algorithms for a given domain☆30Dec 13, 2010Updated 15 years ago
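- Sentiment mining and analysis of mobile phone review data; extracts feature words based on dependency parsing and a sentiment lexicon, and labels the feature words with predicted sentiment polarity.☆36Jul 6, 2016Updated 9 years ago
- Slot filling for multi-turn dialogue☆20Jan 16, 2019Updated 7 years ago
- pytorch learning examples☆25May 22, 2018Updated 7 years ago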
- A word hashing method based on vectors of letter n-grams. Currently transforms text into sequences of numbers.☆10Feb 27, 2018Updated 8 years ago
- The first Tencent Social Ads College Algorithm Contest, Tencent_2017_contest☆24Sep 11, 2018Updated 7 years ago
- New-word discovery algorithm (NewWordDetection)☆63Sep 4, 2017Updated 8 years ago
- A neural text process python lib for context-based feature extraction on Seq-Tagging data.☆10Jul 27, 2018Updated 7 years ago
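- A refactoring of my "quality courses" graduation project, rewritten with the springboot framework☆20Jun 20, 2025Updated 9 months ago
- CCKS Ant Financial event subject extraction☆14Jun 13, 2019Updated 6 years ago
- Chinese documentation for the Kashgari framework☆22Sep 11, 2020Updated 5 years ago
- Semantics-based text similarity computation☆10Jan 22, 2019Updated 7 years ago
- Multi-label emotion classification combining context and discourse features☆28Aug 19, 2016Updated 9 years ago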
- A transformer model that should be able to solve a simple NER task☆11Mar 7, 2019Updated 7 years ago
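- Multi-intent recognition for multi-turn dialogue systems☆31Mar 19, 2018Updated 8 years ago
- Hands-on text classification, text mining, sentiment analysis, and text generation☆14Mar 22, 2023Updated 2 years ago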
- Bert-Pytorch-Chinese-TextClassification☆27Oct 21, 2022Updated 3 years ago
- [Jinri Toutiao] Text author identification competition☆10Aug 20, 2018Updated 7 years ago