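This project uses Python 3.6 to solve a binary text classification problem on Meituan reviews with a "feature extraction + classifier" pipeline. The feature-extraction stage extracts six kinds of features, and the classifiers use the Python packages xgboost and lightGBM to implement boosted trees and GBDT (gradient boosting decision trees), respectively.
☆97 Apr 8, 2019 Updated 7 years ago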
Alternatives and similar repositories for TextClassification-MeiTuan-FeatureExtraction-Classifier
Users who are interested in TextClassification-MeiTuan-FeatureExtraction-Classifier are comparing it to the libraries listed below.
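- Text feature extraction: segments text with jieba, computes feature values with the TF-IDF algorithm, and provides a method for training the IDF word-frequency file ☆21 Feb 11, 2017 Updated 9 years ago
- Code for the AI100 competition (http://competition.ai100.com.cn/html/game_det.html?id=24&tab=1), mainly for text classification; covers CHI feature-word selection, TF-IDF weighting, naive Bayes, decision trees, SVM, XGBoost, and other algorithms ☆15 Mar 27, 2019 Updated 7 years ago
- Popular machine learning algorithms, including GBDT, SVM and NN, implemented with simple Python code. ☆22 Mar 3, 2018 Updated 8 years ago
- fastText in practice for Chinese ☆19 Apr 27, 2018 Updated 8 years ago
- A tool for converting Sogou cell lexicons to plain text. Extracts vocabularies for use in data generation and dictionary features for deep learning ☆26 Dec 3, 2018 Updated 7 years ago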
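- Feature engineering and class-imbalance handling methods commonly used in competitions ☆17 Aug 16, 2018 Updated 7 years ago
- A DataCastle competition on text keyword extraction! Rank 6 / 622! ☆16 Jan 27, 2019 Updated 7 years ago
- Built on clustering algorithms, the LDA topic model, and classifiers; mines location-based topic events from a Twitter corpus and performs fine-grained emotion analysis on those events ☆36 Jul 29, 2018 Updated 7 years ago
- Data feature engineering, various machine learning regression models, and preprocessing of regression data ☆44 Jan 14, 2020 Updated 6 years ago
- Conditional Similarity Networks (CSNs-Tensorflow) ☆10 Oct 29, 2018 Updated 7 years ago
- ☆15 Mar 19, 2017 Updated 9 years ago
- Graduation project: a chatbot with emotion detection that can provisionally assess the emotional state of users chatting with the bot. Tech stack: Seq2seq framework, LSTM, attention mechanism, TensorFlow 2.0 + Keras, HTML + Vue, Ajax. Project overview: focuses on text preprocessing, model construction and training, and 网… ☆28 Jun 18, 2023 Updated 2 years ago
- Sentiment polarity analysis of text based on a sentiment lexicon ☆27 May 24, 2017 Updated 9 years ago
- A complete example of new-word discovery and lexicon construction ☆20 Mar 12, 2017 Updated 9 years ago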
- Sentiment analysis on the Twitter US Airline dataset, using BERT sentence encodings as features with several classifiers: SVM, XGBoost, and RandomForest. ☆21 Jan 19, 2020 Updated 6 years ago
- Visualizes search engine ranking algorithms for a given domain ☆30 Dec 13, 2010 Updated 15 years ago
- A sentiment mining and analysis project for mobile phone review data; extracts feature words based on dependency parsing and a sentiment lexicon, then predicts and labels the sentiment polarity of those feature words. ☆36 Jul 6, 2016 Updated 9 years ago
- PyTorch learning examples ☆25 May 22, 2018 Updated 8 years ago
- A word hashing method based on vectors of letter n-grams. Currently transforms text into sequences of numbers. ☆10 Feb 27, 2018 Updated 8 years ago
- Keyword-based unsupervised text classification; implementation for the paper "Text Classification by Bootstrapping with Keywords, EM and Shrinkage" http://www.cs.cmu.edu/~knig… ☆28 Jan 28, 2021 Updated 5 years ago
- The first Tencent Social Ads College Algorithm Contest, Tencent_2017_contest ☆24 Sep 11, 2018 Updated 7 years ago
- A neural text-processing Python library for context-based feature extraction on Seq-Tagging data. ☆10 Jul 27, 2018 Updated 7 years ago
- CCKS Ant Financial event subject extraction ☆14 Jun 13, 2019 Updated 6 years ago
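- Chinese documentation for the Kashgari framework ☆22 Sep 11, 2020 Updated 5 years ago
- Semantics-based text similarity computation ☆10 Jan 22, 2019 Updated 7 years ago
- create ☆14 Jan 6, 2017 Updated 9 years ago
- Multi-label emotion classification combining context and discourse features ☆28 Aug 19, 2016 Updated 9 years ago
- A transformer model that should be able to solve a simple NER task ☆11 Mar 7, 2019 Updated 7 years ago
- Multi-intent recognition for multi-turn dialogue systems ☆31 Mar 19, 2018 Updated 8 years ago
- CCF2018: data mining, machine learning, intelligent matching, feature engineering ☆50 Sep 27, 2019 Updated 6 years ago
- LDA topic model for topic extraction and text classification ☆26 Jul 8, 2017 Updated 8 years ago
- ASR, End-to-End, end2end, Speech Recognition, end-to-end speech recognition ☆12 Oct 25, 2020 Updated 5 years ago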
- A first attempt at text mining of mobile phone reviews from an e-commerce site. Modules: data preprocessing, feature-word extraction with an LDA model, sentiment polarity judgment and intensity calculation, and sales-ranking prediction with a regression model ☆124 Mar 31, 2019 Updated 7 years ago
- ☆14 Nov 29, 2023 Updated 2 years ago
- 2016 CCF Big Data and Computing Intelligence Contest: Sogou user profiling ☆10 Aug 18, 2017 Updated 8 years ago
- Numerical combination of LDA and NMF cascaded with W2V to categorize 1M+ multi-lingual records into a 275-node, 5-level deep category tre… ☆11 Aug 29, 2020 Updated 5 years ago
- 🎭 Sentiment Analysis with Neural Networks ☆10 Dec 4, 2016 Updated 9 years ago
- XGBoost-based classification and prediction of wind turbine blade icing ☆11 Jul 22, 2019 Updated 6 years ago
- The project aims to improve the accuracy of target recognition through multi-feature fusion. Including manual feature extraction, deep lea… ☆11 Feb 18, 2020 Updated 6 years ago