该工程是利用python3.6进行“特征提取+分类器”来实现美团评论的文本二分类问题。在特征提取部分提取了6种特征,分类器选择了python里面的包xgboost和lightGBM分别实现提升树和GBDT(梯度提升决策树)。
☆97Apr 8, 2019Updated 7 years ago
Alternatives and similar repositories for TextClassification-MeiTuan-FeatureExtraction-Classifier
Users that are interested in TextClassification-MeiTuan-FeatureExtraction-Classifier are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 文本特征提取,适用于小说,论文,议论文等文本,提取词语、句子、依存关系等特征。python开发。☆42Jun 27, 2018Updated 7 years ago
- AI100竞赛:http://competition.ai100.com.cn/html/game_det.html?id = 24&tab = 1 的代码,主要用于文本分类,其中涉及CHI选择特征词,TFIDF计算权重,朴素贝叶斯,决策树,SVM,XGBoost等算法☆15Mar 27, 2019Updated 7 years ago
- 京东评论情感分析模型,主要包括1、数据获取及探索性分析;2、文本预处理、文本分词、文本向量化、特征提取、☆84Jun 4, 2019Updated 6 years ago
- Popular machine learning algorithms, including GBDT, SVM and NN, implemented with simple python code.☆22Mar 3, 2018Updated 8 years ago
- fastText中文实践☆19Apr 27, 2018Updated 8 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 比赛常用的特征工程、类别不平衡处理方法☆17Aug 16, 2018Updated 7 years ago
- EasyMKL☆14Jun 12, 2018Updated 7 years ago
- A competition on DataCastle which is about text keyword extraction ! Rank 6 / 622 !☆16Jan 27, 2019Updated 7 years ago
- 以聚类算法、LDA主题模型、分类器为基础,完成对Twitter语料的基于地理位置的主题事件挖掘,并对主题事件进行细粒度的情绪分析☆35Jul 29, 2018Updated 7 years ago
- 数据特征工程、各种机器学习回归模型、回归数据预处理☆44Jan 14, 2020Updated 6 years ago
- Conditional Similarity Networks (CSNs-Tensorflow)☆10Oct 29, 2018Updated 7 years ago
- ☆15Mar 19, 2017Updated 9 years ago
- 基于情感词典的文本情感极性分析☆27May 24, 2017Updated 8 years ago
- 对截止至2017年7月17日的债券违约事件进行梳理归因,并寻找宏观流动性影响因素,组成数据集。运用Lasso回归进行特征提取后,输入带L2惩罚项LR、SVM、NN、GBDT、RF等机器学习模型进行违约预测,得出GBDT预测效果最好以及特征工程对线性模型预测效果具有重要性的结…☆58Mar 7, 2019Updated 7 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- 常用的特征选择方法☆67Jul 4, 2022Updated 3 years ago
- Human Facial Skin Defects Dataset☆11Aug 8, 2024Updated last year
- Self complemented text feature extraction using algorithms including CHI, DF, IG, MI for the experiment of text classification based on s…☆49Apr 18, 2018Updated 8 years ago
- Exploration of BERT-BiLSTM models with Layer Aggregation (attention-based and capsule-routing-based) and Hidden-State Aggregation (attent…☆25Mar 24, 2020Updated 6 years ago
- Twitter US Airline数据集情感分析(sentiment Analysis),使用Bert Sentence encoding作为特征,实现了SVM、XGBoost、RandomForest(随机森林)若干分类器。☆21Jan 19, 2020Updated 6 years ago
- Visualizes search engine ranking algorithms for a given domain☆30Dec 13, 2010Updated 15 years ago
- 针对手机评论数据的情感挖掘与分析项目,基于依存句法分析和情感词库提取特征词,并对特征词做情感极性预测标注。☆36Jul 6, 2016Updated 9 years ago
- 多轮对话槽填充☆20Jan 16, 2019Updated 7 years ago
- pytorch learning eamples☆25May 22, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A word hashing method based on vectors of letter n-grams. Currently transforms text into sequences of numbers.☆10Feb 27, 2018Updated 8 years ago
- 基于关键词的无监督文本分类;Implementation for paper "Text Classification by Bootstrapping with Keywords, EM and Shrinkage" http://www.cs.cmu.edu/~knig…☆28Jan 28, 2021Updated 5 years ago
- 第一届腾讯社交广告高校算法大赛Tencent_2017_contest☆24Sep 11, 2018Updated 7 years ago
- A neural text process python lib for context-based feature extraction on Seq-Tagging data.