LaguaFluc/Text-classification-and-code

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LaguaFluc/Text-classification-and-code)

LaguaFluc / Text-classification-and-code

毕业论文代码 + 评论文本数据获取+数据清洗+文本数据向量化+将数据放进分类器（KNN+Naive Bayes+SVM）中训练+结果评估

☆55

Alternatives and similar repositories for Text-classification-and-code

Users that are interested in Text-classification-and-code are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

HeCCXX / svm_chinese_textclassification
View on GitHub
SVM中文文本分类
☆13Mar 13, 2022Updated 4 years ago
phjzlh / Restaurant-evaluation-Emotion-classification_machine-learning-RNN-Bi-LSTM
View on GitHub
使用2018年度部分大众点评的用户评价作为数据集，未筛选前共440万条评论数据，经过数据集的标签化处理以及中文文本的预处理、特征提取以及特征权重后，使用了SVM，朴素贝叶斯，Adabosst等经典机器学习方法进行分类，之后又使用了Bi-LSTM的深度神经网络进行训练分类。
☆13Nov 11, 2021Updated 4 years ago
qyfang / TextClassification
View on GitHub
基于scikit-learn实现对新浪新闻的文本分类，数据集为100w篇文档，总计10类，测试集与训练集1:1划分。分类算法采用SVM和Bayes，其中Bayes作为baseline。
☆110Dec 24, 2018Updated 7 years ago
dataaug / chatbot_multiround
View on GitHub
多轮中文聊天机器人，采用GPT2进行微调，清洗聊天数据110w+，采用语义相似度和文本jaccard相似度过滤回话。
☆22Nov 13, 2021Updated 4 years ago
chapzq77 / scik-learn-learn-Chinese-text-classider
View on GitHub
使用scik-learn 实现k-means，KNN,SVM,贝叶斯，topic_extraction等算法，同时评估分类的准确率，召回率和F值。语料库是中文文本
☆43Jul 23, 2017Updated 8 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
wzuden / featureSelection
View on GitHub
文本分类之特征选择
☆11Aug 12, 2017Updated 8 years ago
Kinghao0319 / Amazon-market-review-analysis
View on GitHub
2020.06 亚马逊市场评论预测与情感分析模型——基于NLP
☆10Jun 20, 2020Updated 6 years ago
xuman-Amy / preprocessing
View on GitHub
数据预处理之缺失值处理，特征选择
☆23Apr 3, 2019Updated 7 years ago
sdunlp / nlp_Chinese
View on GitHub
Application for processing Chinese text : Sentiment , Keywords , Abstract
☆10Apr 13, 2017Updated 9 years ago
hellopikaqiu / text_classification_AI100
View on GitHub
AI100竞赛：http：//competition.ai100.com.cn/html/game_det.html？id = 24＆tab = 1 的代码，主要用于文本分类，其中涉及CHI选择特征词，TFIDF计算权重，朴素贝叶斯，决策树，SVM，XGBoost等算法
☆15Mar 27, 2019Updated 7 years ago
speciallurain / CNKI_Patent_SVM
View on GitHub
文本分类是指在给定分类体系下 , 根据文本的内容自动确定文本类别的过程。首先我们根据scrapy爬虫根据中国知网URL的规律，爬取70多万条2014年公开的发明专利，然后通过数据清洗筛选出了60多万条含标签数据。通过TF-IDF对60多万条本文进行词频提取，依照词频排序提取…
☆108Mar 14, 2018Updated 8 years ago
Ninja91 / Human-Activity-Recognition
View on GitHub
Human Activity Recognition using ML on UCI HAR dataset
☆16Dec 13, 2014Updated 11 years ago
Sreyan88 / DALE
View on GitHub
Code for EMNLP 2023 paper: DALE: Generative Data Augmentation for Low-Resource Legal NLP
☆11Oct 27, 2023Updated 2 years ago
snowlixue / Prison-term-based-on-Chinese-NER
View on GitHub
一种基于中文命名实体识别与线性回归的刑期预测方法
☆18Dec 1, 2018Updated 7 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
thinkwee / eda_zh_bert
View on GitHub
Chinese version code for the paper "EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks"
☆11Jul 25, 2019Updated 6 years ago
natureLanguageQing / datafountain_news
View on GitHub
baseline分享-互联网新闻情感分析
☆11Oct 12, 2019Updated 6 years ago
HorieYuan / LSTMTextSummarization
View on GitHub
基于多级LSTM的抽取式文本摘要
☆12Aug 20, 2024Updated last year
StuPeter / Company_Annual_Report_Analysis_TF
View on GitHub
上市公司年报分析
☆12Jul 16, 2019Updated 6 years ago
dido1998 / coqa-bert-baselines
View on GitHub
BERT baselines for extractive question answering on coqa (https://stanfordnlp.github.io/coqa/)
☆10Jan 27, 2020Updated 6 years ago
yangzhiye / Papers-For-Text-Summarization
View on GitHub
papers
☆19Jul 25, 2017Updated 8 years ago
FreeFlyXiaoMa / sentence_generation
View on GitHub
抽取式摘要抽取算法（1、抽取式 2、生成式）
☆16Oct 13, 2019Updated 6 years ago
ZahraAbbasiantaeb / SimQUAC
View on GitHub
☆10Sep 9, 2024Updated last year
ouprince / svm
View on GitHub
svm 情感分析
☆16Jun 28, 2018Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
TeamLab / text-capsule-network
View on GitHub
☆15Jul 6, 2023Updated 2 years ago
buda-base / lucene-bo
View on GitHub
Lucene analyzer for Tibetan
☆12Oct 23, 2025Updated 8 months ago
YCG09 / tf-text-classification
View on GitHub
CNN and RNN with Attention for Chinese Text Classification in Tensorflow
☆85May 8, 2019Updated 7 years ago
joocxi / tf-flowqa
View on GitHub
A TensorFlow implementation of FlowQA
☆15Nov 24, 2018Updated 7 years ago
kaiyuanshe / 2016-China-Open-Source-Report
View on GitHub
Annual China Open Source Report by Kaiyuanshe
☆16Oct 25, 2016Updated 9 years ago
lucas-acc / HMM_POS
View on GitHub
a simple implementation of part-of-speech tagging with hmm
☆13Feb 26, 2019Updated 7 years ago
chz816 / esacl
View on GitHub
Repo for Enhanced Seq2Seq Autoencoder via Contrastive Learning for Abstractive Text Summarization
☆23Aug 26, 2021Updated 4 years ago
CONE-MT / BenchMAX
View on GitHub
☆29Mar 13, 2026Updated 3 months ago
JD-AI-Research-NLP / RoR
View on GitHub
☆14Jul 27, 2022Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
SuMeng123 / text_bert_cnn_rnn
View on GitHub
在 Google BERT Fine-tuning基础上，利用cnn/rnn进行中文文本的分类
☆19Oct 28, 2019Updated 6 years ago
Zbored / Chinese-sentiment-analysis
View on GitHub
gensim-word2vec+svm文本情感分析
☆103Sep 4, 2017Updated 8 years ago
FudanNLP / Tibetan-Classification
View on GitHub
☆19Jun 20, 2017Updated 9 years ago
NUSTM / PMI
View on GitHub
根据褒贬种子词，利用SO-PMI构建情感词典
☆26Oct 30, 2015Updated 10 years ago
yechong316 / Chinese-NER
View on GitHub
针对3个语料库，玻森数据 (https://bosonnlp.com) 、1998年人民日报标注数据、MSRA微软亚洲研究院开源数据，在前人的基础上，重新升级换代，达到更高的精确率。
☆13Jun 11, 2019Updated 7 years ago
itlubber / ChatGPTQAG
View on GitHub
基于ChatGPT的问答对自动生成，可复用于其他NLP领域
☆20Apr 3, 2023Updated 3 years ago
rainstorm12 / KG-RAG
View on GitHub
简单实现了一下基于知识图谱和文本文档联合做检索增强(RAG)大模型的实现，这里采用的数据分别是管廊维护领域的文本文档和专家知识图谱
☆24Jun 6, 2024Updated 2 years ago