毕业论文代码 + 评论文本数据获取+数据清洗+文本数据向量化+将数据放进分类器(KNN+Naive Bayes+SVM)中训练+结果评估
☆55May 17, 2022Updated 4 years ago
Alternatives and similar repositories for Text-classification-and-code
Users that are interested in Text-classification-and-code are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 多轮中文聊天机器人,采用GPT2进行微调,清洗聊天数据110w+,采用语义相似度和文本jaccard相似度过滤回话。☆22Nov 13, 2021Updated 4 years ago
- 使用scik-learn 实现k-means,KNN,SVM,贝叶斯,topic_extraction等算法,同时评估分类的准确率,召回率和F值。语料库是中文文本☆43Jul 23, 2017Updated 8 years ago
- ☆12Nov 7, 2022Updated 3 years ago
- K-Means is a clustering algorithm which is used for cluster analysis in data mining; it partitions the data set into k clusters. In this …☆10Aug 19, 2017Updated 8 years ago
- Application for processing Chinese text : Sentiment , Keywords , Abstract☆10Apr 13, 2017Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 文本分类是指在给定分类体系下 , 根据文本的内容自动确定文本类别的过程。首先我们根据scrapy爬虫根据中国知网URL的规律,爬取70多万条2014年公开的发明专利,然后通过数据清洗筛选出了60多万条含标签数据。通过TF-IDF对60多万条本文进行词频提取,依照词频排序提取…☆108Mar 14, 2018Updated 8 years ago
- 基于深度学习(tensorflow)的中文文本分类☆15Apr 3, 2019Updated 7 years ago
- 一种基于中文命名实体识别与线性回归的刑期预测方法☆18Dec 1, 2018Updated 7 years ago
- Chinese version code for the paper "EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks"☆11Jul 25, 2019Updated 6 years ago
- 基于多级LSTM的抽取式文本摘要☆12Aug 20, 2024Updated last year
- ☆17Oct 8, 2023Updated 2 years ago
- [USENIX Security 2024] Official Repository of 'KnowPhish: Large Language Models Meet Multimodal Knowledge Graphs for Enhancing Reference-…☆17Aug 6, 2025Updated 9 months ago
- papers☆19Jul 25, 2017Updated 8 years ago
- 抽取式摘要抽取算法(1、抽取式 2、生成式)☆16Oct 13, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- svm 情感分析☆16Jun 28, 2018Updated 7 years ago
- Malware analysis using Naive Bayes ML classfier☆11Sep 9, 2019Updated 6 years ago
- 一个用于中文文本的抽取式摘要工具 a tool for chinese summary extraction☆19Nov 24, 2020Updated 5 years ago
- ☆15Jul 6, 2023Updated 2 years ago
- Lucene analyzer for Tibetan☆12Oct 23, 2025Updated 7 months ago
- A TensorFlow implementation of FlowQA☆15Nov 24, 2018Updated 7 years ago
- Annual China Open Source Report by Kaiyuanshe☆16Oct 25, 2016Updated 9 years ago
- a corpus containing 4.5K conversations from the Conversational Question-Answering dataset CoQA, for a total of 53K follow-up question-ans…☆16Jun 12, 2023Updated 2 years ago
- 使用Django搭建的基于Neo4j知识图谱的人际关系搜索与六度关系搜索系统,使用Mongo存储语料输出,使用Neo4j维护知识图谱☆15Apr 30, 2019Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- a simple implementation of part-of-speech tagging with hmm☆13Feb 26, 2019Updated 7 years ago
- Repo for Enhanced Seq2Seq Autoencoder via Contrastive Learning for Abstractive Text Summarization☆23Aug 26, 2021Updated 4 years ago
- Classifying text using Bi-LSTM, Transformer, BERT-ATT, BERT-BILSTM and BERT-CNN(文本分类)ChineseGLUE☆13Apr 2, 2023Updated 3 years ago
- 😎 Curated list of tibetan canon datasets☆17Apr 6, 2020Updated 6 years ago
- ☆14Jul 27, 2022Updated 3 years ago
- django 校园卡管理系统☆49Dec 26, 2020Updated 5 years ago
- 在 Google BERT Fine-tuning基础上,利用cnn/rnn进行中文文本的分类☆19Oct 28, 2019Updated 6 years ago
- gensim-word2vec+svm文本情感分析☆104Sep 4, 2017Updated 8 years ago
- LCSTS,ROUGE,short text summarization,NLG,seq2seq☆23Jul 25, 2017Updated 8 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- 针对3个语料库,玻森数据 (https://bosonnlp.com) 、1998年人民日报标注数据、MSRA微软亚洲研究院开源数据,在前人的基础上,重新升级换代,达到更高的精确率。☆13Jun 11, 2019Updated 6 years ago
- 基于ChatGPT的问答对自动生成,可复用于其他NLP领域☆20Apr 3, 2023Updated 3 years ago
- TextClf :基于Pytorch/Sklearn的文本分类框架,包括逻辑回归、SVM、TextCNN、TextRNN、TextRCNN、DRNN、DPCNN、Bert等多种模型,通过简单配置即可完成数据处理、模型训练、测试等过程。☆246Jul 21, 2023Updated 2 years ago
- NLPCC 2017 task3 article text summary☆23Jul 25, 2017Updated 8 years ago
- 对b站弹幕、评论进行爬虫,然后使用Word2Vec模型将其转化为词向量进行分析☆33May 2, 2022Updated 4 years ago
- Extrapolate gender from first names using Naïve-Bayes and PyTorch Char-RNN☆24Dec 27, 2017Updated 8 years ago
- Albert for Conversational Question Answering Challenge☆22Jun 12, 2023Updated 2 years ago