A collection of resources on Chinese text analysis tools, corpora, and pretrained models.
☆144Sep 12, 2025Updated 8 months ago
Alternatives and similar repositories for Chinese-Pretrained-Word-Embeddings
Users that are interested in Chinese-Pretrained-Word-Embeddings are comparing it to the libraries listed below.
- Chinese sentiment analysis library (Chinese Sentiment) for emotion analysis and positive/negative sentiment analysis of text. Text analysis, supporting multiple methods including word count, readability, document simil…☆589Dec 9, 2022Updated 3 years ago
- Covers web scraping, databases, data analysis, machine learning, visualization, text analysis, GUI, and office automation☆14Jan 14, 2022Updated 4 years ago
- cntext is a Python library for social science text analysis, offering word frequency, sentiment, word embeddings, and semantic projection…☆453May 3, 2026Updated 2 weeks ago
- Experimental code for testing KwaiSurvival (2021/7/9)☆11Aug 17, 2021Updated 4 years ago
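- Chinese text analysis toolkit (including text classification, text clustering, text similarity, keyword extraction, key phrase extraction, sentiment analysis, text correction, text summarization, topic keywords, synonyms and near-synonyms, and event triple extraction)☆732Oct 3, 2023Updated 2 years ago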
- ☆11Apr 4, 2018Updated 8 years ago
- An implementation of the exponential random graph model☆28May 14, 2014Updated 12 years ago
- CoMOLA is a generic python tool for Constrained Multi-objective Optimization of Land use Allocation. It offers a framework to explore a l…☆30Dec 10, 2025Updated 5 months ago
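- Text mining and preprocessing tools (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, etc.), using unsupervised or weakly supervised methods☆2,613May 13, 2024Updated 2 years ago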
- Code and data for the COLING 2020 paper "Try to Substitute: An Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet"☆14Dec 2, 2020Updated 5 years ago
- Computes text similarity based on TF-IDF and cosine similarity☆36Aug 29, 2018Updated 7 years ago
- Applied BERT based model to extract relations from 29 annual reports of listed companies and news; Used spaCy library and BERT model for …☆13Feb 2, 2022Updated 4 years ago
- AlphaReadabilityChinese is a tool that calculates the readability of Chinese texts, which includes indices at lexical, syntactic, and sem…☆39Mar 30, 2024Updated 2 years ago
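- Generates Tang poems from a Tang poetry corpus through denoising preprocessing, word segmentation, collocation generation, and topic generation. Based on Python☆15Aug 14, 2017Updated 8 years ago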
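- Quickly builds domain-specific sentiment lexicons (mobile phones, automobiles, etc.) using the SO_PMI mutual information algorithm and word embedding methods☆94Nov 16, 2021Updated 4 years ago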
- Spark (Python) study notes☆11Sep 25, 2018Updated 7 years ago
- Aligned bilingual word vectors for English and Chinese☆11Jun 25, 2018Updated 7 years ago
- ☆10Oct 20, 2020Updated 5 years ago
- Hands-on text classification, text mining, sentiment analysis, and text generation☆14Mar 22, 2023Updated 3 years ago
- Sentiment analysis using a Chinese sentiment lexicon ontology, followed by topic analysis of the texts for each emotion. Using Chinese Sentiment Dictionary for Sentiment Analysis, then applying LDA Topic Analysis for each E…☆14Jan 20, 2021Updated 5 years ago
- Identified which Enron employees are more likely to have committed fraud using machine learning and public Enron financial and email data…☆12May 26, 2017Updated 8 years ago
- ☆10Jun 2, 2023Updated 2 years ago
- ComplexNetworkSim is a Python package for the simulation of agents connected in a complex network.☆21Apr 24, 2020Updated 6 years ago
- ☆11Nov 27, 2018Updated 7 years ago
- Public JD/Taobao customer service dialogue data; a dialogue system built with a seq2seq generative model that won second place☆44Dec 8, 2022Updated 3 years ago
- Mines element recognition rules from the CEC corpus and automatically annotates raw news report corpora☆21May 14, 2015Updated 11 years ago
- Predicting gender of given Chinese names (93~99% test set accuracy).☆27Sep 18, 2025Updated 8 months ago
- AI course project: research and analysis of deep neural network models and algorithms for computing text similarity (BERT, SentenceBERT, SimCSE)☆17Jul 11, 2022Updated 3 years ago
- This is the course page of the summer school organized by Chunyang Fu from UCASS.☆14Aug 11, 2021Updated 4 years ago
- Chinese Sentiment Analysis: sentiment analysis of Chinese text☆192Mar 10, 2026Updated 2 months ago
- CMOS, a large-scale dataset of Chinese moral sentences☆10Apr 11, 2022Updated 4 years ago
- ☆32Mar 19, 2019Updated 7 years ago
- Stata command to generate color schemes☆18Mar 30, 2026Updated last month
- Chinese dialogue data cleaning☆32Nov 8, 2022Updated 3 years ago
- Predicting oncogenic potential of gene fusions☆13Feb 13, 2016Updated 10 years ago
- Chinese sentiment analysis, CNN, BI-LSTM, text classification☆1,088Oct 22, 2022Updated 3 years ago
- Article cosine similarity computation implemented in Python 3☆10Sep 28, 2017Updated 8 years ago
- Scrapes popular cities and POIs from Mafengwo, Dianping, Qyer, and TripAdvisor☆11Nov 30, 2016Updated 9 years ago
- Correlation analysis between Chinese names and gender☆13May 16, 2016Updated 10 years ago