fighting41love/Chinese_from_dongxiexidian

fighting41love / Chinese_from_dongxiexidian

mirror of dongxiexidian/Chinese

☆306

Alternatives and similar repositories for Chinese_from_dongxiexidian

Users who are interested in Chinese_from_dongxiexidian are comparing it to the repositories listed below.

zhangyics / Chinese-abbreviation-dataset
A corpus of Chinese abbreviations, including negative full forms.
☆198 · Jul 17, 2021 · Updated 5 years ago
foowaa / Chinese_from_dongxiexidian
Chinese preprocessing corpus.
☆114 · Dec 18, 2018 · Updated 7 years ago
fighting41love / cocoNLP
A Chinese information extraction tool.
☆1,129 · Jun 28, 2022 · Updated 4 years ago
thunlp / THUOCL
THUOCL (THU Open Chinese Lexicon), a Chinese lexicon.
☆1,100 · Apr 3, 2023 · Updated 3 years ago
guotong1988 / chinese_dictionary
Synonym list, antonym list, and negation word list.
☆539 · Oct 17, 2024 · Updated last year
observerss / textfilter
Several implementations of sensitive-word filtering, plus a sensitive-word list of about 10,000 entries.
☆2,109 · Aug 20, 2021 · Updated 4 years ago
tinyfool / ChineseWithEnglish
An absolutely fun Chinese pronunciation engine (a funny Chinese text-to-speech engine).
☆52 · Sep 4, 2013 · Updated 12 years ago
kfcd / chaizi
Chinese character decomposition dictionary.
☆815 · Jan 8, 2023 · Updated 3 years ago
zedom1 / Error-Detection
Code for a Chinese error detection module, using n-grams and a Bi-LSTM.
☆136 · Mar 31, 2019 · Updated 7 years ago
fighting41love / funNLP
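Chinese and English sensitive words, language detection, domestic and foreign mobile/landline number location and carrier lookup, gender inference from names, mobile number extraction, ID card number extraction, email extraction, Chinese and Japanese name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, …
☆81,942 · May 10, 2024 · Updated 2 years ago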
LG-1 / video_music_book_datasets
NLP NER datasets video/music/book bio
☆90 · Jan 3, 2021 · Updated 5 years ago
deepcs233 / jieba_fast
Uses the C API and SWIG to speed up jieba; an efficient Chinese word segmentation library.
☆646 · Aug 27, 2021 · Updated 4 years ago
wainshine / Chinese-Names-Corpus
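Chinese personal-name corpus and name generator. Covers Chinese full names, surnames, given names, forms of address, Japanese names, transliterated names, and English names. Useful for Chinese word segmentation and person-name entity recognition.
☆4,318 · Nov 9, 2025 · Updated 8 months ago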
zhanzecheng / Chinese_segment_augment
New-word discovery based on mutual information and left/right entropy, implemented in Python 3.
☆593 · Aug 1, 2019 · Updated 6 years ago
howl-anderson / tools_for_corpus_of_people_daily
Tools for Corpus of People's Daily: a toolkit for processing the People's Daily corpus.
☆290 · Jul 6, 2023 · Updated 3 years ago
liuhuanyong / ChineseEmbedding
A collection of Chinese embeddings for NLP, including character (token), POS-tag, pinyin, dependency-relation, and word embeddings: five types of vectors in total.
☆455 · Dec 15, 2018 · Updated 7 years ago
qianzhengyang / AllDataPackages
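Chinese, word segmentation, word lists, core dictionary, event word lists, stop words, sensitive words, question answering, QA data, knowledge graphs, and text corpora.
☆174 · Oct 12, 2021 · Updated 4 years ago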
deadshot465 / novelcrafter-mcp
An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices.
☆11 · Dec 3, 2024 · Updated last year
brightmart / nlp_chinese_corpus
Large Scale Chinese Corpus for NLP.
☆9,904 · Feb 6, 2026 · Updated 5 months ago
liuhuanyong / QueryCorrection
Self-implemented spelling correction for queries, based on pinyin similarity and edit distance.
☆83 · May 20, 2022 · Updated 4 years ago
Embedding / Chinese-Word-Vectors
100+ Chinese Word Vectors: over a hundred kinds of pre-trained Chinese word vectors.
☆12,229 · Oct 30, 2023 · Updated 2 years ago
rainarch / SentiBridge
SentiBridge: A Knowledge Base for Entity-Sentiment Representation
☆639 · Sep 20, 2018 · Updated 7 years ago
WeblateOrg / hello
Hello world demonstration for Weblate
☆15 · Jan 20, 2026 · Updated 6 months ago
vale-cli / SubVale
A Sublime Text 3 client for Vale Server.
☆13 · Dec 7, 2020 · Updated 5 years ago
howl-anderson / hanzi_char_featurizer
A Chinese character feature extractor (featurizer), which extracts features of Chinese characters (pronunciation features, glyph features) for use as deep learning features.
☆301 · Dec 29, 2025 · Updated 6 months ago
blmoistawinde / HarvestText
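Text mining and preprocessing tool (text cleaning, new-word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, etc.), using unsupervised or weakly supervised methods.
☆2,623 · May 13, 2024 · Updated 2 years ago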
esnme / landscape
A Stylus-powered frontend CSS toolkit for building rich and beautiful web apps.
☆16 · Apr 2, 2012 · Updated 14 years ago
nonamestreet / weixin_public_corpus
WeChat official account corpus.
☆593 · Jan 7, 2019 · Updated 7 years ago
InsaneLife / ChineseNLPCorpus
Chinese NLP datasets, material for everyday experiments. Additions and pull requests are welcome.
☆4,603 · Nov 21, 2023 · Updated 2 years ago
magesh-technovator / awesome-ai-applications
A comprehensive survey of business use cases of AI that help businesses thrive in the digital economy.
☆13 · Oct 7, 2020 · Updated 5 years ago
google / arc-proselint
A proselint linter for use with Phabricator's arc command line tool.
☆17 · Jun 17, 2016 · Updated 10 years ago
DjagbleyEmmanuel / llamafile-convert_gguf_UI
This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users…
☆14 · Jan 2, 2026 · Updated 6 months ago
csisc / OpenCitations-Bot
A bot to add citation data from OpenCitations to Wikidata
☆12 · May 23, 2023 · Updated 3 years ago
codemayq / chinese-chatbot-corpus
Public Chinese chat corpus.
☆4,192 · Apr 23, 2024 · Updated 2 years ago
hltfbk / CROMER
CROMER (CROss-document Main Events and entities Recognition) is a tool for cross-document coreference.
☆12 · Jan 14, 2015 · Updated 11 years ago
jonathanwiesel / matterqus
New Disqus comment notifier for Mattermost
☆10 · Nov 19, 2015 · Updated 10 years ago
SophonPlus / ChineseNlpCorpus
Collects, organizes, and publishes Chinese NLP corpora and datasets, working with like-minded people to advance Chinese natural language processing.
☆6,588 · Jan 29, 2019 · Updated 7 years ago
iwater / node-stanford-corenlp
A simple node.js wrapper for Stanford CoreNLP.
☆10 · Aug 7, 2014 · Updated 11 years ago
mozillazg / phrase-pinyin-data
Pinyin data for words and phrases.
☆530 · Jul 20, 2025 · Updated last year