zhangyics/Chinese-abbreviation-dataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zhangyics/Chinese-abbreviation-dataset)

zhangyics / Chinese-abbreviation-dataset

This is a corpus of Chinese abbreviation, including negative full forms.

☆198

Alternatives and similar repositories for Chinese-abbreviation-dataset

Users that are interested in Chinese-abbreviation-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kfcd / chaizi
View on GitHub
漢語拆字字典
☆817Jan 8, 2023Updated 3 years ago
guotong1988 / chinese_dictionary
View on GitHub
同义词表，反义词表，否定词表
☆539Oct 17, 2024Updated last year
rainarch / SentiBridge
View on GitHub
SentiBridge: A Knowledge Base for Entity-Sentiment Representation
☆638Sep 20, 2018Updated 7 years ago
yaleimeng / Final_word_Similarity
View on GitHub
综合了同义词词林扩展版与知网（Hownet）的词语相似度计算方法，词汇覆盖更多、结果更准确。
☆744Feb 16, 2022Updated 4 years ago
zedom1 / Error-Detection
View on GitHub
Code for chinese error detection module, using n-gram and bi-lstm
☆136Mar 31, 2019Updated 7 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
fighting41love / Chinese_from_dongxiexidian
View on GitHub
mirror of dongxiexidian/Chinese
☆306Dec 18, 2018Updated 7 years ago
tinyfool / ChineseWithEnglish
View on GitHub
绝对有趣的中文发音引擎 funny chinese text to speech enginee
☆52Sep 4, 2013Updated 12 years ago
wainshine / Company-Names-Corpus
View on GitHub
公司名语料库。机构名语料库。公司简称,缩写,品牌词,企业名。可用于中文分词、机构名实体识别。
☆1,294Mar 27, 2024Updated 2 years ago
wainshine / Chinese-Names-Corpus
View on GitHub
中文人名语料库。人名生成器。中文姓名,姓氏,名字,称呼,日本人名,翻译人名,英文人名。可用于中文分词、人名实体识别。
☆4,320Nov 9, 2025Updated 8 months ago
crownpku / Small-Chinese-Corpus
View on GitHub
Some useful Chinese corpus datasets 中文语料小数据
☆547Mar 29, 2020Updated 6 years ago
nonamestreet / weixin_public_corpus
View on GitHub
微信公众号语料库
☆593Jan 7, 2019Updated 7 years ago
howl-anderson / tools_for_corpus_of_people_daily
View on GitHub
人民日报语料处理工具集 | Tools for Corpus of People's Daily
☆290Jul 6, 2023Updated 3 years ago
langcog / metalab-archive
View on GitHub
meta-analyses of language acquisition phenomena
☆13May 28, 2019Updated 7 years ago
howl-anderson / hanzi_char_featurizer
View on GitHub
汉字字符特征提取器 (featurizer)，提取汉字的特征（发音特征、字形特征）用做深度学习的特征｜ A Chinese character feature extractor, which extracts the features of Chinese charac…
☆301Dec 29, 2025Updated 7 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
z17176 / Chinese_conversation_sentiment
View on GitHub
A Chinese sentiment dataset may be useful for sentiment analysis.
☆234Nov 15, 2016Updated 9 years ago
liuhuanyong / AbstractKnowledgeGraph
View on GitHub
AbstractKnowledgeGraph, a systematic knowledge graph that concentrate on abstract thing including abstract entity and action. 抽象知识图谱，目前规模…
☆248Aug 6, 2019Updated 6 years ago
ymcui / Chinese-Cloze-RC
View on GitHub
A Chinese Cloze-style RC Dataset: People's Daily & Children's Fairy Tale (CFT)
☆175Mar 26, 2019Updated 7 years ago
LG-1 / video_music_book_datasets
View on GitHub
NLP NER datasets video/music/book bio
☆90Jan 3, 2021Updated 5 years ago
khiajohnson / SpiCE-Corpus
View on GitHub
An open-access corpus of conversational bilingual speech in Cantonese and English
☆40Apr 28, 2022Updated 4 years ago
Azhag / Bayesian-visual-working-memory
View on GitHub
Bayesian Visual Working Memory in Python.
☆13Mar 28, 2020Updated 6 years ago
charlesXu86 / char_featurizer
View on GitHub
汉字字符特征提取工具，可以提取出字符中的字音（声母、韵母、声调）、字形（偏旁、部首）、四角编码等特征，同时可作为tensor输入到模型
☆138May 25, 2020Updated 6 years ago
codemayq / chinese-chatbot-corpus
View on GitHub
中文公开聊天语料库
☆4,193Apr 23, 2024Updated 2 years ago
chatopera / Synonyms
View on GitHub
中文近义词：聊天机器人，智能问答工具包
☆5,108Feb 1, 2026Updated 5 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Ghostwritten / GitBook-auto-summary
View on GitHub
Automatically update SUMMARY.md of a GitBook repo，default Based on the markdown title, not the article name，But if without a title, artic…
☆11Jul 6, 2022Updated 4 years ago
google-research-datasets / uninum
View on GitHub
A database of number names for 186 languages, locales, and scripts
☆67Mar 3, 2023Updated 3 years ago
startprogress / China_stock_announcement
View on GitHub
该项目通过scrapy爬虫从巨潮网络的服务器获取中国股市的公告
☆217May 24, 2020Updated 6 years ago
baijiangliang / year2018
View on GitHub
Annual report for programmers.
☆21Jan 3, 2019Updated 7 years ago
sakuranew / BERT-AttributeExtraction
View on GitHub
USING BERT FOR Attribute Extraction in KnowledgeGraph. fine-tuning and feature extraction. …
☆266Apr 1, 2019Updated 7 years ago
fighting41love / cocoNLP
View on GitHub
A Chinese information extraction tool.
☆1,129Jun 28, 2022Updated 4 years ago
CooperMin / qichacha
View on GitHub
企查查企业分类信息采集
☆43Apr 2, 2020Updated 6 years ago
deadshot465 / novelcrafter-mcp
View on GitHub
An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices.
☆11Dec 3, 2024Updated last year
hscspring / ALL4AI
View on GitHub
AI Related Tools/Projects
☆25Apr 20, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
clearboy / IA03BP
View on GitHub
通识教育的信息、系统论、控制论解读
☆12Jan 16, 2019Updated 7 years ago
chatopera / insuranceqa-corpus-zh
View on GitHub
保险行业语料库，聊天机器人
☆1,063May 26, 2025Updated last year
ashengtx / CilinSimilarity
View on GitHub
Word similarity computation based on Tongyici Cilin
☆122Jun 27, 2017Updated 9 years ago
Embedding / Chinese-Word-Vectors
View on GitHub
100+ Chinese Word Vectors 上百种预训练中文词向量
☆12,228Oct 30, 2023Updated 2 years ago
jermnelson / BIBFRAME-Datastore
View on GitHub
BIBFRAME Datastore is a Linked-Data project for managing bibliographic records and operational data focused on libraries and other simila…
☆16Sep 17, 2015Updated 10 years ago
fighting41love / hardNLU
View on GitHub
NLU is hard!!!
☆273Mar 19, 2019Updated 7 years ago
panhaiqi / AncientPoetry
View on GitHub
古诗词语料库
☆138Mar 25, 2017Updated 9 years ago