Tianyijian / poetry
china ancient poetry project data
☆19Updated 6 years ago
Alternatives and similar repositories for poetry:
Users that are interested in poetry are comparing it to the libraries listed below
- 古诗词语料库☆129Updated 8 years ago
- 中国诗词歌赋数据库 总计82万余首(827108) CSV 格式 简体中文 按照number有序☆59Updated last month
- 比较全的中华古诗古词古文库,包括21万首古诗词,以及注释、赏析等信息,包含10000多名诗人以及诗人的介绍、生平等,同时包含,1600多个词牌介绍,中国70多个朝代解析,和古诗文的近200个分类标签☆340Updated last year
- chatbot based on music region using method including es and music kb.基于14W歌曲知识库的问答尝试,功能包括歌词接龙,已知歌词找歌曲以及歌曲歌手歌词三角关系的问答。☆270Updated 6 years ago
- BERT-CCPoem is an BERT-based pre-trained model particularly for Chinese classical poetry☆153Updated 3 years ago
- 图书名语料库。含部分电影、游戏名称。☆71Updated 11 months ago
- 汉语古典文本资料库☆272Updated 7 years ago
- 中文相关词典和语料库。☆172Updated 10 years ago
- 同义词表,反义词表,否定词表☆527Updated 5 months ago
- 汉字字符特征提取工具,可以提取出字符中的字音(声母、韵母、声调)、字形(偏旁、部首)、四角编码等特征,同时可作为tensor输入到模型☆134Updated 4 years ago
- This is a corpus of Chinese abbreviation, including negative full forms.☆194Updated 3 years ago
- 汉字数据集,包括汉字的相关信息,例如笔画数、部首、拼音、英文释义/同义词等。☆115Updated 4 years ago
- Chinese Classic Poem Mining Project including corpus buiding by spyder and content analysis by nlp methods, 基于爬虫与nlp的中国古代诗词文本挖掘项目☆112Updated 6 years ago
- Poetry-related datasets developed by THUAIPoet (Jiuge) group.☆225Updated 4 years ago
- 古汉语(文言文)字典-爬取文言文字典网,制作Kindle字典.☆66Updated 6 years ago
- 汉字拆字库,可以将汉字拆解成偏旁部首,在机器学习中作为汉字的字形特征 | Hanzi Decomposition Library allows Chinese characters to be broken down into radicals and components…☆365Updated 5 months ago
- 中华古诗文数据库和API。包含10000首古文(诗、词、歌、赋以及其它形式的文言文),近4000名作者,10000名句☆474Updated 7 months ago
- 金庸小说人物关系图谱构建☆61Updated 5 years ago
- WordForm,针对中文词语的笔画拆解,偏旁查询,拼音转换接口☆65Updated 6 years ago
- 物种名称语料库。植物名,动物名。☆48Updated 11 months ago
- 新词发现算法(NewWordDetection)☆92Updated 4 years ago
- NLP NER datasets video/music/book bio☆88Updated 4 years ago
- HyponymyExtraction and Graph based on KB Schema, Baike-kb and online text extract, 基于知识概念体系,百科知识库,以及在线搜索结构化方式的词语上下位抽取与可视化展示☆170Updated 6 years ago
- 一个中文的已标注词性的语料库☆201Updated 10 years ago
- Chinese Couplets Dataset without vulgar words. 不包含敏感内容的对联数据集。☆72Updated 5 years ago
- Train Wikidata with word2vec for word embedding tasks☆122Updated 6 years ago
- 成语数据 Chinese idiom data☆74Updated 7 years ago
- self complemented SpellCorrection based pinyin similairity, edit distance ,基于拼音相似度与编辑距离的查询纠错。☆82Updated 2 years ago
- course project☆122Updated 5 years ago
- 爬取自互联网的古诗词语料库,包含先秦至当代诗词,共计1014508首诗☆30Updated 3 years ago