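howl-anderson / scel2txt
A tool for converting Sogou cell lexicons (.scel) to plain text. It extracts vocabulary lists for use in deep learning data generation and as dictionary features.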
☆25Updated 6 years ago
Alternatives and similar repositories for scel2txt
Users that are interested in scel2txt are comparing it to the libraries listed below
- Corpus of book titles. Includes some movie and game titles.☆72Updated last year
- A corpus of Chinese abbreviations, including negative full forms.☆196Updated 4 years ago
- Chinese-related dictionaries and corpora.☆175Updated 11 years ago
- Converts Sogou Pinyin lexicons to txt files.☆50Updated 7 years ago
- Corpus creator for Chinese Wikipedia☆41Updated 4 years ago
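- A Python crawler that downloads lexicon files for the Sogou, Baidu, and QQ input methods; it can be used to build vocabularies for different industries.☆116Updated 7 years ago
- Automatically generates Chinese words from huge texts.☆92Updated 10 years ago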
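- Dianjing (点睛): a title generation tool for Toutiaohao articles (AI to write titles for articles)☆242Updated 7 years ago
- Medical corpus. Corpus of medical institution names. Drug standard codes (药品本位码).☆69Updated last year
- Chinese, word segmentation, word lists, core dictionary, event word list, stop words, sensitive words, question answering, QA data, knowledge graph, text corpus.☆170Updated 3 years ago
- Corpus of species names: plant names and animal names.☆51Updated last year
- New word discovery algorithm (NewWordDetection)☆93Updated 4 years ago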
- ChineseAntiword: an antonym lookup interface for Chinese words, based on a dictionary crawled from online resources.☆59Updated 7 years ago
- A self-implemented SpellCorrection for query correction, based on pinyin similarity and edit distance.☆84Updated 3 years ago
- A Classical Chinese (文言文) dictionary: crawls the Classical Chinese dictionary website to build a Kindle dictionary.☆67Updated 7 years ago
- Crawls summary-title pairs of news articles from portal websites and uses seq2seq to generate titles from summaries.☆45Updated 8 years ago
- A readability parser which can extract title, content, images from html pages☆87Updated 5 years ago
- Train Wikidata with word2vec for word embedding tasks☆123Updated 7 years ago
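- A chatbot for the music domain, using methods including ES and a music knowledge base. A question-answering experiment built on a knowledge base of 140,000 songs; features include lyric chaining, finding songs from known lyrics, and question answering over the song-singer-lyrics triangle.☆278Updated 6 years ago
- WordForm: an interface for stroke decomposition, radical lookup, and pinyin conversion of Chinese words.☆65Updated 7 years ago
- Corpus of recipe names.☆16Updated 4 years ago
- A self-implemented infobox extractor for baike (encyclopedia) knowledge bases using online analysis. Extracts structured infobox information from entries on Hudong Baike, Baidu Baike, and Sogou Baike, and fuses the encyclopedia knowledge.☆36Updated 7 years ago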
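- PNW, a Chinese new-word discovery algorithm that can identify new words of any length.☆16Updated 2 years ago
- Building a character relationship graph for Jin Yong's novels.☆63Updated 5 years ago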
- ☆61Updated last year
- A Java parser for the lexicon files of the Sogou, Baidu, and QQ input methods, for use with ThesaurusSpider.☆108Updated 5 years ago
- ZhidaoChatbot, a chatbot that can be an expert on the common questions like why,how,when,who,what based on the online question-answer web…☆42Updated 6 years ago
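- Classifies the chapters of Dream of the Red Chamber (红楼梦).☆37Updated 8 years ago
- A lightweight yet full-featured Chinese tokenizer that helps students understand how tokenizers work. MicroTokenizer: A lightweight Chinese tokenizer designed for educational and research purposes. Provides a…☆157Updated 10 months ago
- Corpus of classical Chinese poetry.☆136Updated 8 years ago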