mattzheng / py-yanwenziLinks
网络表情NLP,颜文字识别,颜文字表情实体识别、属性检测、新颜发现
☆44Updated 5 years ago
Alternatives and similar repositories for py-yanwenzi
Users that are interested in py-yanwenzi are comparing it to the libraries listed below
Sorting:
- 汉字字符特征提取工具,可以提取出字符中的字音(声母、韵母、声调)、字形(偏旁、部首)、四角编码等特征,同时可作为tensor输入到模型☆138Updated 5 years ago
- ☆102Updated 5 years ago
- lasertagger-chinese;lasertagger中文学习案例,案例数据,注释,shell运行☆76Updated 2 years ago
- 基于百度webqa与dureader数据集训练的Albert Large QA模型☆77Updated 5 years ago
- 中文生成式预训练模型☆99Updated 5 years ago
- An Open-Source Package for Chinese Open-domain Conversational Chatbot (中文闲聊对话系统,一键部署微信闲聊机器人)☆109Updated 2 years ago
- Unilm for Chinese Chitchat Robot.基于Unilm模型的夸夸式闲聊机器人项目。☆158Updated 4 years ago
- NLP NER datasets video/music/book bio☆90Updated 4 years ago
- 中文版unilm预训练模型☆83Updated 4 years ago
- Modify Chinese text, modified on LaserTagger Model. I name it "文本手术刀".目前,本项目实现了一个文本复述任务,用于NLP语料的数据增强。☆214Updated 2 years ago
- 李傲龍的博客☆82Updated last year
- emoji switch(supporting Chinese and English)☆44Updated 4 years ago
- 用bert4keras来解小学数学应用题☆77Updated 5 years ago
- 基于预训练模型 BERT 的阅读理解☆96Updated last week
- Python toolkit for Chinese Language Understanding(CLUE) Evaluation benchmark☆133Updated 2 years ago
- 中文纠错☆93Updated 3 years ago
- 基于bert进行中文文本纠错☆240Updated 2 years ago
- 夸夸语料,来自豆瓣互相表扬组数据☆78Updated 6 years ago
- Quick run NLP in many task 快速运行分类、序列标注、匹配、生成等NLP任务的Tensorflow框架 (中文 NLP 支持分布式)☆31Updated 5 years ago
- This is a corpus of Chinese abbreviation, including negative full forms.☆197Updated 4 years ago
- Code for chinese error detection module, using n-gram and bi-lstm☆135Updated 6 years ago
- chinese anti semantic word search interface based on dict crawled from online resources, ChineseAntiword,针对中文词语的反义词查询接口☆59Updated 7 years ago
- 教育行业新闻 自动文摘 语料库 自动摘要☆203Updated 7 years ago
- 在bert4keras下加载CPM_LM模型☆51Updated 5 years ago
- 各大中文分词性能评测☆159Updated 6 years ago
- ChineseHumorSentiment, chinese humor sentiment mining including corpus build and mining nlp methods.中文文本幽默情绪计算项目,项目包括幽默文本语料库的构建,幽默计算模型,包括…☆132Updated 6 years ago
- 本项目使用云问科技训练的中文版UniLM模型对微博数据集进行自动标题生成。☆39Updated last year
- 基于 TensorFlow & PaddlePaddle 的通用序列标注算法库(目前包含 BiLSTM+CRF, Stacked-BiLSTM+CRF 和 IDCNN+CRF,更多算法正在持续添加中)实现中文分词(Tokenizer / segmentation)、词性标注…☆86Updated 2 years ago
- 用BERT在百度WebQA中文问答数据集上做阅读问答☆65Updated 5 years ago
- CLUEWSC2020: WSC Winograd模式挑战中文版,中文指代消解任务☆78Updated 5 years ago