xieyan0811 / pdfconv
A practical tool for converting Chinese PDFs to TXT
☆30 Updated 3 years ago
Related projects
Alternatives and complementary repositories for pdfconv
- Reading-comprehension question answering with BERT on the Baidu WebQA Chinese Q&A dataset ☆65 Updated 4 years ago
- Domain-specific lexicon construction / Chinese new-word discovery / domain lexicon discovery ☆28 Updated 4 years ago
- Sentence-Transformers Information Retrieval example on Chinese ☆29 Updated 9 months ago
- Self-implemented WeiboIndexSpyder based on Selenium; collects the Sina Weibo Index (微指数), including the overall, mobile, and PC indices ☆31 Updated 6 years ago
- Chinese sentence similarity computation based on the gensim module ☆54 Updated 6 years ago
- This part is no longer updated; the upgraded project is at https://github.com/we-chatter/chatbot_utils ☆34 Updated last year
- Extracts noun phrases from Chinese corpora using part-of-speech templates ☆17 Updated 3 years ago
- Self-implemented SpellCorrection: query spelling correction based on pinyin similarity and edit distance ☆79 Updated 2 years ago
- PNW, a Chinese new-word discovery algorithm that can identify new words of any length ☆15 Updated last year
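- Fine-tunes pretrained language models (BERT, Roberta, XLBert, etc.) to compute the similarity between two texts (cast as a sentence-pair classification task); suitable for Chinese text ☆90 Updated 4 years ago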
- DescriptionPairsExtraction, a program that extracts entity-description pairs, based on Albert and structured-data back-annotation… ☆20 Updated 2 years ago
- A Win10 intelligent customer-service Q&A system based on text similarity ☆14 Updated 4 years ago
- Common Chinese stopword lists and a comparison of them ☆65 Updated 5 years ago
- Building a character relationship graph for Jin Yong's novels ☆63 Updated 5 years ago
- A Tensorflow framework for quickly running NLP tasks such as classification, sequence labeling, matching, and generation (Chinese NLP, with distributed support) ☆30 Updated 4 years ago
- A keyword extraction project ☆24 Updated 4 years ago
- Regex-based extraction and normalization of time keywords ☆21 Updated 2 years ago
- Write-up of the third-place solution for the 2020 BAAI-JD multimodal dialogue competition (JDDC2020) ☆41 Updated 4 years ago
- Deduplicating massive text collections with Simhash ☆11 Updated 6 years ago
- Word-dictionary crawler and format converter for Sogou Input Method dictionaries ☆25 Updated 6 years ago
- ☆18 Updated last year
- Chinese entity linking based on bert ☆29 Updated 2 years ago
- “AIIA” Cup, State Grid: power-industry domain vocabulary mining ☆54 Updated 2 years ago
- A Paddle reproduction of the paper ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information (ACL2021) ☆9 Updated 3 years ago
- 2018 “AIIA” Cup, State Grid: power-industry domain vocabulary mining, 5th of 451 ☆24 Updated 5 years ago
- A small knowledge graph demo ☆16 Updated 6 years ago
- ChineseAntiword: an antonym lookup interface for Chinese words, based on a dictionary crawled from online resources ☆58 Updated 6 years ago
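- Baseline for the competition on recognizing netizens' sentiment during the epidemic: end-to-end fine-tuning with BERT, hosted on the datafountain platform, with a platform-evaluated F1 of 0.716 ☆35 Updated 4 years ago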
- Various text classification models implemented in tf and wrapped in a RESTful interface, ready for direct production use ☆32 Updated 5 years ago