xieyan0811 / pdfconv
中文PDF转TXT的实用工具
☆30Updated 3 years ago
Alternatives and similar repositories for pdfconv:
Users that are interested in pdfconv are comparing it to the libraries listed below
- 公司名简称生成,采用马尔科夫构造序列标注概率分布,使用维特比前后向算法推导生成。☆27Updated 6 years ago
- self complemented SpellCorrection based pinyin similairity, edit distance ,基于拼音相似度与编辑距离的查询纠错。☆82Updated 2 years ago
- 利用文本分析算法和Python脚本,自动纠正word中的英语单词拼写错误☆47Updated 6 years ago
- 电商评论观点挖掘☆39Updated 5 years ago
- Event monitor based on online news corpus including event storyline and analysis,基于给定事件关键词,采集事件资讯,对事件进行挖掘和分析。☆152Updated 6 years ago
- 发现新词 无监督词库生成 医学词库生成 发现未登录词☆77Updated 4 years ago
- 百度百科爬虫☆33Updated 5 years ago
- 使用Simhash对海量文本进行去重☆12Updated 6 years ago
- 专业领域词库构建/中文新词发现/专业词库发现☆29Updated 5 years ago
- Sequential Event Experiment based on Travel note crawled from XieCheng,基于50W携程出行游记的采集与顺承事件图谱构建.☆181Updated 6 years ago
- ChineseHumorSentiment, chinese humor sentiment mining including corpus build and mining nlp methods.中文文本幽默情绪计算项目,项目包括幽默文本语料库的构建,幽默计算模型,包括…☆119Updated 6 years ago
- SinglepassTextCluster, an TextCluster tools based on Singlepass cluster algorithm that use tfidf vector and doc2vec,which can be used for…☆62Updated 3 years ago
- Word similarity computation based on Tongyici Cilin☆119Updated 7 years ago
- ZhidaoChatbot, a chatbot that can be an expert on the common questions like why,how,when,who,what based on the online question-answer web…☆42Updated 6 years ago
- 基于gensim模块的中文句子相似度计算☆53Updated 6 years ago
- Chinese Subjective Dectection based on subjective knowlegebase, 中文主观性计算。基于中文主观性知识库的句子主观性评定方法。☆57Updated last year
- 基于ltp的简单评论观点抽取模块☆116Updated 6 years ago
- 知识图谱的小demo☆17Updated 6 years ago
- ChineseTextualInference project including chinese corpus build and inferecence model, 中文文本推断项目,包括88万文本蕴含中文文本蕴含数据集的翻译与构建,基于深度学习的文本蕴含判定模型构建…☆172Updated 6 years ago
- 基于互信息和邻接信息熵的中文分词和新词发现☆14Updated 6 years ago
- 中文环境突发事件语料库(Chinese Environment Emergency Corpus)-上海大学-语义智能实验室☆46Updated 9 years ago
- 微博自动摘要系统 Chinese Microblog Automatic Summary System☆30Updated 5 years ago
- 端到端的基于知识图谱的问答系统,分为实体识别和关系分类两部,在BERT基础上做多任务联合训练。☆30Updated 5 years ago
- WordMultiSenseDisambiguation, chinese multi-wordsense disambiguation based on online bake knowledge base and semantic embedding similarit…☆127Updated 6 years ago
- 企业事件抽取☆14Updated 3 years ago
- 金庸小说人物关系图谱构建☆61Updated 5 years ago
- Quick run NLP in many task 快速运行分类、序列标注、匹配、生成等NLP任务的Tensorflow框架 (中文 NLP 支持分布式)☆30Updated 4 years ago
- 百度百科爬虫☆71Updated 9 months ago
- 2018-“AIIA”杯-国家电网-电力专业领域词汇挖掘:5st/451☆25Updated 6 years ago
- Cognitive Inference,认知推理、常识知识库、常识推理与常识推理评估的系统项目,以现有国内外已有的常识知识库为研究对象,从常识知识库资源建设和常识推理测试评估两个方面出发进行整理,并结合自己近几年来在逻辑性推理知识库的构建、应用以及理论思考进行介绍。具体包括…☆122Updated 4 years ago