xieyan0811 / pdfconvLinks
中文PDF转TXT的实用工具
☆32Updated 4 years ago
Alternatives and similar repositories for pdfconv
Users that are interested in pdfconv are comparing it to the libraries listed below
Sorting:
- ChineseHumorSentiment, chinese humor sentiment mining including corpus build and mining nlp methods.中文文本幽默情绪计算项目,项目包括幽默文本语料库的构建,幽默计算模型,包括…☆134Updated 7 years ago
- 根据维基中文语料库预训练 GloVe 中文词向量;Pre-train GloVe word-embedding From Chinese Wiki corpus☆79Updated 2 years ago
- SinglepassTextCluster, an TextCluster tools based on Singlepass cluster algorithm that use tfidf vector and doc2vec,which can be used for…☆65Updated 4 years ago
- 各大中文分词性能评测☆159Updated 6 years ago
- 百度百科爬虫☆76Updated last year
- Sequential Event Experiment based on Travel note crawled from XieCheng,基于50W携程出行游记的采集与顺承事件图谱构建.☆188Updated 7 years ago
- ChineseTextualInference project including chinese corpus build and inferecence model, 中文文本推断项目,包括88万文本蕴含中文文本蕴含数据集的翻译与构建,基于深度学习的文本蕴含判定模型构建…☆176Updated 7 years ago
- 汉字字符特征提取工具,可以提取出字符中的字音(声母、韵母、声调)、字形(偏旁、部首)、四角编码等特征,同时可作为tensor输入到模型☆138Updated 5 years ago
- 基于ltp的简单评论观点抽取模块☆117Updated 7 years ago
- 基于bert进行中文文本纠错☆240Updated 2 years ago
- Sentence-Transformers Information Retrieval example on Chinese☆30Updated last year
- Dataset from 'Character-based BiLSTM-CRF Incorporating POS and Dictionaries for Chinese Opinion Target Extraction'☆45Updated 7 years ago
- 使用pyltp的工具,基于中文依存句法的四大名著人物情节分析系统。分为整体分析和章节分析两大模块,实现了人物篇幅分析,故事发生地分析,主要人物情绪变化分析,人物互动情况分析.☆94Updated 8 years ago
- 基于哈工大同义词词林扩展版的单词相似度计算方法☆372Updated 2 years ago
- 中文、分词、词表、核心词典、事件词表、停用词、敏感词、问答、问答数据、知识图谱、文本语料。☆170Updated 4 years ago
- Word similarity computation based on Tongyici Cilin☆121Updated 8 years ago
- Chinese Subjective Dectection based on subjective knowlegebase, 中文主观性计算。基于中文主观性知识库的句子主观性评定方法。☆57Updated 2 years ago
- Event monitor based on online news corpus including event storyline and analysis,基于给定事件关键词,采集事件资讯,对事件进行挖掘和分析。☆153Updated 7 years ago
- 利用文本分析算法和Python脚本,自动纠正word中的英语单词拼写错误☆48Updated 7 years ago
- 一个简单易用的 Python 模块,用于通过字符串来操作日期/时间。正则时间提取,字符串时间解析,字符串时间提取。中文时间提取,一句话里面提取时间☆76Updated 3 weeks ago
- Code for chinese error detection module, using n-gram and bi-lstm☆135Updated 6 years ago
- 中文文本情感分类数据集分享 chinese sentiment datasets☆83Updated 5 years ago
- 人民日报语料处理工具集 | Tools for Corpus of People's Daily☆287Updated 2 years ago
- 中文语料库:包括情感词典 情感分析 文本分类 单轮对话 中文词典 知乎☆118Updated 7 years ago
- AC自动机python的实现,并进行了优化。 主要修复了 查询不准确的问题。☆77Updated 4 years ago
- Self complemented sentiment words expansion using seed sentiment words and so-pmi , this method is tested to be effective, 基于情感种子词与so-pmi…☆87Updated 7 years ago
- 微调预训练语言模型(BERT、Roberta、XLBert等),用于计算两个文本之间的相似度(通过句子对分类任务转换),适用于中文文本☆90Updated 5 years ago
- Bert分类,语义相似度,获取句向量。☆65Updated 8 months ago
- 专业领域词库构建/中文新词发现/专业词库发现☆30Updated 5 years ago
- 金庸小说人物关系图谱构建☆63Updated 6 years ago