Mining synonyms from unstructured and semi-structured data
☆250Dec 3, 2024Updated last year
Alternatives and similar repositories for synonym_detection
Users that are interested in synonym_detection are comparing it to the libraries listed below
- Code & Data for our Paper "PATTERN-BASED CHINESE HYPERNYM-HYPONYM RELATION EXTRACTION METHOD"☆12Jan 29, 2020Updated 6 years ago
- Extract Chinese synonyms from a large corpus☆11Jul 20, 2016Updated 9 years ago
- Entity Synonym Discovery via Multipiece Bilateral Context Matching (IJCAI'20) https://arxiv.org/abs/1901.00056☆31Mar 24, 2023Updated 2 years ago
- PyTorch implementation of paper "Mining Entity Synonyms with Efficient Neural Set Generation" in AAAI 2019☆67Nov 26, 2021Updated 4 years ago
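- Word similarity computation method based on the HIT Tongyici Cilin (Extended) thesaurus☆373May 11, 2023Updated 2 years ago
- Chinese near-synonym list (Chinese Synonyms)☆263Jan 20, 2018Updated 8 years ago
- Chinese synonyms: a toolkit for chatbots and intelligent question answering☆5,106Feb 1, 2026Updated last month
- Word similarity computation method combining Tongyici Cilin (Extended) and HowNet, with broader vocabulary coverage and more accurate results.☆744Feb 16, 2022Updated 4 years ago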
- ☆15May 29, 2021Updated 4 years ago
- Chinese Synonyms: a toolkit for looking up Chinese synonyms☆18Oct 9, 2022Updated 3 years ago
- baidu aistudio event extraction competition☆224Mar 24, 2023Updated 2 years ago
- Word similarity computation based on Tongyici Cilin☆122Jun 27, 2017Updated 8 years ago
- Synonym list, antonym list, and negation word list☆542Oct 17, 2024Updated last year
- Synonym expansion☆27Feb 16, 2016Updated 10 years ago
- Train Wikidata with word2vec for word embedding tasks☆123Jul 3, 2018Updated 7 years ago
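- ChineseSemanticKB, a Chinese semantic knowledge base: 12 categories of commonly used semantic dictionaries at million-entry scale for Chinese language processing, including a 340,000-entry abstract semantics library, a 340,000-entry antonym library, and a 430,000-entry synonym library, supporting applications such as sentence expansion, rewriting, and event abstraction and generalization.☆779Mar 17, 2023Updated 2 years ago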
- Source Code for KDD'19 paper "SurfCon: Synonym Discovery on Privacy-Aware Clinical Data"☆10Apr 10, 2020Updated 5 years ago
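- HyponymyExtraction and Graph based on KB Schema, Baike-kb and online text extract: hypernym-hyponym extraction and visualization for words, based on a knowledge concept schema, an encyclopedia knowledge base, and structured online search results☆171Oct 6, 2018Updated 7 years ago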
- Chinese Synonym Library☆123Apr 10, 2018Updated 7 years ago
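- TensorFlow versions of common text matching models, using the QA_corpus dataset; continuously updated☆674Oct 12, 2019Updated 6 years ago
- Modify Chinese text, based on a modified LaserTagger model. I name it "文本手术刀" ("text scalpel"). Currently, this project implements a text paraphrasing task for data augmentation of NLP corpora.☆214Mar 24, 2023Updated 2 years ago
- pycorrector is a toolkit for text error correction. It applies models such as Kenlm, T5, MacBERT, ChatGLM3, and Qwen2.5 to error correction and works out of the box.☆6,374Jan 12, 2026Updated last month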
- Watset: Automatic Induction of Synsets from a Graph of Synonyms☆16Jul 7, 2019Updated 6 years ago
- Keras implementation of transformers for humans☆5,421Nov 11, 2024Updated last year
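- A collection of high-quality Chinese pre-trained models: state-of-the-art large models, the fastest small models, and specialized similarity models☆816Jul 8, 2020Updated 5 years ago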
- An attempt at replicating the Induction Network for FewRel data in Tensorflow☆178Nov 1, 2019Updated 6 years ago
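- New word discovery using mutual information and left/right entropy, implemented in Python 3☆593Aug 1, 2019Updated 6 years ago
- 100+ Chinese Word Vectors: over a hundred sets of pre-trained Chinese word vectors☆12,183Oct 30, 2023Updated 2 years ago
- Explains how to use pycrfsuite through examples☆10Nov 7, 2016Updated 9 years ago
- Generates the large amounts of data needed by text error correction models (such as Gector).☆14Jan 5, 2023Updated 3 years ago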
- ☆25May 4, 2022Updated 3 years ago
- Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)☆10,175Jul 15, 2025Updated 7 months ago
- Entity and Relation Extraction Based on TensorFlow and BERT. Pipeline-style entity and relation extraction based on TensorFlow and BERT; a solution to the information extraction task of the 2019 Language and Intelligence Technology Competition. Schema based Knowledge …☆1,230Jun 1, 2020Updated 5 years ago
- A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, large-scale Chinese pre-trained ALBERT models☆3,984Nov 21, 2022Updated 3 years ago
- Hearst Patterns Revisited: Automatic Hypernym Detection from Large Text Corpora☆153Aug 31, 2021Updated 4 years ago
- An implementation of the Watset clustering algorithm in Java.☆30Dec 9, 2022Updated 3 years ago
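- Baidu's open-source dependency parsing system☆1,003Feb 5, 2023Updated 3 years ago
- Solution that won the Technical Innovation Award in the CCKS 2019 Chinese short-text entity linking competition☆412Mar 24, 2023Updated 2 years ago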
- transform multi-label classification as sentence pair task, with more training data and information☆178Dec 13, 2019Updated 6 years ago