Automated Phrase Mining from Massive Text Corpora in Python.
☆175 · May 23, 2021 · Updated 4 years ago
Alternatives and similar repositories for AutoPhraseX
Users interested in AutoPhraseX are comparing it to the libraries listed below.
- AutoPhrase: Automated Phrase Mining from Massive Text Corpora ☆1,201 · Jan 27, 2022 · Updated 4 years ago
- ☆10 · Jan 7, 2020 · Updated 6 years ago
- Faster and more effective Chinese new word discovery ☆513 · Mar 15, 2024 · Updated last year
- Keyphrase or Keyword Extraction: a Chinese keyword extraction method based on pre-trained models (paper: SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained La… ☆433 · May 17, 2020 · Updated 5 years ago
- An easy-to-use tool for phrase encoding and topic mining (unsupervised aspect extraction); Code base for ACL 2022 paper, UCTopic: Unsuper… ☆48 · Apr 25, 2023 · Updated 2 years ago
- Knowledge Graph ☆176 · Aug 19, 2022 · Updated 3 years ago
- Chinese text error correction ☆91 · Mar 7, 2022 · Updated 3 years ago
- ☆11 · Nov 16, 2022 · Updated 3 years ago
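- New word discovery using mutual information and left/right entropy, implemented in Python 3 ☆593 · Aug 1, 2019 · Updated 6 years ago
- ccks baidu entity link: entity linking, first place ☆843 · Dec 19, 2023 · Updated 2 years ago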
- Let ChatGPT (Large Language Models) Serve As Data Annotator and Zero-shot/few-shot Information Extractor. ☆32 · Mar 18, 2023 · Updated 2 years ago
- This project provides an unsupervised framework for mining and tagging quality phrases on text corpora with pretrained language models (K… ☆174 · Feb 3, 2023 · Updated 3 years ago
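- Text mining and preprocessing tools (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, etc.), using unsupervised or weakly supervised methods ☆2,599 · May 13, 2024 · Updated last year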
- ☆448 · Oct 26, 2022 · Updated 3 years ago
- Learning Named Entity Tagger from Domain-Specific Dictionary ☆485 · Oct 5, 2019 · Updated 6 years ago
- Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer ☆541 · Dec 10, 2021 · Updated 4 years ago
- Open Language Pre-trained Model Zoo ☆1,005 · Nov 18, 2021 · Updated 4 years ago
- Focused on explainable NLP techniques. An NLP Toolset With A Focus on Explainable Inference ☆622 · Feb 3, 2021 · Updated 5 years ago
- ☆124 · Feb 3, 2019 · Updated 7 years ago
- ☆266 · Oct 29, 2020 · Updated 5 years ago
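- Baidu's open-source dependency parsing system ☆1,003 · Feb 5, 2023 · Updated 3 years ago
- One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpcda ☆1,879 · Mar 18, 2025 · Updated 11 months ago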
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo ☆3,106 · May 9, 2024 · Updated last year
- Negative sampling for solving the unlabeled entity problem in NER. ICLR-2021 paper: Empirical Analysis of Unlabeled Entity Problem in Nam… ☆134 · Feb 26, 2022 · Updated 4 years ago
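- HyponymyExtraction and Graph based on KB Schema, Baike-kb and online text extract: hyponym/hypernym extraction and visualization based on a knowledge concept system, an encyclopedia knowledge base, and structured online search ☆171 · Oct 6, 2018 · Updated 7 years ago
- Modify Chinese text, modified on LaserTagger Model. Text paraphrasing: Chinese text data augmentation based on LaserTagger. ☆322 · Jan 3, 2024 · Updated 2 years ago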
- Reject complicated operations for incorporating lexicon for Chinese NER. ☆437 · Jan 22, 2022 · Updated 4 years ago
- DeepIE: Deep Learning for Information Extraction ☆1,943 · Dec 9, 2022 · Updated 3 years ago
- Core Data of HowNet and OpenHowNet Python API ☆635 · Dec 16, 2021 · Updated 4 years ago
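- CCKS financial event subject extraction ☆74 · Oct 21, 2020 · Updated 5 years ago
- An implementation of the EDA paper for Chinese corpora. EDA data augmentation tool for Chinese corpora. NLP data augmentation. Paper reading notes. ☆1,386 · May 31, 2022 · Updated 3 years ago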
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821 ☆3,641 · Oct 16, 2024 · Updated last year
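- 100+ Chinese Word Vectors: over a hundred kinds of pre-trained Chinese word vectors ☆12,183 · Oct 30, 2023 · Updated 2 years ago
- Search all Chinese NLP datasets, with commonly used English NLP datasets included ☆4,419 · Nov 21, 2022 · Updated 3 years ago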
- keras implement of transformers for humans ☆5,421 · Nov 11, 2024 · Updated last year
- Source code for AAAI 2022 paper: Unified Named Entity Recognition as Word-Word Relation Classification ☆551 · Jul 14, 2022 · Updated 3 years ago
- NLP for human. A fast and easy-to-use natural language processing (NLP) toolkit, satisfying your imagination about NLP. ☆286 · Dec 8, 2022 · Updated 3 years ago
- [EMNLP 2020] Text Classification Using Label Names Only: A Language Model Self-Training Approach ☆299 · Feb 2, 2022 · Updated 4 years ago
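- ChineseSemanticKB, a Chinese semantic knowledge base: 12 categories of million-scale common semantic dictionaries for Chinese processing, including a 340,000-entry abstract semantics library, a 340,000-entry antonym library, a 430,000-entry synonym library, and more, supporting application scenarios such as sentence expansion, rewriting, and event abstraction and generalization. ☆779 · Mar 17, 2023 · Updated 2 years ago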