This directory contains the training, test, and gold-standard data used in the 2nd International Chinese Word Segmentation Bakeoff. Also included is the script used to score the results submitted by the bakeoff participants and the simple segmenter used to generate the baseline and topline data.
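The description above does not say how the simple segmenter works. A common choice for this kind of baseline is greedy left-to-right maximum matching against a word list, and the sketch below assumes that approach. The function name `max_match`, the toy vocabulary, and the sample sentence are hypothetical illustrations, not the bakeoff's actual script or data.

```python
def max_match(text, vocab, max_len=None):
    """Greedy left-to-right maximum matching (assumed technique, not the bakeoff script).

    Characters that start no dictionary word are emitted as single-character words.
    """
    if max_len is None:
        # Longest dictionary entry bounds how far ahead we need to look.
        max_len = max((len(w) for w in vocab), default=1)
    words = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, then shrink until a dictionary
        # word matches or only one character is left.
        for j in range(min(len(text), i + max_len), i, -1):
            if text[i:j] in vocab or j == i + 1:
                words.append(text[i:j])
                i = j
                break
    return words


# Hypothetical toy vocabulary and input, for illustration only.
vocab = {"中文", "分词", "中文分词", "评测"}
segmented = max_match("中文分词评测很难", vocab)
print(" ".join(segmented))
```

Under the same assumption, a baseline run would take its word list from the training data and a topline run would take it from the gold-standard test data, so the two scores bracket what a dictionary-only method can achieve.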
☆67 · May 23, 2018 · Updated 7 years ago
Alternatives and similar repositories for icwb2-data
Users who are interested in icwb2-data compare it to the repositories listed below.
- Python NLP Reading Notebook by DUTIR Search Engine Group ☆18 · Sep 19, 2018 · Updated 7 years ago
- TensorFlow implementation of a Neural Attention Model for Abstractive Summarization. ☆10 · Jul 20, 2020 · Updated 5 years ago
- Code for Unsupervised multi-granular Chinese word segmentation and term discovery via graph partition [JBI] ☆16 · Jan 28, 2022 · Updated 4 years ago
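- My third project in NLP classes. ☆27 · Dec 8, 2017 · Updated 8 years ago
- Third-place solution and code for few-shot cross-class transfer event extraction in the financial domain ☆17 · Dec 23, 2020 · Updated 5 years ago
- 🤗 HF Downloader (Hugging Face Downloader) 📦 A user-friendly GUI tool for downloading Hugging Face resources with enhanced connectivity… ☆13 · Jan 5, 2025 · Updated last year
- Event extraction with BERT: the [CLS] token is used for event classification and the last-layer vectors for sequence labeling, with both tasks trained jointly. ☆13 · Jun 7, 2021 · Updated 4 years ago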
- Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word Segmentation ☆22 · Sep 18, 2020 · Updated 5 years ago
- A trie-based text editor with autocomplete suggestions, built with Python and PyQt. ☆10 · Sep 7, 2016 · Updated 9 years ago
- CCL2024 evaluation task: classical Chinese historical event type extraction (CHED2024) ☆19 · May 22, 2024 · Updated last year
- A Challenge on Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG), Co-located with SLT2024 FutureDial-RAG Challenge ☆11 · Aug 10, 2024 · Updated last year
- Chinese word segmentation algorithm that requires no corpus ☆500 · Sep 3, 2020 · Updated 5 years ago
- A Transformer seq2seq model that generates couplets. ☆17 · Feb 1, 2019 · Updated 7 years ago
- Example method for extracting triples from dependency-syntax relations ☆12 · May 30, 2017 · Updated 8 years ago
- Simple Solution for Multi-Criteria Chinese Word Segmentation ☆303 · Aug 12, 2020 · Updated 5 years ago
- Keyphrase Extraction from Scholarly Documents - Thesis ☆14 · Nov 3, 2021 · Updated 4 years ago
- TensorFlow: learn and practice ☆11 · Aug 30, 2018 · Updated 7 years ago
- Build and visualize a word2vec model on Sogou news data (SogouCS) ☆13 · Mar 3, 2018 · Updated 8 years ago
- New word discovery algorithm (NewWordDetection) ☆63 · Sep 4, 2017 · Updated 8 years ago
- Deploy an LSTM-based sentiment classification model on TensorFlow Serving ☆10 · Sep 13, 2018 · Updated 7 years ago
- Named entity recognition system using multi-stage CRF and statistical rules ☆12 · Oct 3, 2016 · Updated 9 years ago
- Word segmentation implemented with CRF++ in Python ☆37 · Jun 19, 2018 · Updated 7 years ago
- Text classification benchmark ☆25 · Mar 29, 2018 · Updated 7 years ago
- This package includes some extra functions to matplotlib. ☆11 · May 10, 2022 · Updated 3 years ago
- BiLSTM-CRF-based NER implemented with Keras in TensorFlow 2.0 ☆15 · Sep 5, 2020 · Updated 5 years ago
- Reuse of Su Jianlin's project, applied to financial event relation extraction ☆10 · Mar 26, 2021 · Updated 4 years ago
- Code and pre-trained models for the paper "Segatron: Segment-aware Transformer for Language Modeling and Understanding" ☆18 · Oct 25, 2022 · Updated 3 years ago
- Machine translation (cn2en) ☆15 · Nov 6, 2019 · Updated 6 years ago
- Incremental Learning the Hierarchical Softmax Function for Neural Language Models ☆11 · Dec 6, 2016 · Updated 9 years ago
- Performance comparison between Chinese word segmentation and part-of-speech tagging tools ☆59 · Jul 4, 2019 · Updated 6 years ago
- A modified gensim-fast2vec for flexible use of large-scale external word vectors (with OOV lookup support) ☆23 · Jun 3, 2019 · Updated 6 years ago
- Beam search for neural network sequence to sequence (encoder-decoder) models. ☆34 · Apr 4, 2019 · Updated 6 years ago
- repository of mine document ☆10 · Jun 3, 2023 · Updated 2 years ago
- Chinese word segmentation in tensorflow 2.x ☆23 · Mar 25, 2023 · Updated 2 years ago