yuikns / icwb2-dataView external linksLinks
This directory contains the training, test, and gold-standard data used in the 2nd International Chinese Word Segmentation Bakeoff. Also included is the script used to score the results submitted by the bakeoff participants and the simple segmenter used to generate the baseline and topline data.
☆67May 23, 2018Updated 7 years ago
Alternatives and similar repositories for icwb2-data
Users that are interested in icwb2-data are comparing it to the libraries listed below
Sorting:
- Tensorflow implementation of a Neural Attention Model for Abstractive Summarization.☆10Jul 20, 2020Updated 5 years ago
- ☆16Jun 26, 2025Updated 7 months ago
- ☆16Jun 19, 2020Updated 5 years ago
- 采用bert进行事件抽取,[cls]进行事件分类,最后一层向量进行序列标注,两个任务同时训练。☆13Jun 7, 2021Updated 4 years ago
- Code for CascadeBERT, Findings of EMNLP 2021☆12Mar 30, 2022Updated 3 years ago
- Code for Unsupervised multi-granular Chinese word segmentation and term discovery via graph partition [JBI]☆16Jan 28, 2022Updated 4 years ago
- A transformer seq2seq model to generate couplets. 一个写对联的 Transformer 序列到序列模型。☆17Feb 1, 2019Updated 7 years ago
- CCL2024评测任务-古文历史事件类型抽取评测(CHED2024)☆18May 22, 2024Updated last year
- Python NLP Reading Notebook By DUTIR Searh Engine Group☆18Sep 19, 2018Updated 7 years ago
- codes and pre-trained models of paper "Segatron: Segment-aware Transformer for Language Modeling and Understanding"☆18Oct 25, 2022Updated 3 years ago
- Topic Detection and Tracking☆19Apr 21, 2015Updated 10 years ago
- Implement en-fr translation task by implenting seq2seq, encoder-decoder in RNN layers with Attention mechanism and Beamsearch inference d…☆21Feb 14, 2018Updated 8 years ago
- Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word Segmentation☆22Sep 18, 2020Updated 5 years ago
- ☆96Nov 12, 2025Updated 3 months ago
- 基于 Bi-LSTM 和 CRF 的中文语义角色标注☆88Jun 4, 2019Updated 6 years ago
- 基于LDA和TextRank的关键子提取算法实现☆23Aug 11, 2017Updated 8 years ago
- Chinese word segmentation in tensorflow 2.x☆23Mar 25, 2023Updated 2 years ago
- pytorch Efficient GlobalPointer☆56Apr 12, 2022Updated 3 years ago
- ☆55Apr 7, 2022Updated 3 years ago
- Code for ACL 2021 paper: Accelerating BERT Inference for Sequence Labeling via Early-Exit☆28Aug 19, 2022Updated 3 years ago
- 文本分类基准测试☆25Mar 29, 2018Updated 7 years ago
- [CCKS2022 ] Multimodal Event Detection and Argument Extraction.☆31Dec 4, 2022Updated 3 years ago
- 基于Pytorch+BERT+CRF的NLP序列标注模型,目前包括分词,词性标注,命名实体识别等☆62Dec 8, 2022Updated 3 years ago
- My third project in NLP classes.☆27Dec 8, 2017Updated 8 years ago
- A Chinese Cloze-style RC Dataset: People's Daily & Children's Fairy Tale (CFT)☆175Mar 26, 2019Updated 6 years ago
- 新词发现 基于词频、凝聚系数和左右邻接信息熵☆122Mar 14, 2020Updated 5 years ago
- NTK scaled version of ALiBi position encoding in Transformer.☆69Aug 16, 2023Updated 2 years ago
- 新词发现算法(NewWordDetection)☆63Sep 4, 2017Updated 8 years ago
- The very easy BERT pretrain process by using tokenizers and transformers repos☆32Feb 27, 2020Updated 5 years ago
- ☆29May 30, 2019Updated 6 years ago
- Code accompanying End-to-End Information Extraction without Token-Level Supervision☆37Jul 14, 2017Updated 8 years ago
- Chrome extension for massive add to cart for kobo.com☆10Sep 7, 2020Updated 5 years ago
- Official repository for ACM Multimedia'23 paper "MATK: The Meme Analytical Tool Kit"☆13May 29, 2024Updated last year
- Adversarial Transfer Learning for Chinese Named Entity Recognition with Self-Attention Mechanism☆202Oct 29, 2018Updated 7 years ago
- Graph Based Multi-sentences Compression Algorithm.☆31Oct 15, 2017Updated 8 years ago
- A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018)☆448Jun 15, 2022Updated 3 years ago
- A PyTorch implementation of a BiLSTM \ BERT \ Roberta (+ BiLSTM + CRF) model for Chinese Word Segmentation (中文分词) .☆217Jul 28, 2022Updated 3 years ago
- Salesforce + Elastic Stack connector☆10Feb 5, 2025Updated last year
- 【python】利用百度语音识别API,百度语音合成API,图灵机器人API实现简单的对话机器人☆10Mar 13, 2021Updated 4 years ago