This directory contains the training, test, and gold-standard data used in the 2nd International Chinese Word Segmentation Bakeoff. Also included is the script used to score the results submitted by the bakeoff participants and the simple segmenter used to generate the baseline and topline data.
☆67May 23, 2018Updated 8 years ago
Alternatives and similar repositories for icwb2-data
Users that are interested in icwb2-data are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python NLP Reading Notebook By DUTIR Searh Engine Group☆18Sep 19, 2018Updated 7 years ago
- Tensorflow implementation of a Neural Attention Model for Abstractive Summarization.☆10Jul 20, 2020Updated 5 years ago
- Code for Unsupervised multi-granular Chinese word segmentation and term discovery via graph partition [JBI]☆16Jan 28, 2022Updated 4 years ago
- ☆21Jun 26, 2025Updated 11 months ago
- 🤗 HF Downloader (Hugging Face Downloader) 📦 A user-friendly GUI tool for downloading Hugging Face resources with enhanced connectivity…☆13Jan 5, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- My third project in NLP classes.☆27Dec 8, 2017Updated 8 years ago
- 采用bert进行事件抽取,[cls]进行事件分类,最后一层向量进行序列标注,两个任务同时训练。☆12Jun 7, 2021Updated 5 years ago
- ☆14Apr 6, 2025Updated last year
- ☆16Jun 19, 2020Updated 5 years ago
- ☆15Mar 31, 2021Updated 5 years ago
- 流浪地球影评数据分析☆10Feb 10, 2019Updated 7 years ago
- 一个基于trie树的具有联想功能的文本编辑器。采用python和pyqt☆10Sep 7, 2016Updated 9 years ago
- Topic Detection and Tracking☆19Apr 21, 2015Updated 11 years ago
- OCNLI: 中文原版自然语言推理任务☆166Sep 23, 2021Updated 4 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- A Challenge on Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG), Co-located with SLT2024 FutureDial-RAG Challenge☆11Aug 10, 2024Updated last year
- Chinese word segmentation algorithm without corpus(无需语料库的中文分词)☆499Sep 3, 2020Updated 5 years ago
- 依存句法关系之三元组提取方法示例☆12May 30, 2017Updated 9 years ago
- Simple Solution for Multi-Criteria Chinese Word Segmentation☆302Aug 12, 2020Updated 5 years ago
- Keyphrase Extraction from Scholarly Documents - Thesis☆14Nov 3, 2021Updated 4 years ago
- word2vec wordembedding embedding google☆12Aug 15, 2017Updated 8 years ago
- Chinese Word Segmentation task based on BERT and implemented in Pytorch☆14Aug 14, 2020Updated 5 years ago
- ☆97May 6, 2026Updated last month
- Prototype implementation of an architecture suggested in Robot Dream paper (http://arxiv.org/abs/1603.03007)☆12Jul 3, 2019Updated 6 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- 基于 Bi-LSTM 和 CRF 的中文语义角色标注☆88Jun 4, 2019Updated 7 years ago
- Build and visualize the word2vec model on sogou news data(SogouCS)☆13Mar 3, 2018Updated 8 years ago
- 新词发现算法(NewWordDetection)☆63Sep 4, 2017Updated 8 years ago
- ☆17Nov 23, 2021Updated 4 years ago
- Named entity recognition system using multi-stage CRF and statistical rules☆11Oct 3, 2016Updated 9 years ago
- python CRF++实现分词☆37Jun 19, 2018Updated 7 years ago
- 使用TensorFlow2.0中的Keras实现基于BiLSTM-CRF的NER☆15Sep 5, 2020Updated 5 years ago
- 文本分类基准测试☆25Mar 29, 2018Updated 8 years ago
- ☆14Feb 23, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 爬取中国所有省份办公厅公文数据。Crawler for all Policy text of all provinces in China☆24Dec 27, 2020Updated 5 years ago
- Graph-based Multi-sentence Compression Implementation☆30Apr 21, 2026Updated last month
- 一个 安卓 苹果 平台的 吉林大学教务系统 课表 成绩 考试时间 显示前端☆26Jan 13, 2026Updated 4 months ago
- codes and pre-trained models of paper "Segatron: Segment-aware Transformer for Language Modeling and Understanding"☆18Oct 25, 2022Updated 3 years ago
- 本项目使用Keras实现Transformer模型来进行文本分类(中文、英文均支持)。☆12Mar 31, 2022Updated 4 years ago
- Python 6,113 Updated 9 days ago MLiA_SourceCode 机器学习实战----十大经典算法☆13Jan 14, 2019Updated 7 years ago
- Machine Translation(cn2en)☆15Nov 6, 2019Updated 6 years ago