CIRCSE / LT4HALA
<u><a href="https://circse.github.io/LT4HALA/" style="color: white">Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA)</a></u>
☆33Updated 8 months ago
Alternatives and similar repositories for LT4HALA:
Users that are interested in LT4HALA are comparing it to the libraries listed below
- 一个 面向繁体中文古籍分词的python工具包☆32Updated 3 years ago
- 古文语言理解测评基准 Classical Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard☆47Updated last year
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆15Updated 3 months ago
- ☆17Updated 7 years ago
- ☆28Updated 3 months ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆65Updated 3 months ago
- Evaluation of Natural Language Processing (NLP) tools for the Ancient Chinese language☆35Updated this week
- An evaluation bentchmark for classical Chinese☆12Updated last year
- ☆33Updated 2 years ago
- CCL 2023 古汉语通假字语料库的构建及应用研究:通假字资源库☆12Updated last year
- A Benchmark for Classical Chinese Based on a Crowdsourcing System.☆55Updated 3 years ago
- Dataset for TALLIP2019 paper "Ancient-Modern Chinese Translation with a New Large Training Dataset"☆22Updated 2 years ago
- Ancient Chinese Corpus with Word Sense Annotation☆46Updated 8 months ago
- Code and data for the paper "Time-Aware Ancient Chinese Text Translation and Inference" (LChange'21).☆7Updated 3 years ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆13Updated last year
- SikuBERT:四库全书的预训练语言模型(四库BERT) Pre-training Model of Siku Quanshu☆121Updated last year
- Yet Another Chinese Learner Corpus☆77Updated 3 years ago
- ☆25Updated last year
- A dataset and baselines for CLS.☆11Updated 2 years ago
- Chinese AMR Corpus☆35Updated last month
- The spoken L1 corpus represents present-day spoken Chinese (Putonghua) used in mainland China, which is designed as a comparable corpus t…☆18Updated 3 years ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆17Updated 3 years ago
- Code for "MELM: Data Augmentation with Masked Entity Language Modeling for Low-Resource NER"☆47Updated 2 years ago
- Code and data for the COLING 2020 paper "Try to Substitute: An Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet"☆13Updated 4 years ago
- ☆12Updated 2 years ago
- An open-source classical Chinese information processing toolkit developed by Tsinghua Natural Language Processing Group☆48Updated 6 years ago
- The code for EMNLP2022 paper "Improved grammatical error correction by ranking elementary edits"☆19Updated 2 years ago
- This repo contains the code for ACL2020 paper "Coreference Resolution as Query-based Span Prediction"☆139Updated 4 years ago
- 使用LSTM进行端到端的语义角色标注(theano)☆53Updated 5 years ago
- ☆46Updated 3 years ago