Maryam-Nasseri / LCA-AW-Lexical-Complexity-Analyzer-for-Academic-WritingLinks
LCA-AW (Lexical Complexity Analyzer for Academic Writing, Nasseri and Lu, 2019); version 2.1. This code is a modified version of the LCA (lexical complexity analyzer, described in Lu, 2012). The modified version integrated the BAWE (British Academic Written English) corpus' word list, the bawe_list.txt, that is a list of most frequently-used aca…
☆10Updated 5 years ago
Alternatives and similar repositories for LCA-AW-Lexical-Complexity-Analyzer-for-Academic-Writing
Users that are interested in LCA-AW-Lexical-Complexity-Analyzer-for-Academic-Writing are comparing it to the libraries listed below
Sorting:
- Dictionary for Cantonese word segmentation☆38Updated last year
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages☆54Updated 4 months ago
- L2SCA & LCA fork: cross-platform, GUI, without Java dependency☆41Updated 9 months ago
- Conversion of UD_Chinese-GSD to simplified Chinese characters.☆38Updated last month
- This is a code example repo for the NLP course offered by the Institute of Chinese Information Processing of BNU.☆50Updated 8 months ago
- Pre-trained ELECTRA from Hong Kong data☆29Updated 5 years ago
- Keywords: lexical diversity MTLD HDD vocabulary type token python☆17Updated 8 years ago
- 粤语分词工具☆48Updated 7 years ago
- the English Language Learner Insight, Proficiency and Skills Evaluation (ELLIPSE) Corpus☆22Updated last month
- An English lexical database from the Big 🍎, let's go Mets baby love da Mets☆18Updated 3 weeks ago
- ☆29Updated last month
- AlphaReadabilityChinese is a tool that calculates the readability of Chinese texts, which includes indices at lexical, syntactic, and sem…☆37Updated last year
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP☆93Updated 4 years ago
- The spoken L1 corpus represents present-day spoken Chinese (Putonghua) used in mainland China, which is designed as a comparable corpus t…☆22Updated 4 years ago
- 中文分词软件基准测试 | Chinese tokenizer benchmark☆25Updated 7 years ago
- CINO: Pre-trained Language Models for Chinese Minority (少数民族语言预训练模型)☆258Updated 5 months ago
- AMI Meeting Parallel Corpus☆11Updated 5 years ago
- A large high-quality corpus of Chinese synonyms 一个大型、高质量的中文同义词语料库。☆68Updated 4 years ago
- Cantonese segmentation tool 粵語分詞工具☆30Updated 5 years ago
- 🈵 Collected resources to learn/study Manchu (Manchurian Language). 满语滿族満州語入門。☆19Updated 2 years ago
- fastText vectors created from Hong Kong data.☆22Updated 5 years ago
- Code and data for the COLING 2020 paper "Try to Substitute: An Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet"☆14Updated 5 years ago
- 渊 - A project for Classical Chinese☆110Updated 3 years ago
- Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).☆82Updated 2 months ago
- Converts between traditional and simplified Chinese☆32Updated last year
- 一个面向繁体中文古籍分词的python工具包☆36Updated 4 years ago
- Estimate the phonetic distance between Chinese words and get similar sounding candidate words.☆38Updated 3 months ago
- A toolset for computation and comparison of Chinese dialects☆45Updated last month
- The official tool for creating proceedings for conferences of the Association for Computational Linguistics (ACL).☆242Updated 8 months ago
- SikuBERT:四库全书的预训练语言模型(四库BERT) Pre-training Model of Siku Quanshu☆150Updated 2 years ago