toastynews / electra-hongkongese
Pre-trained ELECTRA from Hong Kong data
☆27Updated 4 years ago
Alternatives and similar repositories for electra-hongkongese:
Users who are interested in electra-hongkongese are comparing it to the repositories listed below
- Dictionary for Cantonese word segmentation☆34Updated 8 months ago
- fastText vectors created from Hong Kong data.☆21Updated 4 years ago
- COS960: A Chinese Word Similarity Dataset of 960 Word Pairs☆35Updated 5 years ago
- Cantonese word segmentation tool☆46Updated 6 years ago
- 🍳 NLPrep - dataset tool for many natural language processing tasks☆28Updated 3 years ago
- Scraped reviews from OpenRice for sentiment analysis. Formatted to use with BERT.☆10Updated 4 years ago
- Transformers for Cantonese☆56Updated 4 years ago
- Converted Chinese Albert model (for pytorch-transformers)☆18Updated 4 years ago
- Cantonese segmentation tool☆29Updated 4 years ago
- We start a company-name recognition task with small-scale, low-quality training data, then use techniques to enhance model training s…☆80Updated 4 years ago
- The enhanced version of ZEN, larger and more powerful.☆28Updated 2 years ago
- Chinese version code for the paper "EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks"☆11Updated 5 years ago
- ODSQA: Open-Domain Spoken Question Answering Dataset☆59Updated 2 years ago
- Codes for our paper "SentiLARE: Sentiment-Aware Language Representation Learning with Linguistic Knowledge" (EMNLP 2020)☆80Updated 3 years ago
- kenlm language model, with a Python REST service☆29Updated 6 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020)☆118Updated 4 years ago
- Summary of Responses to Questionnaire on Annotation Platform https://forms.gle/iZk8kehkjAWmB8xe9☆59Updated 4 years ago
- A Chinese Long Text Summarization Dataset☆68Updated 2 years ago
- Chinese subjectivity detection based on a subjectivity knowledge base: a method for rating sentence subjectivity using a Chinese subjectivity knowledge base.☆57Updated last year
- ☆66Updated 2 years ago
- Chinese generative pre-trained model☆98Updated 4 years ago
- A large, high-quality corpus of Chinese synonyms.☆41Updated 3 years ago
- Add CRF or LSTM+CRF for huggingface transformers bert to perform better on NER task. It is very simple to use and very convenient to cust…☆63Updated 3 years ago
- Fine-tuning BERT for text generation☆37Updated 5 years ago
- CCL2020: data release for the "Xiaoniu Cup" (小牛杯) humor computation task☆22Updated 5 months ago
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | Cantonese NLP☆86Updated 3 years ago
- The baseline model code for WMT 2021 Triangular MT☆13Updated 3 years ago
- Dataset for TALLIP2019 paper "Ancient-Modern Chinese Translation with a New Large Training Dataset"☆22Updated 2 years ago
- Spoken Cantonese from Hong Kong.☆29Updated 3 months ago
- Albert Large QA model trained on the Baidu webqa and dureader datasets☆75Updated 4 years ago