toastynews / electra-hongkongeseLinks
Pre-trained ELECTRA from Hong Kong data
☆29Updated 5 years ago
Alternatives and similar repositories for electra-hongkongese
Users that are interested in electra-hongkongese are comparing it to the libraries listed below
Sorting:
- 粤语分词工具☆48Updated 7 years ago
- Dictionary for Cantonese word segmentation☆37Updated last year
- Codes for our paper "SentiLARE: Sentiment-Aware Language Representation Learning with Linguistic Knowledge" (EMNLP 2020)☆80Updated 4 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE)☆153Updated 5 years ago
- Add CRF or LSTM+CRF for huggingface transformers bert to perform better on NER task. It is very simple to use and very convenient to cust…☆62Updated 3 years ago
- Code for "A BERT-based Distractor Generation Scheme with Multi-tasking and Negative Answer Training Strategies."☆27Updated 3 years ago
- 🍳 NLPrep - dataset tool for many natural language processing task☆28Updated 4 years ago
- A CWN Python binding with graph structure☆35Updated 2 years ago
- ☆67Updated 3 years ago
- Implementation of Self-adjusting Dice Loss from "Dice Loss for Data-imbalanced NLP Tasks" paper☆109Updated 4 years ago
- Keywords: lexical diversity MTLD HDD vocabulary type token python☆17Updated 8 years ago
- EmbedRank implemented in Python.☆15Updated last year
- ☆75Updated 2 years ago
- 轉換好的 Albert 中文模型 (for pytorch-transformers)☆19Updated 5 years ago
- Code for the ACL 2020 paper 'tBERT: Topic Models and BERT Joining Forces for Semantic Similarity Detection'.☆143Updated 2 years ago
- Cantonese segmentation tool 粵語分詞工具☆30Updated 5 years ago
- Coherence-Aware Text Segmentation tool, used to perform text segmentation.☆29Updated 3 years ago
- ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET☆63Updated 3 years ago
- ☆120Updated 5 years ago
- Chinese Subjective Dectection based on subjective knowlegebase, 中文主观性计算。基于中文主观性知识库的句子主观性评定方法。☆57Updated 2 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020)☆121Updated 4 years ago
- Fine tuning bert for text generation☆37Updated 5 years ago
- Implementation of Nested Named Entity Recognition using BERT☆136Updated 4 years ago
- 中文生成式预训练模型☆99Updated 5 years ago
- Dataset and Source code of paper 'Enhancing Keyphrase Extraction from Academic Articles with their Reference Information'.☆18Updated 2 years ago
- XED multilingual emotion datasets☆64Updated 2 years ago
- Implementation of EMNLP2020 accepted paper: "TopicBERT: Topic-aware BERT for Efficient Document Classification"☆43Updated 4 years ago
- lasertagger-chinese;lasertagger中文学习案例,案例数据,注释,shell运行☆76Updated 2 years ago
- A Chinese Long Text Summarization Dataset☆74Updated 3 years ago
- Modify Chinese text, modified on LaserTagger Model. I name it "文本手术刀".目前,本项目实现了一个文本复述任务,用于NLP语料的数据增强。☆214Updated 2 years ago