mingruimingrui / ICU-tokenizerLinks
ICU based universal language tokenizer
☆33Updated 3 years ago
Alternatives and similar repositories for ICU-tokenizer
Users that are interested in ICU-tokenizer are comparing it to the libraries listed below
Sorting:
- Language-agnostic BERT Sentence Embedding (LaBSE)☆153Updated 5 years ago
- ☆36Updated 3 years ago
- X-BERT: eXtreme Multi-label Text Classification with BERT☆52Updated 6 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020)☆121Updated 4 years ago
- ☆87Updated 3 years ago
- Pytorch-version BERT-flow: One can apply BERT-flow to any PLM within Pytorch framework.☆72Updated 4 years ago
- code and data to faciliate BERT/ELECTRA for document ranking. Details refer to the paper - PARADE: Passage Representation Aggregation for…☆97Updated 2 years ago
- ☆92Updated 4 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 3 years ago
- This repo contains the code for ACL2020 paper "Coreference Resolution as Query-based Span Prediction"☆139Updated 5 years ago
- SpanNER: Named EntityRe-/Recognition as Span Prediction☆130Updated 3 years ago
- Source code for EMNLP 2020 paper "Coreferential Reasoning Learning for Language Representation"☆68Updated 3 years ago
- ☆61Updated 2 years ago
- BERTserini☆26Updated 2 years ago
- Coreference resolution with different higher-order inference methods; implemented in PyTorch.☆36Updated 2 years ago
- ☆18Updated 4 years ago
- SUPERT: Unsupervised multi-document summarization evaluation & generation☆95Updated 2 years ago
- a Fairseq fork for sequence tagging/labeling tasks☆31Updated 5 years ago
- [ACL 2020] Structure-Level Knowledge Distillation For Multilingual Sequence Labeling☆72Updated 2 years ago
- Tower Parse: Low-Resource Dependency Parsing via Hierarchical Source Selection☆15Updated 4 years ago
- The Definition Extraction From Text corpus and relevant formatting scripts☆81Updated 2 years ago
- Generalizing Natural Language Analysis through Span-relation Representations☆91Updated 2 weeks ago
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning☆104Updated 4 years ago
- This repo supports various cross-lingual transfer learning & multilingual NLP models.☆92Updated 2 years ago
- ☆46Updated 4 years ago
- A template for starting a new allennlp project using config files and `allennlp train`☆38Updated last year
- Dataset and baseline for ACL 2019 paper "XQA: A Cross-lingual Open-domain Question Answering Dataset"☆90Updated 3 years ago
- [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining☆117Updated 2 years ago
- GMEG☆30Updated 10 months ago
- ☆16Updated 4 years ago