mingruimingrui / ICU-tokenizerLinks
ICU based universal language tokenizer
☆33Updated 4 years ago
Alternatives and similar repositories for ICU-tokenizer
Users that are interested in ICU-tokenizer are comparing it to the libraries listed below
Sorting:
- Language-agnostic BERT Sentence Embedding (LaBSE)☆153Updated 5 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020)☆121Updated 4 years ago
- X-BERT: eXtreme Multi-label Text Classification with BERT☆52Updated 6 years ago
- ☆36Updated 3 years ago
- BERTserini☆27Updated 3 years ago
- code and data to faciliate BERT/ELECTRA for document ranking. Details refer to the paper - PARADE: Passage Representation Aggregation for…☆96Updated 2 years ago
- A library to conduct ranking experiments with transformers.☆160Updated 2 years ago
- The official code of the "Frustratingly Easy System Combination for Grammatical Error Correction" paper☆57Updated last year
- Scripts to preprocess training and test data and to run fast_align and giza☆107Updated 4 years ago
- Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations☆134Updated 5 months ago
- ☆92Updated 4 years ago
- SpanNER: Named EntityRe-/Recognition as Span Prediction☆131Updated 3 years ago
- [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining☆118Updated 2 years ago
- SUPERT: Unsupervised multi-document summarization evaluation & generation☆96Updated 3 years ago
- ☆88Updated 4 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆120Updated 4 years ago
- AAAI-20 paper: Cross-Lingual Natural Language Generation via Pre-Training☆129Updated 4 years ago
- source code for paper: WhiteningBERT: An Easy Unsupervised Sentence Embedding Approach.☆57Updated 4 years ago
- Dataset and baseline for ACL 2019 paper "XQA: A Cross-lingual Open-domain Question Answering Dataset"☆89Updated 4 years ago
- Repository for the paper "Named Entity Recognition for Entity Linking: What Works and What's Next" (EMNLP 2021).☆75Updated 3 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks☆195Updated 4 months ago
- source code of bison☆26Updated 5 years ago
- Resources for the "CTRLsum: Towards Generic Controllable Text Summarization" paper☆147Updated 8 months ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 4 years ago
- ☆61Updated last year
- Code for A Hierarchical Model for Data-to-Text Generation (Rebuffel, Soulier, Scoutheeten, Gallinari; ECIR 2020)☆81Updated 2 years ago
- ☆42Updated 5 years ago
- Multi-stage passage ranking: monoBERT + duoBERT☆110Updated 5 years ago
- Pytorch-version BERT-flow: One can apply BERT-flow to any PLM within Pytorch framework.☆71Updated 4 years ago
- ☆162Updated 5 years ago