mingruimingrui / ICU-tokenizerLinks
ICU based universal language tokenizer
☆33Updated 3 years ago
Alternatives and similar repositories for ICU-tokenizer
Users that are interested in ICU-tokenizer are comparing it to the libraries listed below
Sorting:
- ☆88Updated 3 years ago
- ☆36Updated 3 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020)☆121Updated 4 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE)☆153Updated 5 years ago
- Coreference resolution with different higher-order inference methods; implemented in PyTorch.☆36Updated 2 years ago
- X-BERT: eXtreme Multi-label Text Classification with BERT☆52Updated 6 years ago
- SUPERT: Unsupervised multi-document summarization evaluation & generation☆95Updated 2 years ago
- Repository for the paper "Named Entity Recognition for Entity Linking: What Works and What's Next" (EMNLP 2021).☆75Updated 3 years ago
- Generalizing Natural Language Analysis through Span-relation Representations☆91Updated last month
- The Definition Extraction From Text corpus and relevant formatting scripts☆81Updated 2 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆120Updated 4 years ago
- Pytorch implementation of Highly Parallel Autoregressive Entity Linking with Discriminative Correction☆67Updated 3 years ago
- A template for starting a new allennlp project using config files and `allennlp train`☆38Updated last year
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 3 years ago
- Fine-tuned Transformers compatible BERT models for Sequence Tagging☆40Updated 5 years ago
- a Fairseq fork for sequence tagging/labeling tasks☆31Updated 5 years ago
- code and data to faciliate BERT/ELECTRA for document ranking. Details refer to the paper - PARADE: Passage Representation Aggregation for…☆97Updated 2 years ago
- ☆60Updated last year
- Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations☆134Updated 2 months ago
- ☆92Updated 4 years ago
- A data explorer for DROP dataset☆22Updated 2 years ago
- This repo contains the code for ACL2020 paper "Coreference Resolution as Query-based Span Prediction"☆139Updated 5 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".☆64Updated 5 years ago
- CrossWeigh: Training Named Entity Tagger from Imperfect Annotations☆177Updated last year
- Code and resources for the paper "BERT-QE: Contextualized Query Expansion for Document Re-ranking".☆51Updated 4 years ago
- AAAI-20 paper: Cross-Lingual Natural Language Generation via Pre-Training☆129Updated 4 years ago
- Code and data for the paper "Soft Gazetteers for Low-resource Named Entity Recognition"☆19Updated 4 years ago
- This repo supports various cross-lingual transfer learning & multilingual NLP models.☆92Updated 2 years ago
- Multi-stage passage ranking: monoBERT + duoBERT☆112Updated 4 years ago
- Massively Multilingual Transfer for NER☆86Updated 4 years ago