mingruimingrui / ICU-tokenizerLinks
ICU based universal language tokenizer
☆33Updated 3 years ago
Alternatives and similar repositories for ICU-tokenizer
Users that are interested in ICU-tokenizer are comparing it to the libraries listed below
Sorting:
- CharBERT: Character-aware Pre-trained Language Model (COLING2020)☆121Updated 4 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE)☆153Updated 5 years ago
- SpanNER: Named EntityRe-/Recognition as Span Prediction☆131Updated 3 years ago
- [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining☆119Updated 2 years ago
- ☆36Updated 3 years ago
- BERTserini☆26Updated 3 years ago
- ☆92Updated 4 years ago
- Self-supervised NER prototype - updated version (69 entity types - 17 broad entity groups). Uses pretrained BERT models with no fine tuni…☆78Updated 3 years ago
- This repo contains the code for ACL2020 paper "Coreference Resolution as Query-based Span Prediction"☆139Updated 5 years ago
- Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data☆100Updated 2 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆120Updated 4 years ago
- This project provides an unsupervised framework for mining and tagging quality phrases on text corpora with pretrained language models (K…☆174Updated 2 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 4 years ago
- ☆88Updated 4 years ago
- Pytorch-version BERT-flow: One can apply BERT-flow to any PLM within Pytorch framework.☆72Updated 4 years ago
- X-BERT: eXtreme Multi-label Text Classification with BERT☆52Updated 6 years ago
- ☆57Updated 2 years ago
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest…☆31Updated 5 years ago
- ☆45Updated 4 years ago
- ☆67Updated 4 years ago
- Generalizing Natural Language Analysis through Span-relation Representations☆91Updated 2 months ago
- Implementation of pQRNN in PyTorch☆46Updated 4 years ago
- AAAI-20 paper: Cross-Lingual Natural Language Generation via Pre-Training☆129Updated 4 years ago
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning☆104Updated 4 years ago
- Implementation of Self-adjusting Dice Loss from "Dice Loss for Data-imbalanced NLP Tasks" paper☆109Updated 5 years ago
- Dataset for NAACL 2021 paper: "DART: Open-Domain Structured Data Record to Text Generation"☆155Updated 3 years ago
- Repository for the paper "Named Entity Recognition for Entity Linking: What Works and What's Next" (EMNLP 2021).☆75Updated 3 years ago
- SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples☆76Updated 3 years ago
- Kex is a python library for unsupervised keyword extraction from a document, providing an easy interface and benchmarks on 15 public data…☆54Updated 3 years ago
- The official code of the "Frustratingly Easy System Combination for Grammatical Error Correction" paper☆56Updated last year