mingruimingrui / ICU-tokenizer
ICU based universal language tokenizer
☆32Updated 3 years ago
Alternatives and similar repositories for ICU-tokenizer
Users that are interested in ICU-tokenizer are comparing it to the libraries listed below
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 3 years ago
- ☆87Updated 3 years ago
- Named Entity Recognition with Small Strongly Labeled and Large Weakly Labeled Data☆100Updated last year
- ☆36Updated 2 years ago
- Summary of Responses to Questionnaire on Annotation Platform https://forms.gle/iZk8kehkjAWmB8xe9☆59Updated 4 years ago
- [ACL 2020] Structure-Level Knowledge Distillation For Multilingual Sequence Labeling☆71Updated 2 years ago
- [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining☆117Updated last year
- ☆42Updated 4 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE)☆151Updated 4 years ago
- A text augmentation tool for named entity recognition.☆53Updated 3 years ago
- This is the repository for SemEval 2021 Task 4: Reading Comprehension of Abstract Meaning. It includes code for baseline models and data.☆29Updated 3 years ago
- PyTorch version of BERT-flow: BERT-flow can be applied to any PLM within the PyTorch framework.☆72Updated 3 years ago
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest…☆31Updated 4 years ago
- Fine-tuned Transformers compatible BERT models for Sequence Tagging☆40Updated 4 years ago
- PyTorch port of BERT ML model☆16Updated 6 years ago
- 🦮 Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrie…☆49Updated 3 years ago
- Unsupervised Domain Adaptation of Contextualized Embeddings for Sequence Labeling☆47Updated 5 years ago
- Named Entity Recognition as Dependency Parsing☆39Updated 4 years ago
- a Fairseq fork for sequence tagging/labeling tasks☆31Updated 5 years ago
- FlowDelta: Modeling Flow Information Gain in Reasoning for Conversational Machine Comprehension☆36Updated 2 years ago
- Source code for our "TitleStylist" paper at ACL 2020☆76Updated 11 months ago
- Kex is a python library for unsupervised keyword extraction from a document, providing an easy interface and benchmarks on 15 public data…☆54Updated 3 years ago
- ☆67Updated 3 years ago
- ☆92Updated 3 years ago
- Scalable Attentive Sentence-Pair Modeling via Distilled Sentence Embedding (AAAI 2020) - PyTorch Implementation☆32Updated last year
- SpanNER: Named Entity Re-/Recognition as Span Prediction☆131Updated 3 years ago
- ☆17Updated 5 years ago
- Self-supervised NER prototype - updated version (69 entity types - 17 broad entity groups). Uses pretrained BERT models with no fine tuni…☆78Updated 2 years ago
- Code and data to facilitate BERT/ELECTRA for document ranking. For details, refer to the paper - PARADE: Passage Representation Aggregation for…☆97Updated 2 years ago
- Selections from EMNLP 2020☆59Updated 4 years ago