mingruimingrui / ICU-tokenizer
ICU-based universal language tokenizer
☆31 Updated 3 years ago
Alternatives and similar repositories for ICU-tokenizer:
Users who are interested in ICU-tokenizer are comparing it to the libraries listed below.
- ☆87 Updated 3 years ago
- NoiseMix - data generation for natural language ☆40 Updated 6 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE) ☆151 Updated 4 years ago
- ☆46 Updated 3 years ago
- a Fairseq fork for sequence tagging/labeling tasks ☆31 Updated 4 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering ☆38 Updated 3 years ago
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest… ☆31 Updated 4 years ago
- Implementation of Nested Named Entity Recognition using Flair ☆24 Updated 3 years ago
- SlotRefine: A Fast Non-Autoregressive Model for Joint Intent Detection and Slot Filling ☆48 Updated 3 years ago
- A Multi-subject High School Examinations Dataset for Cross-lingual and Multilingual Question Answering ☆44 Updated 3 years ago
- Code for ACL2021 paper: "GLGE: A New General Language Generation Evaluation Benchmark" ☆58 Updated 2 years ago
- AAAI-20 paper: Cross-Lingual Natural Language Generation via Pre-Training ☆129 Updated 3 years ago
- ☆36 Updated 2 years ago
- ☆42 Updated 4 years ago
- [ACL 2020] Structure-Level Knowledge Distillation For Multilingual Sequence Labeling ☆72 Updated 2 years ago
- codes and pre-trained models of paper "Segatron: Segment-aware Transformer for Language Modeling and Understanding" ☆18 Updated 2 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020) ☆120 Updated 4 years ago
- ABCD: A Graph Framework to Convert Complex Sentences to a Covering Set of Simple Sentences ☆28 Updated last year
- evaluation suite for testing automatic grammatical error corrections ☆38 Updated 7 years ago
- ☆92 Updated 3 years ago
- SUPERT: Unsupervised multi-document summarization evaluation & generation ☆94 Updated 2 years ago
- ☆68 Updated 3 years ago
- Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021 ☆60 Updated 3 years ago
- ☆37 Updated 3 years ago
- BERT for joint intent classification and slot filling ☆39 Updated 5 years ago
- Summary of Responses to Questionnaire on Annotation Platform https://forms.gle/iZk8kehkjAWmB8xe9 ☆59 Updated 4 years ago
- Word sense disambiguation using contextualized word embedding ☆17 Updated 5 years ago
- ☆66 Updated 3 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations". ☆63 Updated 4 years ago
- Source code for our AAAI 2020 paper P-SIF: Document Embeddings using Partition Averaging ☆34 Updated 4 years ago