bojone / labse
Language-agnostic BERT Sentence Embedding (LaBSE)
☆151Updated 4 years ago
Alternatives and similar repositories for labse:
Users that are interested in labse are comparing it to the libraries listed below
- ☆67Updated 3 years ago
- ☆119Updated 4 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020)☆120Updated 4 years ago
- Scripts to preprocess training and test data and to run fast_align and giza☆108Updated 3 years ago
- Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models …☆230Updated 2 years ago
- This repo contains the code for ACL2020 paper "Coreference Resolution as Query-based Span Prediction"☆139Updated 4 years ago
- ☆58Updated last year
- Repository of "An Empirical Study of Incorporating Pseudo Data into Grammatical Error Correction" (EMNLP-IJCNLP 2019)☆68Updated 5 years ago
- AAAI-20 paper: Cross-Lingual Natural Language Generation via Pre-Training☆129Updated 3 years ago
- Massively Multilingual Transfer for NER☆86Updated 3 years ago
- BERTserini☆26Updated 2 years ago
- Implementation of Self-adjusting Dice Loss from "Dice Loss for Data-imbalanced NLP Tasks" paper☆108Updated 4 years ago
- This repository contains materials for our tutorial on automatic grammatical error correction: R. Grundkiewicz, C. Bryant, M. Felice: A C…☆39Updated 4 years ago
- Dual Encoders for State-of-the-art Natural Language Processing.☆61Updated 2 years ago
- Pre-processing and in some cases downloading of datasets for the paper "Content Selection in Deep Learning Models of Summarization."☆78Updated 2 years ago
- ☆87Updated 3 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆134Updated last year
- Named Entity Recognition as Dependency Parsing☆39Updated 4 years ago
- The official implementation of EMNLP 2020, "A Simple and Effective Model for Answering Multi-span Questions".☆158Updated last year
- BERT for joint intent classification and slot filling☆39Updated 5 years ago
- OpusFilter - Parallel corpus processing toolkit☆104Updated last month
- Summary of Responses to Questionnaire on Annotation Platform https://forms.gle/iZk8kehkjAWmB8xe9☆59Updated 4 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆119Updated 3 years ago
- SUPERT: Unsupervised multi-document summarization evaluation & generation☆94Updated 2 years ago
- a Fairseq fork for sequence tagging/labeling tasks☆31Updated 4 years ago
- Named Entity Recognition with Pretrained XLM-RoBERTa☆89Updated 3 years ago
- cLang-8 is a dataset for grammatical error correction.☆104Updated 2 years ago
- Models, system configurations and outputs of our winning GEC systems in the BEA 2019 shared task described in R. Grundkiewicz, M. Junczys…☆51Updated 5 years ago
- IEEE/ACM TASLP 2020: SBERT-WK: A Sentence Embedding Method By Dissecting BERT-based Word Models☆179Updated 4 years ago
- Dataset and baseline for ACL 2019 paper "XQA: A Cross-lingual Open-domain Question Answering Dataset"☆89Updated 3 years ago