PhilipMay / stsb-multi-mt
Machine translated multilingual STS benchmark dataset.
☆32Updated last year
Alternatives and similar repositories for stsb-multi-mt
Users who are interested in stsb-multi-mt are comparing it to the repositories listed below.
- Winning system (DAMO-NLP) of the SemEval 2022 MultiCoNER shared task in 10 out of 13 tracks.☆185Updated 2 years ago
- PyTorch implementation and pre-trained models for ASP - Autoregressive Structured Prediction with Language Models, EMNLP 22. https://arxi…☆107Updated last year
- A simple implementation of SimCSE☆77Updated 2 years ago
- This project provides an unsupervised framework for mining and tagging quality phrases on text corpora with pretrained language models (K…☆173Updated 2 years ago
- A multilingual version of MS MARCO passage ranking dataset☆144Updated last year
- MoverScore: Text Generation Evaluating with Contextualized Embeddings and Earth Mover Distance☆209Updated last year
- SpanNER: Named Entity Re-/Recognition as Span Prediction☆131Updated 3 years ago
- BERTserini☆26Updated 2 years ago
- A library for building hierarchical text representation and corresponding downstream applications.☆79Updated last year
- OpenNER: A toolkit for open-domain named entity recognition☆25Updated last year
- A dataset and baselines for CLS.☆12Updated 3 years ago
- ☆60Updated 2 years ago
- Automatically detect errors in annotated corpora.☆47Updated last year
- This repo contains the code for ACL2020 paper "Coreference Resolution as Query-based Span Prediction"☆139Updated 5 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆79Updated 3 years ago
- Tokenizer, POS-tagger, and dependency parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages☆52Updated 3 weeks ago
- Code for "MELM: Data Augmentation with Masked Entity Language Modeling for Low-Resource NER"☆48Updated 3 years ago
- Build Text Rerankers with Deep Language Models☆262Updated last year
- Implementation of EMNLP2020 accepted paper: "TopicBERT: Topic-aware BERT for Efficient Document Classification"☆43Updated 4 years ago
- The autoregressive information extraction system GenIE (Generative Information Extraction) implemented in PyTorch.☆104Updated 2 years ago
- A centrality-based keyphrase extraction tool for Chinese☆11Updated 3 years ago
- Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations☆134Updated 3 weeks ago
- CORWA: A Citation-Oriented Related Work Annotation Dataset, NAACL 2022☆17Updated 4 months ago
- ☆46Updated 3 years ago
- The unified platform for data-related resources.☆134Updated 2 years ago
- Source code for Neural Quality Estimation with Multiple Hypotheses for Grammatical Error Correction☆43Updated 4 years ago
- ☆28Updated 4 months ago
- cLang-8 is a dataset for grammatical error correction.☆107Updated 3 years ago
- [ACL-IJCNLP 2021] Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning☆93Updated 2 years ago
- ☆11Updated 3 years ago