biomedical-translation-corpora / corporaLinks

Parallel corpora for the biomedical domain

☆49

Alternatives and similar repositories for corpora

Users that are interested in corpora are comparing it to the libraries listed below

Sorting:

neulab / contextual-mt
A repository with the code related to experiments around context-aware machine translation
☆50Updated 3 years ago
wmt-conference / wmt-format-tools
Tools for formatting WMT hypothesis and test sets in XML
☆27Updated 2 months ago
wmt-conference / wmt22-news-systems
☆20Updated 2 years ago
wenlai-lavine / m4Adapter
m4Adapter: Multilingual Multi-Domain Adaptation for Machine Translation with a Meta-Adapter (Findings of EMNLP 2022)
☆19Updated 2 years ago
tmramalho / finetune-mbart
How to finetune mbart using fairseq
☆24Updated 4 years ago
spraakbanken / multiged-2023
☆15Updated 2 years ago
THUNLP-MT / Mask-Align
Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021
☆61Updated 4 years ago
lilt / alignment-scripts
Scripts to preprocess training and test data and to run fast_align and giza
☆108Updated 3 years ago
mahfuzibnalam / terminology_evaluation
☆21Updated 3 years ago
rbawden / discourse-mt-test-sets
☆28Updated last year
zliucr / CrossNER
CrossNER: Evaluating Cross-Domain Named Entity Recognition (AAAI-2021)
☆130Updated 4 years ago
wns823 / NMT_SSP
NMT with ssp
☆11Updated 3 years ago
hsing-wang / WMT2020_BioMedical
☆15Updated 3 years ago
sheffieldnlp / mlqe-pe
Multilingual Quality Estimation and Automatic Post-editing Dataset
☆42Updated 3 years ago
roeeaharoni / unsupervised-domain-clusters
Code and data accompanying our ACL 2020 paper, "Unsupervised Domain Clusters in Pretrained Language Models".
☆58Updated 4 years ago
oriram / splinter
☆92Updated 3 years ago
ZurichNLP / coverage-contrastive-conditioning
Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive…
☆22Updated 2 years ago
google-research-datasets / clang8
cLang-8 is a dataset for grammatical error correction.
☆106Updated 2 years ago
Yale-LILY / dart
Dataset for NAACL 2021 paper: "DART: Open-Domain Structured Data Record to Text Generation"
☆154Updated 2 years ago
ZurichNLP / ContraDecode
The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco…
☆35Updated last year
nttcslab-nlp / word_align
A Supervised Word Alignment Method based on Cross-Language Span Prediction using Multilingual BERT
☆26Updated 4 years ago
evgeniiaraz / datasets_multiling_dialogue
Multilingual Dialogue Datasets
☆19Updated 2 years ago
wmt-conference / wmt21-news-systems
☆24Updated 2 years ago
chaojiang06 / wiki-auto
Neural CRF Model for Sentence Alignment in Text Simplification
☆68Updated 5 months ago
ghchen18 / cdalign
Code for AAAI 2021 paper "Lexically Constrained Neural Machine Translation with Explicit Alignment Guidance"
☆25Updated 2 years ago
jcyk / copyisallyouneed
Code for our ACL2021 paper Neural Machine Translation with Monolingual Translation Memory
☆82Updated 2 years ago
Yale-LILY / QMSum
Dataset for NAACL 2021 paper: "QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization"
☆126Updated last year
google-research / mt-metrics-eval
Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.
☆109Updated 3 months ago
diyiy / ACL2022_Limited_Data_Learning_Tutorial
☆92Updated 3 years ago
lxucs / coref-hoi
PyTorch implementation of the end-to-end coreference resolution model with different higher-order inference methods.
☆60Updated 2 years ago