Parallel corpora for the biomedical domain
☆50Jul 18, 2024Updated last year
Alternatives and similar repositories for corpora
Users that are interested in corpora are comparing it to the libraries listed below
Sorting:
- Chinese to English medical translation☆60May 15, 2021Updated 4 years ago
- The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco…☆37Aug 29, 2025Updated 6 months ago
- benchmarks for evaluating MT models☆11Jun 26, 2024Updated last year
- Domain Adaptation of Neural Machine Translation by Lexicon Induction☆20Jan 3, 2020Updated 6 years ago
- Unsupervised parallel sentence extraction from comparable corpora☆16Aug 6, 2019Updated 6 years ago
- Multi-lingual & multi-domain (specialisation for biomedical data) translation model☆40Nov 17, 2020Updated 5 years ago
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆41Dec 19, 2023Updated 2 years ago
- Code for the ACL2021 paper "Combining Static Word Embedding and Contextual Representations for Bilingual Lexicon Induction"☆14May 30, 2021Updated 4 years ago
- Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"☆23May 26, 2021Updated 4 years ago
- Codebase accompanying the paper 'Widening the Representation Bottleneck in Neural Machine Translation with Lexical Shortcuts', (Emelin, D…☆11Feb 14, 2023Updated 3 years ago
- Transform TMX to text☆28Nov 23, 2022Updated 3 years ago
- ☆21Dec 30, 2021Updated 4 years ago
- ☆26Jan 9, 2023Updated 3 years ago
- Expert annotated Hallmarks of Cancer Corpus☆21Sep 18, 2018Updated 7 years ago
- Companion toolkit of the 'Serial Speakers' dataset.☆11Feb 17, 2020Updated 6 years ago
- Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model"☆23Jun 28, 2024Updated last year
- Code for our work "MSP: Multi-Stage Prompting for Making Pre-trained Language Models Better Translators" in ACL 2022☆20Mar 18, 2022Updated 4 years ago
- The official code implementation of "LaCon: Late-Constraint Diffusion for Steerable Guided Image Synthesis".☆37Dec 11, 2025Updated 3 months ago
- Code for the ACL2020 paper Character-Level Translation with Self-Attention☆31Oct 15, 2020Updated 5 years ago
- Tools for formatting WMT hypothesis and test sets in XML☆27Apr 18, 2025Updated 11 months ago
- ☆13Dec 17, 2021Updated 4 years ago
- Running massive simulations using RNNs on CPUs for building bots and all kinds of things.☆13Jun 13, 2021Updated 4 years ago
- PolEval 2021 Task 1☆15Jun 28, 2022Updated 3 years ago
- Best Practices in Translation Memory Management☆47Dec 14, 2018Updated 7 years ago
- [NAACL 2018] Robust Sequence Labeling with Adversarial Training☆10Sep 30, 2019Updated 6 years ago
- ☆10Apr 13, 2022Updated 3 years ago
- A zero-shot faithfulness evaluation metric for text summarization☆11Oct 17, 2023Updated 2 years ago
- ☆20Aug 17, 2021Updated 4 years ago
- m4Adapter: Multilingual Multi-Domain Adaptation for Machine Translation with a Meta-Adapter (EMNLP 2022)☆19Mar 28, 2023Updated 2 years ago
- An official implementation of "BPE-Dropout: Simple and Effective Subword Regularization" algorithm.☆53Feb 17, 2021Updated 5 years ago
- ☆10May 16, 2024Updated last year
- "What is Learned in Visually Grounded Neural Syntax Acquisition", Noriyuki Kojima, Hadar Averbuch-Elor, Alexander Rush and Yoav Artzi (AC…☆12Dec 30, 2021Updated 4 years ago
- ☆16Sep 28, 2023Updated 2 years ago
- The implementation of "Learning Deep Transformer Models for Machine Translation"☆116Jul 25, 2024Updated last year
- ☆34Mar 25, 2023Updated 2 years ago
- ☆17Sep 24, 2024Updated last year
- A framework for evaluating Machine Translation models.☆12May 26, 2025Updated 9 months ago
- ☆25Oct 22, 2022Updated 3 years ago
- Archive Youtube videos and channels☆17Jul 14, 2021Updated 4 years ago