Add noise to your text. Can be used to improve a synthetic training corpus for Neural Machine Translation.
☆41Aug 8, 2019Updated 6 years ago
Alternatives and similar repositories for noisy-text
Users that are interested in noisy-text are comparing it to the libraries listed below
- Explicit Sentence Compression for Neural Machine Translation☆10May 12, 2020Updated 5 years ago
- This repository contains the source code and links to some datasets used in the CoNLL 2019 paper "Learning to Represent Bilingual Diction…☆12Oct 1, 2020Updated 5 years ago
- Sampling-Based Minimum Bayes-Risk Decoding for Neural Machine Translation☆16Oct 14, 2022Updated 3 years ago
- Neutron: A PyTorch-based implementation of Transformer and its variants.☆64Aug 10, 2023Updated 2 years ago
- Implementation of "Modeling Past and Future for Neural Machine Translation"☆15Mar 16, 2018Updated 7 years ago
- Exploring the Limits of Low-Resource Neural Machine Translation☆34Feb 16, 2023Updated 3 years ago
- ☆18May 15, 2021Updated 4 years ago
- Official Pytorch implementation of (Roles and Utilization of Attention Heads in Transformer-based Neural Language Models), ACL 2020☆16Mar 21, 2025Updated 11 months ago
- NMT domain adaptation papers (updating...)☆17Jun 1, 2019Updated 6 years ago
- Supporting example for "A Rust SentencePiece implementation"☆20Jun 7, 2020Updated 5 years ago
- Word sense disambiguation test sets for NMT☆20Dec 3, 2020Updated 5 years ago
- A simple and readable neural machine translation system☆24Mar 6, 2022Updated 3 years ago
- A phoneme-allophone database for many languages☆53May 19, 2020Updated 5 years ago
- Instruction to data diversification☆24Nov 24, 2020Updated 5 years ago
- An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!☆85Oct 1, 2020Updated 5 years ago
- This is a fork of the awesome Joey-NMT with Reinforcement Learning algorithms like Policy Gradient, MRT and Advantage Actor Critic.☆27Feb 10, 2023Updated 3 years ago
- ACL19_Depth_Growing_for_Neural_Machine_Translation☆23Jul 6, 2019Updated 6 years ago
- The Berkeley Word Aligner☆23Mar 24, 2016Updated 9 years ago
- A PyTorch implementation of the paper Generative Adversarial Text-to-Image Synthesis☆25Nov 6, 2019Updated 6 years ago
- Reversible tokenization in Python.☆60Aug 21, 2018Updated 7 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Jun 18, 2024Updated last year
- Code for "Unsupervised Cross-lingual Transfer of Word Embedding Spaces" in EMNLP 2018☆24Dec 29, 2018Updated 7 years ago
- name2nat: a Python package for nationality prediction from a name☆115Oct 14, 2020Updated 5 years ago
- An implementation of transformer-based language model for sentence rewriting tasks such as summarization, simplification, and grammatical…☆28Jul 25, 2024Updated last year
- The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco…☆36Aug 29, 2025Updated 6 months ago
- Pumilio: A Web-Based Management System for Ecological Recordings☆13Oct 29, 2018Updated 7 years ago
- Plexus - Interactive Emotion Visualization based on Social Media☆29Aug 12, 2019Updated 6 years ago
- Pre-trained Machine Translation Models of Korean from/to ECJ☆29Jul 15, 2019Updated 6 years ago
- BERT models for many languages created from Wikipedia texts☆33May 25, 2020Updated 5 years ago
- Scripts and noise data for Belinkov & Bisk 2018☆29Apr 27, 2018Updated 7 years ago
- Experimenting with GANs in Tensorflow/Keras☆10Jan 13, 2022Updated 4 years ago
- Simple, beautiful discussion forums - for customer support, news aggregation, QA sites, and online communities.☆56Dec 9, 2012Updated 13 years ago
- Neural network language models, including feed-forward neural networks, recurrent neural networks, and long short-term memory neural networks.☆11Aug 3, 2017Updated 8 years ago
- ☆11May 25, 2023Updated 2 years ago
- Code, source data, examples, and audio excerpts for Flow: Expressive Rhythm in the Rapping Voice☆10Feb 13, 2020Updated 6 years ago
- ☆10Dec 21, 2022Updated 3 years ago
- Masakhane Web is a translation web application solely for African languages.☆37Aug 11, 2023Updated 2 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- [ACL 2024] Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models☆41Jun 4, 2024Updated last year