valentinmace/noisy-text

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/valentinmace/noisy-text)

valentinmace / noisy-text

Add noise to your text, can be used to improve synthetic training corpus for Neural Machine Translation

☆41

Alternatives and similar repositories for noisy-text

Users that are interested in noisy-text are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Roxot / mbr-nmt
View on GitHub
Sampling-Based Minimum Bayes-Risk Decoding for Neural Machine Translation
☆16Oct 14, 2022Updated 3 years ago
hfxunlp / transformer
View on GitHub
Neutron: A pytorch based implementation of Transformer and its variants.
☆65Aug 10, 2023Updated 2 years ago
muhaochen / bilingual_dictionaries
View on GitHub
This repository contains the source code and links to some datasets used in the CoNLL 2019 paper "Learning to Represent Bilingual Diction…
☆12Oct 1, 2020Updated 5 years ago
yhy1117 / DA4NMT
View on GitHub
NMT domain adaptation papers (updating...)
☆17Jun 1, 2019Updated 7 years ago
zhengzx-nlp / past-and-future-nmt
View on GitHub
Implementation of "Modeling Past and Future for Neural Machine Translation"
☆15Mar 16, 2018Updated 8 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
danielqingz / spiders
View on GitHub
爬虫：用于爬取百度百科中英语料、东方财富网财报、医学NER中英语料，可实现Deepl多语自动翻译
☆13Jul 7, 2021Updated 5 years ago
ZurichNLP / ContraWSD
View on GitHub
Word sense disambiguation test sets for NMT
☆21Dec 3, 2020Updated 5 years ago
dengliangshi / pynnlms
View on GitHub
Neural network language models, including feed-forward neural network, recurrent neural network, long-short term memory neural network.
☆11Aug 3, 2017Updated 8 years ago
cdli-gh / Semi-Supervised-NMT-for-Sumerian-English
View on GitHub
Exploring the Limits of Low-Resource Neural Machine Translation
☆34Feb 16, 2023Updated 3 years ago
heartcored98 / transformer_anatomy
View on GitHub
Official Pytorch implementation of (Roles and Utilization of Attention Heads in Transformer-based Neural Language Models), ACL 2020
☆16Mar 21, 2025Updated last year
divyanshuaggarwal / IndicXNLI
View on GitHub
Code Repository for the IndicXNLI paper.
☆15Jul 8, 2023Updated 3 years ago
nxphi47 / data_diversification
View on GitHub
Instruction to data diversification
☆24Nov 24, 2020Updated 5 years ago
wszlong / sb-nmt
View on GitHub
Code for Synchronous Bidirectional Neural Machine Translation (SB-NMT)
☆67May 16, 2019Updated 7 years ago
sumanbanerjee1 / Code-Mixed-Dialog
View on GitHub
☆33Jun 20, 2018Updated 8 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
hannlp / SimpleNMT
View on GitHub
A simple and readable neural machine translation system
☆24Mar 6, 2022Updated 4 years ago
kevindegila / flask-joey
View on GitHub
A Simple Flask App to interact with your Machine Translation Model
☆13Feb 26, 2020Updated 6 years ago
ZurichNLP / ContraDecode
View on GitHub
The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco…
☆38Aug 29, 2025Updated 10 months ago
apeterswu / Depth_Growing_NMT
View on GitHub
ACL19_Depth_Growing_for_Neural_Machine_Translation
☆23Jul 6, 2019Updated 7 years ago
guillaume-be / SentencePiece-Rust-example
View on GitHub
Supporting example for "A Rust SentencePiece implementation"
☆20Jun 7, 2020Updated 6 years ago
iosonofabio / seqanpy
View on GitHub
Fast pairwise sequence alignment using SeqAn, in Python.
☆13Mar 23, 2019Updated 7 years ago
kuc2477 / pytorch-memn2n
View on GitHub
PyTorch implementation of FAIR's paper "End-to-End Memory Network", NIPS 2015
☆12Oct 19, 2017Updated 8 years ago
vipulgupta1011 / CALM
View on GitHub
☆11Oct 2, 2023Updated 2 years ago
sign-language-processing / detection-train
View on GitHub
Training a sign language detection model
☆11May 10, 2026Updated 2 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
ImperialNLP / MMT-Delib
View on GitHub
☆10Dec 21, 2022Updated 3 years ago
cdli-gh / Sumerian-Translation-Pipeline
View on GitHub
UrIII Period (Sumerian Language) Information Extraction pipeline including, Named Entity Recognition, Part Of Speech Tagging and Machine …
☆31Apr 6, 2025Updated last year
prakashpandey9 / Text2Image-PyTorch
View on GitHub
A PyTorch implementation of the paper Generative Adversarial Text-to-Image Synthesis
☆25Nov 6, 2019Updated 6 years ago
Yifan-Gao / open_retrieval_conversational_machine_reading
View on GitHub
Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset
☆13Nov 19, 2022Updated 3 years ago
YerevaNN / PARASITE
View on GitHub
🪱 PARASITE || A parallel sentence data preprocessing toolkit. Originally developed as a part of the `en-ru` winner submission of WMT20 B…
☆11Jun 8, 2021Updated 5 years ago
nutcrtnk / DHGNet
View on GitHub
Code for paper "Cross-lingual Transfer for Text Classification with Dictionary-based Heterogeneous Graph", EMNLP 2021 - findings.
☆13Dec 14, 2021Updated 4 years ago
LinuxSuRen / yaml-readme
View on GitHub
A helper to generate the READE file automatically from YAML-based metadata files.
☆19May 23, 2024Updated 2 years ago
am-bean / lingOly
View on GitHub
A benchmark for language models based on the UK Linguistics Olympiad
☆12Mar 3, 2025Updated last year
ChiyuSONG / dynamics-of-instruction-tuning
View on GitHub
☆18Mar 3, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
XL2248 / CPCC
View on GitHub
Code and Data for the ACL21 paper "Modeling Bilingual Conversational Characteristics for Neural Chat Translation"
☆12Dec 17, 2021Updated 4 years ago
shuoyangd / tape4nmt
View on GitHub
a ducttape workflow for neural machine translation
☆14Mar 23, 2021Updated 5 years ago
wxjiao / UncSamp
View on GitHub
Implementation of our paper "Self-training Sampling with Monolingual Data Uncertainty for Neural Machine Translation" to appear in ACL-20…
☆31Jul 16, 2021Updated 5 years ago
ElliottYan / DS_Temporal
View on GitHub
Code for NAACL-19 paper "Relation Extraction with Temporal Reasoning Based on Memory Augmented Distant Supervision"
☆10Aug 26, 2019Updated 6 years ago
syuqings / Fashion-MMT
View on GitHub
Dataset and codes for the paper "Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training".
☆25Mar 6, 2022Updated 4 years ago
hanbin973 / scIntegral
View on GitHub
Highly scalable integration and classification of single-cell RNA sequencing data
☆11Dec 27, 2020Updated 5 years ago
sachink1729 / SQL-Agents-Using-RAG-DSPy-Groq
View on GitHub
Exploring advanced prompting tools to query SQL database with multiple tables in natural language using LLMs
☆16Aug 23, 2024Updated last year