Add noise to your text; this can be used to improve a synthetic training corpus for Neural Machine Translation
☆41Aug 8, 2019Updated 6 years ago
Alternatives and similar repositories for noisy-text
Users that are interested in noisy-text are comparing it to the libraries listed below.
- Sampling-Based Minimum Bayes-Risk Decoding for Neural Machine Translation☆16Oct 14, 2022Updated 3 years ago
- coco is an open-source conversation collector, or simply a fitness tracker for your conversations. coco is private by default. it runs on …☆31May 19, 2025Updated 11 months ago
- This repository contains the source code and links to some datasets used in the CoNLL 2019 paper "Learning to Represent Bilingual Diction…☆12Oct 1, 2020Updated 5 years ago
- NMT domain adaptation papers (updating...)☆17Jun 1, 2019Updated 6 years ago
- Implementation of "Modeling Past and Future for Neural Machine Translation"☆15Mar 16, 2018Updated 8 years ago
- Word sense disambiguation test sets for NMT☆20Dec 3, 2020Updated 5 years ago
- Neural network language models, including feed-forward neural network, recurrent neural network, and long short-term memory neural network.☆11Aug 3, 2017Updated 8 years ago
- Code Repository for the IndicXNLI paper.☆15Jul 8, 2023Updated 2 years ago
- ☆11May 25, 2023Updated 2 years ago
- A tool to generate causal DAGs from syslog time-series.☆13Nov 7, 2023Updated 2 years ago
- A simple and readable neural machine translation system☆24Mar 6, 2022Updated 4 years ago
- Exploring advanced prompting tools to query SQL database with multiple tables in natural language using LLMs☆16Aug 23, 2024Updated last year
- Code for Synchronous Bidirectional Neural Machine Translation (SB-NMT)☆67May 16, 2019Updated 6 years ago
- Code for our SIGKDD'25 paper BLAST: Balanced Sampling Time Series Corpus for Universal Forecasting Models.☆25May 26, 2025Updated 11 months ago
- ☆33Jun 20, 2018Updated 7 years ago
- [MIA'20] Semi-supervised WCE Image Classification with Adaptive Aggregated Attention☆13Dec 30, 2020Updated 5 years ago
- Dataset and codes for the paper "Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training".☆25Mar 6, 2022Updated 4 years ago
- The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco…☆38Aug 29, 2025Updated 8 months ago
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languages☆11Feb 6, 2024Updated 2 years ago
- Supporting example for "A Rust SentencePiece implementation"☆20Jun 7, 2020Updated 5 years ago
- PyTorch implementation of FAIR's paper "End-to-End Memory Network", NIPS 2015☆12Oct 19, 2017Updated 8 years ago
- Repository for Content-Aware Transformer☆16Feb 20, 2023Updated 3 years ago
- Training a sign language detection model☆11Aug 26, 2022Updated 3 years ago
- UrIII Period (Sumerian Language) Information Extraction pipeline including Named Entity Recognition, Part-of-Speech Tagging and Machine …☆31Apr 6, 2025Updated last year
- 🪱 PARASITE || A parallel sentence data preprocessing toolkit. Originally developed as a part of the `en-ru` winner submission of WMT20 B…☆11Jun 8, 2021Updated 4 years ago
- A Simple Flask App to interact with your Machine Translation Model☆13Feb 26, 2020Updated 6 years ago
- Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset☆13Nov 19, 2022Updated 3 years ago
- Code and Data for the ACL21 paper "Modeling Bilingual Conversational Characteristics for Neural Chat Translation"☆12Dec 17, 2021Updated 4 years ago
- Code for "Unsupervised Cross-lingual Transfer of Word Embedding Spaces" in EMNLP 2018☆24Dec 29, 2018Updated 7 years ago
- A set of command-line tools to preprocess videos for sign language analysis☆14Aug 20, 2025Updated 8 months ago
- Python package to augment multilingual data☆15Feb 15, 2023Updated 3 years ago
- ☆12Oct 16, 2024Updated last year
- A PyTorch implementation of the paper Generative Adversarial Text-to-Image Synthesis☆25Nov 6, 2019Updated 6 years ago
- A benchmark for language models based on the UK Linguistics Olympiad☆10Mar 3, 2025Updated last year
- ☆21Nov 27, 2025Updated 5 months ago
- Anomaly detection from OS logs using Transformers implemented with Pytorch.☆20Dec 16, 2020Updated 5 years ago
- ☆18Mar 3, 2025Updated last year
- Code for NAACL-19 paper "Relation Extraction with Temporal Reasoning Based on Memory Augmented Distant Supervision"☆10Aug 26, 2019Updated 6 years ago
- a ducttape workflow for neural machine translation☆14Mar 23, 2021Updated 5 years ago