hangyav/UnsupPSE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hangyav/UnsupPSE)

hangyav / UnsupPSE

Unsupervised parallel sentence extraction from comparable corpora

☆16

Alternatives and similar repositories for UnsupPSE

Users that are interested in UnsupPSE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jind11 / DAMT
View on GitHub
Semi-supervised Domain Adaptation of Machine Translation
☆12Dec 8, 2022Updated 3 years ago
shyyhs / CourseraParallelCorpusMining
View on GitHub
Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation
☆15Aug 27, 2024Updated last year
Kyubyong / cjk_trans
View on GitHub
Pre-trained Machine Translation Models of Korean from/to ECJ
☆28Jul 15, 2019Updated 7 years ago
ZurichNLP / domain-robustness
View on GitHub
☆13Dec 11, 2020Updated 5 years ago
helang1991 / ClearTool
View on GitHub
android 6.0 clean memory and cache
☆10Dec 24, 2018Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
qhungngo / EVBCorpus
View on GitHub
The English-Vietnamese Bilingual Corpus (EVBCorpus) is a collection of English and Vietnamese parallel translations and bitexts.
☆52Jul 12, 2019Updated 7 years ago
matbahasa / TALPCo
View on GitHub
TUFS Asian Language Parallel Corpus
☆53May 1, 2023Updated 3 years ago
THUNLP-MT / Template-NMT
View on GitHub
☆23Nov 15, 2022Updated 3 years ago
shuo-git / VecConstNMT
View on GitHub
☆25Oct 22, 2022Updated 3 years ago
ncg-task / training-data
View on GitHub
Training data for the NLPContributionGraph Shared Task 11 at SemEval-2021
☆14Jan 11, 2021Updated 5 years ago
NLP2CT / Meta-Curriculum
View on GitHub
Meta-Curriculum Learning for Domain Adaptation in Neural Machine Translation (AAAI 2021)
☆26Jun 18, 2022Updated 4 years ago
Parkchanjun / SKC_MachineTranslation
View on GitHub
SKC_MachineTranslation 강의자료
☆12Nov 7, 2019Updated 6 years ago
gombru / SocialMediaWeakLabeling
View on GitHub
Learning to learn from Web and Social Media data. Joint image-text embeddings oriented to image by text retrieval.
☆18Feb 10, 2022Updated 4 years ago
teslacool / SCA
View on GitHub
Soft Contextual Data Augmentation
☆39Jul 25, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
langtech-bsc / mt-evaluation
View on GitHub
A framework for evaluating Machine Translation models.
☆13Apr 21, 2026Updated 3 months ago
cocaer / simpleNMT
View on GitHub
☆18Nov 25, 2020Updated 5 years ago
howl-anderson / Chinese_tokenizer_benchmark
View on GitHub
中文分词软件基准测试 | Chinese tokenizer benchmark
☆26Sep 5, 2018Updated 7 years ago
eliorsulem / simplification-acl2018
View on GitHub
Human Evaluation Benchmark for Text Simplification
☆10Sep 6, 2018Updated 7 years ago
ratazzi / tesseract-ocr
View on GitHub
A Python wrapper for Tesseract
☆20Sep 27, 2015Updated 10 years ago
MaxyLee / 3AM
View on GitHub
Official code and data of "3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset"
☆12Dec 8, 2024Updated last year
izhx / uni-rep
View on GitHub
Code for embedding and retrieval research.
☆16Oct 24, 2023Updated 2 years ago
sheffieldnlp / reading_group
View on GitHub
☆29Jun 5, 2022Updated 4 years ago
JHL-HUST / AdvNMT-WSLS
View on GitHub
Crafting Adversarial Examples for Neural Machine Translation
☆10Apr 7, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
tmu-nlp / sscorpus
View on GitHub
A monolingual parallel corpus for sentence simplification
☆11Jul 4, 2016Updated 10 years ago
Parkchanjun / DeepLearning_Basic_Tutorial
View on GitHub
Deep Learning Basic Tutorial (Pytorch, Keras)
☆17Nov 8, 2019Updated 6 years ago
roeeaharoni / string-to-tree-nmt
View on GitHub
Source code and data for the paper "Towards String-to-Tree Neural Machine Translation"
☆16Dec 31, 2017Updated 8 years ago
Avmb / deep-nmt-architectures
View on GitHub
Training scripts for paper Miceli Barone et al. 2017 "Deep Architectures for Neural Machine Translation"
☆11Jul 13, 2017Updated 9 years ago
ohlionel / Prune-Tune
View on GitHub
Official code repository for AAAI2021 paper Finding Sparse Structures for Domain Specific Neural Machine Translation
☆11Apr 1, 2021Updated 5 years ago
MultiPath / SEG-NMT
View on GitHub
Search Engine Guided Non-Parametric Neural Machine Translation
☆14Oct 23, 2017Updated 8 years ago
cenksari / react-seatmap-creator
View on GitHub
A powerful and intuitive seat map creator built with TypeScript and React. Design, customize, and publish your own dynamic seat charts ef…
☆22Jun 12, 2026Updated last month
mttravel / Dictionary-based-MT
View on GitHub
☆18Sep 26, 2020Updated 5 years ago
zhexiao / office-parser
View on GitHub
把教育信息化体系中的Word试题，Excel试卷、知识点等数据解析成json内容。
☆14Mar 3, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
facebookresearch / bitext-lexind
View on GitHub
Bilingual lexicons map words in one language to their translations in another, and are typically induced by learning linear project…
☆19Jun 1, 2021Updated 5 years ago
suyash / mlt
View on GitHub
Multilingual Neural Machine Translation using Transformers with Conditional Normalization.
☆18Mar 24, 2023Updated 3 years ago
longdt219 / XlingualEmb
View on GitHub
Crosslingual word embeddings described in our EMNLP paper
☆16Sep 21, 2016Updated 9 years ago
Helsinki-NLP / OpusTools
View on GitHub
☆83Jun 24, 2026Updated last month
HaebinShin / stanford-sentiment-dataset
View on GitHub
Refined dataset for Stanford Sentiment Treebank used in Yoon Kim (2014).
☆12Apr 1, 2018Updated 8 years ago
xuqiongkai / ALTER
View on GitHub
ALTER: Auxiliary Text Rewriting Tool for Natural Language Generation
☆17Dec 10, 2022Updated 3 years ago
XingxingZhang / pysari
View on GitHub
Text Simplification System and Dataset
☆15Jul 19, 2017Updated 9 years ago