The Berkeley Word Aligner
☆23Mar 24, 2016Updated 10 years ago
Alternatives and similar repositories for berkeleyaligner
Users that are interested in berkeleyaligner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the implementation of word aligner using Hidden Markov Model☆10Jun 24, 2019Updated 6 years ago
- Learn Classical Statistical Machine Translation Systems.☆18May 27, 2020Updated 5 years ago
- Codebase accompanying the paper 'Widening the Representation Bottleneck in Neural Machine Translation with Lexical Shortcuts', (Emelin, D…☆11Feb 14, 2023Updated 3 years ago
- This repository contains additional reference translations for the WMT'14 En-De (newstest2014) and WMT'19 En-Ru (newstest2019) test sets …☆15Aug 31, 2021Updated 4 years ago
- A simple TensorFlow implementation of the Transformer☆25Jan 7, 2019Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Aligned bilingual word vectors for English and Chinese☆11Jun 25, 2018Updated 7 years ago
- A word alignment tool based on famous GIZA++, extended to support multi-threading, resume training and incremental training.☆166May 12, 2021Updated 4 years ago
- Multiview LSA☆11Jun 22, 2015Updated 10 years ago
- CRFs based Chinese word segmentor☆21Oct 8, 2014Updated 11 years ago
- Code for "Fine-Tuning by Curriculum Learning for Non-Autoregressive Neural Machine Translation"☆13Jul 10, 2020Updated 5 years ago
- ☆15Jul 16, 2021Updated 4 years ago
- Simple, fast unsupervised word aligner☆770Jul 19, 2022Updated 3 years ago
- Symmetrized word alignment models, based on mgizapp and GIZA++☆14Jun 23, 2014Updated 11 years ago
- GIZA++ is a statistical machine translation toolkit that is used to train IBM Models 1-5 and an HMM word alignment model. This package al…☆273Nov 18, 2025Updated 4 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- NanGe - A Rule-based Chinese-English Machine Translation System☆20Jul 23, 2017Updated 8 years ago
- TVsub: DCU-Tencent Chinese-English Dialogue Corpus☆46Feb 14, 2018Updated 8 years ago
- Explicit Sentence Compression for Neural Machine Translation☆10May 12, 2020Updated 5 years ago
- Word sense disambiguation test sets for NMT☆20Dec 3, 2020Updated 5 years ago
- Adversarial Machine Translation with pytorch☆23Jan 14, 2018Updated 8 years ago
- Generalized Data Augmentation for Low-Resource Translation☆12Jul 30, 2019Updated 6 years ago
- This is an activator project for showcasing how to read & write data from Kafka-cluster using Scala Producer & Consumer API.☆11May 28, 2017Updated 8 years ago
- crf-seg:用于生产环境的中文分词处理工具,可自定义语料、可自定义模型、架构清晰,分词效果好。java编写。☆14Dec 11, 2021Updated 4 years ago
- Program used to split text into segments☆28Oct 27, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Simple RESTful API server running your own machine translation model. Docker image modified from mbartoli/easy-smt☆11Apr 28, 2019Updated 6 years ago
- Joint Source-Target Self Attention with Locality Constraints☆20May 9, 2020Updated 5 years ago
- ☆10Oct 20, 2020Updated 5 years ago
- Bilingual sengence aligner☆29Nov 25, 2025Updated 4 months ago
- A chatbot which is designed for open source community, able to answer open source related questions and guide you to do OSS.☆13Apr 2, 2023Updated 3 years ago
- preprocessing of the MUC4 dataset☆11Aug 28, 2012Updated 13 years ago
- A web-application for phrase-structure annotation in general, and UCCA annotation in particular☆15Jan 12, 2023Updated 3 years ago
- Code for "Unsupervised Cross-lingual Transfer of Word Embedding Spaces" in EMNLP 2018☆24Dec 29, 2018Updated 7 years ago
- gated cnn (Language Modeling with Gated Convolutional Networks)☆18Jan 3, 2017Updated 9 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- CHASE is a large-scale and pragmatic Chinese dataset for cross-database context-dependent text-to-SQL task (natural language interfaces f…☆10May 7, 2021Updated 4 years ago
- Implementation of our paper "Data Rejuvenation: Exploiting Inactive Training Examples for Neural Machine Translation" in EMNLP-2020.☆23Aug 20, 2021Updated 4 years ago
- ☆17Jul 6, 2020Updated 5 years ago
- This repository contains source code for the paper "Language Model Prior for Low-Resource Neural Machine Translation"☆42Mar 16, 2021Updated 5 years ago
- Record my paper reading about Machine Translation and other related works.☆36Nov 19, 2021Updated 4 years ago
- A C++ toolkit for neural machine translation for CPU☆88Jun 11, 2019Updated 6 years ago
- Dock You a Moses: Moses Statistical MT in a container☆14Feb 18, 2020Updated 6 years ago