MARMOT - the open source framework for feature extraction and machine learning, designed to estimate the quality of Machine Translation output
☆22Oct 29, 2017Updated 8 years ago
Alternatives and similar repositories for marmot
Users that are interested in marmot are comparing it to the libraries listed below
Sorting:
- Pascal2 Harvest project QuEst☆14Sep 15, 2014Updated 11 years ago
- KiwiCutter is a simple introduction to using OpenKiwi☆13Dec 8, 2022Updated 3 years ago
- machine translation and quality estimation☆35Jan 13, 2019Updated 7 years ago
- Feature Decay Algorithms☆11Mar 5, 2014Updated 12 years ago
- Datasets for machine translation☆10Jul 5, 2019Updated 6 years ago
- Translation Error Rate (TER)☆45May 25, 2018Updated 7 years ago
- WMT-2012 shared task on Quality Estimation☆18Sep 5, 2012Updated 13 years ago
- The 14th Machine Translation Marathon 2019 in Edinburgh☆13Dec 8, 2022Updated 3 years ago
- TER-plus Machine Translation metric.☆31May 23, 2022Updated 3 years ago
- tensor2tensor usage☆11Apr 20, 2019Updated 6 years ago
- Transformer based translation quality estimation☆114Jul 20, 2023Updated 2 years ago
- Allows language communities to build their own dictionaries. Development is tracked at https://jira.sil.org/projects/WS☆19Jan 30, 2026Updated last month
- bilingual dictionary extractor from parallel corpora☆23Jul 3, 2014Updated 11 years ago
- Open-source tools for morphological tagging, segmentation and stemming.☆40Jul 11, 2019Updated 6 years ago
- Terminology management web platform☆50Apr 22, 2022Updated 3 years ago
- We release a dataset based on Wikipedia sentences and the corresponding translations in 6 different languages along with the scores (scal…☆81Aug 31, 2021Updated 4 years ago
- ☆20Aug 17, 2021Updated 4 years ago
- 机器翻译子任务-翻译质量评价-复现 WMT2018 阿里论文结果☆20Mar 12, 2019Updated 6 years ago
- Finetune CPM-1☆24Jun 20, 2021Updated 4 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Jun 18, 2024Updated last year
- Appraise code used as part of WMT21 human evaluation campaign☆30Dec 15, 2025Updated 2 months ago
- Search comments and highlights annotations in PDF documents.☆12May 4, 2023Updated 2 years ago
- A LibreOffice extension that converts JabRef references to plain text code and vice versa so that you can use your references with MS Off…☆12Aug 15, 2024Updated last year
- Interactive map of the Italian regions and provinces.☆10Apr 5, 2018Updated 7 years ago
- Open-source framework for science presentations and talks, generating interactive HTML from simple Python based interface.☆13Jul 21, 2025Updated 7 months ago
- Multilingual Language Modeling Toolkit☆11May 25, 2017Updated 8 years ago
- Search for a given subsequence in a list of strings and transform the resulting list as required☆18Nov 15, 2015Updated 10 years ago
- A Python script to delete all comment and submission data from a given Reddit account.☆11Jan 5, 2021Updated 5 years ago
- [MQM-APE] Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators.☆11Sep 24, 2024Updated last year
- Open-Source Machine Translation Quality Estimation in PyTorch☆232Jun 23, 2022Updated 3 years ago
- Lexically constrained decoding for sequence generation using Grid Beam Search☆94Aug 29, 2018Updated 7 years ago
- Improving the Transformer translation model with document-level context☆170Jul 7, 2020Updated 5 years ago
- A Python script that generates a colorful voxel city and writes it to a MagicaVoxel .vox file☆14Aug 18, 2020Updated 5 years ago
- Pretrained segmenter models for Portuguese legislative text.☆13Oct 13, 2024Updated last year
- Non Metric Space ( Approximate ) Library in R☆12Feb 2, 2023Updated 3 years ago
- A curated list of awesome Molecular Modeling And Drug Discovery 🔥☆11Jul 21, 2022Updated 3 years ago
- A LLM-powered agent for NetHack☆20Nov 4, 2024Updated last year
- Better Transition-Based AMR Parsing with a Refined Search Space (authors' DyNet implementation for the EMNLP18 paper)☆10Jun 13, 2019Updated 6 years ago
- Survey of available speech datasets for Polish ASR development☆17Jan 1, 2025Updated last year