cisnlp / parcoureView external linksLinks
ParCourE - Parallel Corpus Explorer
☆12Dec 27, 2021Updated 4 years ago
Alternatives and similar repositories for parcoure
Users that are interested in parcoure are comparing it to the libraries listed below
Sorting:
- Domain-Specific Text Generation for Machine Translation (with LLMs) - scripts and config files for the paper☆18Aug 19, 2023Updated 2 years ago
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆41Dec 19, 2023Updated 2 years ago
- ☆21May 30, 2022Updated 3 years ago
- This will download and process the Google Ngram data.☆24Nov 29, 2022Updated 3 years ago
- A Pytorch-based Neural Machine Translation Framework for Research☆26Nov 5, 2020Updated 5 years ago
- Tool to fix bitexts and tag near-duplicates for removal☆34Sep 4, 2025Updated 5 months ago
- ☆29Jun 10, 2024Updated last year
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Jun 18, 2024Updated last year
- ☆28Feb 24, 2025Updated 11 months ago
- A multilingual parallel corpus created from translations of the Bible.☆191May 19, 2025Updated 8 months ago
- Deep-learning Transfer Learning models of NTUA-SLP team submitted at the IEST of WASSA 2018 at EMNLP 2018.☆32Dec 27, 2022Updated 3 years ago
- Translation Memory Open-source Purifier☆35Nov 6, 2022Updated 3 years ago
- NTREX -- News Test References for MT Evaluation☆88Jun 5, 2024Updated last year
- материалы курса по питону для студентов дпо-программы "компьютерная лингвистика" в НИУ ВШЭ (2020-2021)☆11Feb 21, 2022Updated 3 years ago
- A parallel evaluation data set of SAP software documentation with document structure annotation☆14Jul 30, 2025Updated 6 months ago
- ☆12Aug 24, 2022Updated 3 years ago
- Python source code for EMNLP 2020 paper "Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT".☆35Mar 16, 2022Updated 3 years ago
- A framework for evaluating Machine Translation models.☆12May 26, 2025Updated 8 months ago
- Simultaneous NMT/MMT framework in PyTorch☆38Mar 22, 2025Updated 10 months ago
- Framework for neural-based Quality Estimation☆41Sep 23, 2020Updated 5 years ago
- ☆10May 22, 2022Updated 3 years ago
- We present a list of languages with their codes, families, regions and etc. We also present a list of multi-lingual corpora (with urls).☆88Jun 2, 2021Updated 4 years ago
- This dataset contains naturally-occurring English sentences that feature non-trivial noun-verb ambiguity.☆37Apr 26, 2019Updated 6 years ago
- code for Teaching LM to Translate with Comparison☆39Dec 15, 2023Updated 2 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆387Nov 7, 2023Updated 2 years ago
- Lars's datasets☆12Jun 16, 2024Updated last year
- Collection of Common Machine Translation Tools☆11Jul 26, 2022Updated 3 years ago
- Pretrained segmenter models for Portuguese legislative text.☆13Oct 13, 2024Updated last year
- Bluebell is a generic Akoma Ntoso 3 parser.☆19Jan 5, 2026Updated last month
- Optimized inference with Ascend and Hugging Face☆12Apr 23, 2024Updated last year
- Complete set of English dialect transformation rules and evaluation code☆16Jun 7, 2024Updated last year
- On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021))☆13Nov 21, 2021Updated 4 years ago
- ☆11Nov 14, 2021Updated 4 years ago
- Deprecated. Please see https://github.com/OpenConceptLab/oclweb2☆11Jul 6, 2021Updated 4 years ago
- A simple mock API server using expressjs that is hosted on firebase.☆10Jun 29, 2022Updated 3 years ago
- In this project, you'll train a convolutional neural network to classify and recognize different categories of fonts. We'll be using the …☆13Feb 29, 2020Updated 5 years ago
- Morphological analysis for Udmurt.☆12Nov 5, 2025Updated 3 months ago
- Feature Decay Algorithms☆11Mar 5, 2014Updated 11 years ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago