czcorpus / InterText_serverLinks
Collaborative on-line editor for aligned parallel texts.
☆13Updated 3 years ago
Alternatives and similar repositories for InterText_server
Users that are interested in InterText_server are comparing it to the libraries listed below
Sorting:
- Bilingual sengence aligner☆28Updated 2 years ago
- Sentence aligner☆119Updated 4 years ago
- Editor for aligned parallel texts (personal desktop application).☆19Updated 5 years ago
- several algorithms for converting dependency structures into constituency structures.☆10Updated 3 years ago
- A simple neural truecaser written in pytorch and allennlp.☆33Updated last year
- Examples, tutorials and use cases for Marian, including our WMT-2017/18 baselines.☆78Updated 2 years ago
- Bitextor generates translation memories from multilingual websites☆295Updated 11 months ago
- Efficient Markov Chain word alignment☆53Updated 4 years ago
- ☆78Updated 2 months ago
- ☆48Updated 6 years ago
- Pipelined quality estimation.☆51Updated 6 years ago
- Twpipe is a pipeline toolkit that parses raw tweets into universal dependencies.☆28Updated 6 years ago
- ☆42Updated 7 years ago
- General-Purpose Neural Networks for Sentence Boundary Detection☆73Updated 2 years ago
- Wiktionary parser tool for many language editions.☆54Updated 3 years ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆74Updated 10 years ago
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆41Updated last year
- Neural macine translation soft alignment visualisations for web and command line☆72Updated 4 years ago
- Tool to fix bitexts and tag near-duplicates for removal☆33Updated 2 months ago
- Efficient Low-Memory Aligner☆146Updated 9 months ago
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)☆66Updated last week
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Updated last year
- A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, …☆34Updated 6 years ago
- The Arborator software is aimed at collaboratively annotating dependency corpora.☆26Updated 6 years ago
- One million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.☆123Updated 6 years ago
- Alignment and annotation for comparable documents.☆22Updated 7 years ago
- Various utilities for processing the data.☆213Updated this week
- These are lists for a variety of languages containing words that are distinctive to each language.☆38Updated 3 years ago
- A High-Quality Multilingual Dataset for Structured Documentation Translation☆36Updated 6 months ago
- Appraise evaluation system for manual evaluation of machine translation output☆77Updated 4 years ago