kindlychung / af-aligner
LF Aligner helps translators create translation memories from texts and their translations. It relies on Hunalign for automatic sentence pairing. Input: txt, doc, docx, rtf, pdf, html. Output: tab delimited txt, TMX and xls. With web features.
☆11Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for af-aligner
- Sentence aligner☆108Updated 3 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆33Updated 2 years ago
- IPA Pronunciation Dictionaries in DSL format☆39Updated 7 years ago
- Bitextor generates translation memories from multilingual websites☆291Updated last week
- repo for Tibetan corpora☆21Updated last year
- Extract and align grammar patterns from English sentences.☆54Updated last year
- Editor for aligned parallel texts (personal desktop application).☆19Updated 4 years ago
- Interface for using TTS and vocoder models in the form of a text editor☆19Updated 2 years ago
- Tools for professional translators running GNU/Linux☆27Updated 2 years ago
- An algorithm to check grammatical correctness and suggest improvements.☆65Updated 7 years ago
- TMX Editor written in Java and TypeScript☆41Updated last month
- A cloud-based, open-source system for writing and publishing dictionaries.☆86Updated 10 months ago
- Creates interlinearized versions of books (EPUB, MOBI, etc), adding "subtitles" with translations under each word in the text.☆22Updated 4 years ago
- Segmenting a given document using recursive xy-cut algorithm.☆12Updated 6 years ago
- Pronunciation dictionaries for several languages, based on Wiktionary data.☆17Updated 2 years ago
- Translation Memory Open-source Purifier☆33Updated 2 years ago
- Python interface to ISLEX, an English IPA pronunciation dictionary with syllable and stress marking.☆47Updated 11 months ago
- Collaborative on-line editor for aligned parallel texts.☆12Updated 2 years ago
- Данные 6-го издания «Грамматического словаря русского языка» А. А. Зализняка (2010) в виде текстовых файлов☆17Updated 2 months ago
- Extracts plain text, language identification and more metadata from WARC records☆20Updated 3 months ago
- Multilingual sentence alignment using sentence embeddings☆101Updated 3 weeks ago
- ☆30Updated 5 months ago
- The Open Parallel Corpus☆57Updated 2 weeks ago
- Translate HTML using Argos Translate☆49Updated last year
- Uyghur Single Speaker Speech Dataset. ウイグル語音声データセット☆22Updated 2 years ago
- Arabic Transliteration in Python☆33Updated 11 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- UDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms.☆26Updated 2 months ago
- Jason Riggle's chart of phonological features in JSON format + extras☆49Updated 4 months ago
- Read-only unofficial mirror of OpenFst☆43Updated 2 years ago