kindlychung / af-aligner
LF Aligner helps translators create translation memories from texts and their translations. It relies on Hunalign for automatic sentence pairing. Input: txt, doc, docx, rtf, pdf, html. Output: tab-delimited txt, TMX, and xls. Includes web features.
☆11 · Updated 9 years ago
Alternatives and similar repositories for af-aligner:
Users who are interested in af-aligner are comparing it to the repositories listed below:
- IPA Pronunciation Dictionaries in DSL format ☆39 · Updated 8 years ago
- Aligned bilingual word vectors for English and Chinese ☆11 · Updated 6 years ago
- Editor for aligned parallel texts (personal desktop application). ☆19 · Updated 4 years ago
- Tools for professional translators running GNU/Linux ☆27 · Updated 3 years ago
- TMX Editor written in Java and TypeScript ☆41 · Updated last month
- Sentence aligner ☆109 · Updated 3 years ago
- repo for Tibetan corpora ☆21 · Updated last year
- Website and documentation ☆19 · Updated 3 weeks ago
- Bitextor generates translation memories from multilingual websites ☆293 · Updated 2 months ago
- Translation demonstrator ☆29 · Updated 4 years ago
- Offline bilingual dictionaries made using data from Wiktionary ☆52 · Updated 9 years ago
- Interactive visualization of Wiktionary words and etymologies. ☆91 · Updated 2 months ago
- universal syllabification algorithms ☆44 · Updated 2 years ago
- Translation Memory Open-source Purifier ☆33 · Updated 2 years ago
- Global ASP - African Storybook Project for the World ☆14 · Updated 2 months ago
- Port of the OpenFST library to Windows ☆70 · Updated 8 months ago
- Spoken Cantonese from Hong Kong. ☆29 · Updated 2 months ago
- Bunachar Náisiúnta Moirfeolaíochta | Irish National Morphology Database ☆22 · Updated 7 months ago
- An open-access corpus of conversational bilingual speech in Cantonese and English ☆40 · Updated 2 years ago
- British English pronunciation dictionary ☆92 · Updated 7 years ago
- Convert Wiktionary entries to various formats such as StarDict or DB (MariaDB/MySQL). This used to be the main repository for this projec… ☆15 · Updated 2 years ago
- 😎 Curated list of Tibetan NLP projects ☆36 · Updated 4 years ago
- A set of pipelines for performing experiments on various NLP tasks with a focus on resource-poor/minority languages. ☆35 · Updated this week
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP ☆86 · Updated 3 years ago
- Cog is a tool for comparing languages using lexicostatistics and comparative linguistics techniques. ☆23 · Updated last year
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me… ☆40 · Updated 4 months ago
- Convert Arpabet to IPA. Arpabet is the set of phonemes used by the CMU Pronouncing Dictionary. IPA is the International Phonetic Alphabet… ☆43 · Updated 4 years ago
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http… ☆23 · Updated 7 years ago
- Easier analysis of large speech corpora ☆22 · Updated 3 years ago
- Library for extracting text and timestamps from multiple subtitle files (.ass, .ssa, .srt, .sub, .txt). ☆52 · Updated 9 months ago