saffsd / langid.js
An off-the-shelf client-side language identification module for JavaScript.
☆16, updated 11 years ago
Alternatives and similar repositories for langid.js
Users who are interested in langid.js compare it to the libraries listed below.
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation (☆200, updated 5 years ago)
- Neural Adaptive Machine Translation that adapts to context and learns from corrections. (☆350, updated 3 years ago)
- Universal Dependencies online documentation (☆287, updated this week)
- NLTK Contrib (☆169, updated last year)
- The Kyoto Text Analysis Toolkit for word segmentation and pronunciation estimation, etc. (☆212, updated 5 years ago)
- Transliteration data and models (☆56, updated 9 years ago)
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr… (☆70, updated last month)
- A fast and accurate POS and morphological tagging toolkit (EACL 2014) (☆149, updated 5 years ago)
- Crawler for linguistic corpora (☆213, updated 5 months ago)
- Sentence aligner (☆124, updated 4 years ago)
- A Corpus Data Retrieval Index using Lucene for Look-Ups (☆19, updated this week)
- Thot toolkit for statistical machine translation (☆53, updated 3 years ago)
- German Morphological Analyzer (☆51, updated 4 years ago)
- Microsoft Speech Language Translation (MSLT) Corpus (☆19, updated 8 years ago)
- Various utilities for processing the data. (☆217, updated this week)
- A phonetic matching library. Includes text utilities to do string comparisons on phonemes (the sound of the string), as opposed to charac… (☆165, updated 2 years ago)
- Bitextor generates translation memories from multilingual websites (☆300, updated last year)
- Fast Word Segmentation with Triangular Matrix (☆86, updated 4 years ago)
- Language Detection with Infinity-gram (☆230, updated 10 years ago)
- Automatically exported from code.google.com/p/foma (☆128, updated 4 months ago)
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder. (☆255, updated 3 years ago)
- Helsinki Finite-State Technology (library and application suite) (☆136, updated last month)
- Wiktionary parser tool for many language editions. (☆54, updated 3 years ago)
- Fast approximate strings search & spelling correction (☆60, updated 4 years ago)
- FreeLing project source code (☆260, updated 2 years ago)
- Faster, modernized fork of the language identification tool langid.py (☆60, updated last year)
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g… (☆113, updated last year)
- A code for transliterating (romanizing) Arabic text using the American Library Association - Library of Congress (ALA-LC) standard (☆49, updated 3 years ago)
- Program used to split text into segments (☆29, updated last year)
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http… (☆26, updated 8 years ago)