saffsd / langid.cLinks
Pure C natural language identifier with support for 97 languages
☆26Updated 8 years ago
Alternatives and similar repositories for langid.c
Users that are interested in langid.c are comparing it to the libraries listed below
Sorting:
- Simhashing in C++☆136Updated 2 years ago
- C++ implement of Tomas Mikolov's word/document embedding☆106Updated 8 years ago
- Lightweight C++ translator for OpenNMT Torch models (deprecated)☆81Updated 5 years ago
- A simple and fast discriminative sequence labeling toolkit ( http://wapiti.limsi.fr )☆256Updated 3 years ago
- Fast Neural Machine Translation in C++ - development repository☆284Updated 6 months ago
- Compact Language Detector 2☆890Updated 4 years ago
- Fast and customizable text tokenization library with BPE and SentencePiece support☆329Updated 3 weeks ago
- C++ implementation for Neural Network-based NLP, such as LSTM machine translation!☆86Updated 8 years ago
- Embeddable C++17 Unicode library offering UTF encodings, general category info, simple and full casing, normalization forms, and combinin…☆80Updated 4 months ago
- Faster Recurrent Neural Network Language Modeling Toolkit with Noise Contrastive Estimation and Hierarchical Softmax☆564Updated 3 years ago
- Decoder, aligner, and model optimizer for statistical machine translation and other structured prediction models based on (mostly) contex…☆185Updated 5 years ago
- C++ wrapper library for the NLP library spaCy☆107Updated 2 years ago
- ☆31Updated 3 years ago
- Word2Vec in C++ 11☆406Updated 9 years ago
- A multilingual dependency parser based on linear programming relaxations.☆115Updated 6 years ago
- ZPar statistical parser. Universal language support (depending on the availability of training data), with language-specific features for…☆135Updated 9 years ago
- A-C implementation in "C". Tight-packed (interleaved) state-transition matrix -- as fast as it gets, as small as it gets.☆149Updated 5 years ago
- Im2Text extension to OpenNMT☆138Updated 8 years ago
- MIT Language Modeling Toolkit☆118Updated 6 years ago
- Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipg…☆129Updated this week
- MARISA: Matching Algorithm with Recursively Implemented StorAge☆594Updated last week
- BLLIP reranking parser (also known as Charniak-Johnson parser, Charniak parser, Brown reranking parser) See http://pypi.python.org/pypi/b…☆228Updated 4 years ago
- Citar part of speech tagger☆39Updated 9 years ago
- TheanoLM is a recurrent neural network language modeling tool implemented using Theano☆81Updated last year
- long short-term memory for recursive neural network model☆66Updated 6 years ago
- This is not the official kaldi repository. It is better to fork https://github.com/kaldi-asr/kaldi or https://github.com/vimalmanohar/kal…☆33Updated 10 years ago
- ☆181Updated last month
- A clone of Darts (Double-ARray Trie System)☆159Updated 8 months ago
- A package in C++ for character or word ngram analysis. It uses Ternary Search Tree instead of hashing table for faster ngram frequency co…☆20Updated 10 years ago
- Corpus preprocessing☆99Updated last year