saffsd / langid.c

Pure C natural language identifier with support for 97 languages

☆26

Alternatives and similar repositories for langid.c

Users who are interested in langid.c are comparing it to the libraries listed below.

seomoz / simhash-cpp
Simhashing in C++
☆136 Updated 2 years ago
hiyijian / doc2vec
C++ implementation of Tomas Mikolov's word/document embedding
☆106 Updated 8 years ago
OpenNMT / CTranslate
Lightweight C++ translator for OpenNMT Torch models (deprecated)
☆81 Updated 5 years ago
Jekub / Wapiti
A simple and fast discriminative sequence labeling toolkit (http://wapiti.limsi.fr)
☆256 Updated 3 years ago
marian-nmt / marian-dev
Fast Neural Machine Translation in C++ - development repository
☆284 Updated 6 months ago
CLD2Owners / cld2
Compact Language Detector 2
☆890 Updated 4 years ago
OpenNMT / Tokenizer
Fast and customizable text tokenization library with BPE and SentencePiece support
☆329 Updated 3 weeks ago
hassyGo / N3LP
C++ implementation for Neural Network-based NLP, such as LSTM machine translation!
☆86 Updated 8 years ago
ufal / unilib
Embeddable C++17 Unicode library offering UTF encodings, general category info, simple and full casing, normalization forms, and combinin…
☆80 Updated 4 months ago
yandex / faster-rnnlm
Faster Recurrent Neural Network Language Modeling Toolkit with Noise Contrastive Estimation and Hierarchical Softmax
☆564 Updated 3 years ago
redpony / cdec
Decoder, aligner, and model optimizer for statistical machine translation and other structured prediction models based on (mostly) contex…
☆185 Updated 5 years ago
d99kris / spacy-cpp
C++ wrapper library for the NLP library spaCy
☆107 Updated 2 years ago
AtheS21 / SymspellCPP
☆31 Updated 3 years ago
jdeng / word2vec
Word2Vec in C++11
☆406 Updated 9 years ago
andre-martins / TurboParser
A multilingual dependency parser based on linear programming relaxations.
☆115 Updated 6 years ago
frcchang / zpar
ZPar statistical parser. Universal language support (depending on the availability of training data), with language-specific features for…
☆135 Updated 9 years ago
mischasan / aho-corasick
A-C implementation in "C". Tight-packed (interleaved) state-transition matrix -- as fast as it gets, as small as it gets.
☆149 Updated 5 years ago
OpenNMT / Im2Text
Im2Text extension to OpenNMT
☆138 Updated 8 years ago
mitlm / mitlm
MIT Language Modeling Toolkit
☆118 Updated 6 years ago
proycon / colibri-core
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipg…
☆129 Updated this week
s-yata / marisa-trie
MARISA: Matching Algorithm with Recursively Implemented StorAge
☆594 Updated last week
BLLIP / bllip-parser
BLLIP reranking parser (also known as Charniak-Johnson parser, Charniak parser, Brown reranking parser) See http://pypi.python.org/pypi/b…
☆228 Updated 4 years ago
danieldk / citar-cxx
Citar part of speech tagger
☆39 Updated 9 years ago
senarvi / theanolm
TheanoLM is a recurrent neural network language modeling tool implemented using Theano
☆81 Updated last year
lephong / lstm-rnn
Long short-term memory for recursive neural network models
☆66 Updated 6 years ago
vimalmanohar / old-kaldi-git
This is not the official kaldi repository. It is better to fork https://github.com/kaldi-asr/kaldi or https://github.com/vimalmanohar/kal…
☆33 Updated 10 years ago
tlwg / libdatrie
☆181 Updated last month
s-yata / darts-clone
A clone of Darts (Double-ARray Trie System)
☆159 Updated 8 months ago
jerry2yu / ngrams
A package in C++ for character or word ngram analysis. It uses Ternary Search Tree instead of hashing table for faster ngram frequency co…
☆20 Updated 10 years ago
kpu / preprocess
Corpus preprocessing
☆99 Updated last year