AU-DIS / LSTM_langid
Source code for the Apple reproduction
☆32Updated 4 years ago
Alternatives and similar repositories for LSTM_langid
Users that are interested in LSTM_langid are comparing it to the libraries listed below
Sorting:
- A simple neural truecaser written in pytorch and allennlp.☆33Updated 11 months ago
- Tool to fix bitexts and tag near-duplicates for removal☆30Updated 3 months ago
- Implementation of pQRNN in PyTorch☆46Updated 3 years ago
- Repository for Findings of EMNLP 2020 "Context-aware Stand-alone Neural Spelling Correction"☆18Updated 4 years ago
- A tiny BERT for low-resource monolingual models☆31Updated 7 months ago
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of …☆60Updated 4 years ago
- Bilingual sentence similarity classifier using Tensorflow☆21Updated 5 years ago
- BERT models for many languages created from Wikipedia texts☆33Updated 4 years ago
- Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.☆50Updated 3 weeks ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- Multilingual Open Text☆25Updated last week
- Code for pre-training CharacterBERT models (as well as BERT models).☆34Updated 3 years ago
- COMBO is jointly trained tagger, lemmatizer and dependency parser.☆35Updated 2 years ago
- ☆33Updated 3 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆102Updated 2 years ago
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆54Updated 4 years ago
- General-Purpose Neural Networks for Sentence Boundary Detection☆73Updated 2 years ago
- Dual Encoders for State-of-the-art Natural Language Processing.☆61Updated 2 years ago
- zero-vocab or low-vocab embeddings☆18Updated 2 years ago
- We release a dataset based on Wikipedia sentences and the corresponding translations in 6 different languages along with the scores (scal…☆81Updated 3 years ago
- GrammarTagger — A Neural Multilingual Grammar Profiler for Language Learning☆27Updated 4 years ago
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…☆40Updated 8 months ago
- Code for Detecting language from text in python using fasttext☆13Updated 4 years ago
- Code for Paper "Target-oriented Fine-tuning for Zero-Resource Named Entity Recognition"☆21Updated 2 years ago
- fastlangid, the only language identification package that support cantonese (zh-yue), simplified (zh-hans) and traditional chinese (zh-ha…☆39Updated 2 years ago
- Rust-based Python wrapper for duckling library in Haskell☆25Updated 4 years ago
- ☆17Updated last year
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated 5 months ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago