loretoparisi / fastLangID

Stand-alone Language Identification for Node.js JavaScript based on FastText

☆7

Alternatives and similar repositories for fastLangID

Users who are interested in fastLangID are comparing it to the libraries listed below.

transducens / linguacrawl
Crawling engine that crawls a set of top-level domains looking for documents in a list of languages
☆11 Updated last year
vincent9514 / Text-Variant-Generation
📄Neural Sentential Paraphrase Generation to Augment Chatbot Training Dataset
☆21 Updated 2 years ago
oscar-project / goclassy
An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.
☆86 Updated 4 years ago
davidsbatista / lexicons
Dictionaries of names, surnames, acronyms and their extensions, stop-words, etc., which I gathered for different experiments.
☆28 Updated 8 years ago
krandiash / gpt3-nli
Training a model without a dataset for natural language inference (NLI)
☆25 Updated 4 years ago
EtienneAb3d / OpenNeuroSpell
OpenNeuroSpell contains parts of NeuroSpell (http://neurospell.com/en.php) released as open-source. More code will be published as soon a…
☆20 Updated 8 months ago
shashiongithub / Split-and-Rephrase
The WebSplit Benchmark introducing "Split and Rephrase" task
☆63 Updated 6 years ago
MilaNLProc / bertlang
A web interface to understand language-specific BERT-models
☆18 Updated last year
AudayBerro / automatedParaphrase
Automated paraphrase generation
☆36 Updated 2 years ago
jwieting / para-nmt-50m
Pre-trained models and code and data to train and use models from "Pushing the Limits of Paraphrastic Sentence Embeddings with Millions o…
☆102 Updated last year
utkd / encdecmodel-hf
☆34 Updated 4 years ago
wangcongcong123 / ttt
A package for fine-tuning Transformers with TPUs, written in TensorFlow 2.0+
☆38 Updated 4 years ago
ghaddarAbs / WiNER
☆33 Updated 3 years ago
Oneplus / Tweebank
A collection of English tweets annotated in Universal Dependencies.
☆39 Updated 3 years ago
wietsedv / gpt2-recycle
As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)
☆48 Updated 3 years ago
GorkaUrbizu / Coreference-Corpora-Resources
List of corpora annotated for coreference for different languages
☆17 Updated 11 months ago
mayhewsw / pytorch-truecaser
A simple neural truecaser written in PyTorch and AllenNLP.
☆33 Updated last year
writer / fitbert
Use BERT to Fill in the Blanks
☆83 Updated 3 years ago
loretoparisi / tensorflow-node-examples
TensorFlow Node.js Examples
☆25 Updated 2 years ago
bedapudi6788 / txt2txt
Extremely easy to use sequence to sequence library with attention, for text to text conversion tasks.
☆39 Updated 4 years ago
loretoparisi / fasttext.js
FastText for Node.js
☆196 Updated 2 years ago
TurkuNLP / wikibert
BERT models for many languages created from Wikipedia texts
☆33 Updated 5 years ago
Geotrend-research / smaller-transformers
Load What You Need: Smaller Multilingual Transformers for PyTorch and TensorFlow 2.0.
☆103 Updated 3 years ago
google-research-datasets / wiki-split
One million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.
☆123 Updated 6 years ago
stefan-it / german-gpt2
German GPT-2 model
☆32 Updated 3 years ago
indix / whatthelang
Lightning Fast Language Prediction 🚀
☆167 Updated 6 years ago
dbpedia / neural-qa
📚 A Neural QA Model for DBpedia using Neural SPARQL Machines.
☆85 Updated last year
yanaiela / num_fh
Numeric fused-head identification and resolution
☆33 Updated 5 years ago
TurkuNLP / ocr-correction
Post-processing OCR errors with seq2seq models
☆28 Updated 4 years ago
anoopkunchukuttan / indic_nlp_resources
Resources to go with the Indic NLP Library
☆73 Updated 3 years ago