loretoparisi / fastLangID
Stand-alone Language Identification for Node.js JavaScript based on FastText
(★7, updated 6 years ago)
Alternatives and similar repositories for fastLangID
Users who are interested in fastLangID are comparing it to the libraries listed below.
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languages (★11, updated last year)
- Neural Sentential Paraphrase Generation to Augment Chatbot Training Dataset (★21, updated 2 years ago)
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline. (★86, updated 4 years ago)
- Dictionaries of names, surnames, acronyms and their extensions, stop-words, etc., which I gathered for different experiments. (★28, updated 8 years ago)
- Training a model without a dataset for natural language inference (NLI) (★25, updated 4 years ago)
- OpenNeuroSpell contains parts of NeuroSpell (http://neurospell.com/en.php) released as open-source. More code will be published as soon a… (★20, updated 8 months ago)
- The WebSplit Benchmark introducing the "Split and Rephrase" task (★63, updated 6 years ago)
- A web interface to understand language-specific BERT models (★18, updated last year)
- Automated paraphrase generation (★36, updated 2 years ago)
- Pre-trained models and code and data to train and use models from "Pushing the Limits of Paraphrastic Sentence Embeddings with Millions o… (★102, updated last year)
- (★34, updated 4 years ago)
- A package for fine-tuning Transformers with TPUs, written in TensorFlow 2.0+ (★38, updated 4 years ago)
- (★33, updated 3 years ago)
- A collection of English tweets annotated in Universal Dependencies. (★39, updated 3 years ago)
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021) (★48, updated 3 years ago)
- List of corpora annotated for coreference for different languages (★17, updated 11 months ago)
- A simple neural truecaser written in PyTorch and AllenNLP. (★33, updated last year)
- Use BERT to Fill in the Blanks (★83, updated 3 years ago)
- TensorFlow Node.js Examples (★25, updated 2 years ago)
- Extremely easy to use sequence to sequence library with attention, for text to text conversion tasks. (★39, updated 4 years ago)
- FastText for Node.js (★196, updated 2 years ago)
- BERT models for many languages created from Wikipedia texts (★33, updated 5 years ago)
- Load What You Need: Smaller Multilingual Transformers for PyTorch and TensorFlow 2.0. (★103, updated 3 years ago)
- One million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits. (★123, updated 6 years ago)
- German GPT-2 model (★32, updated 3 years ago)
- Lightning Fast Language Prediction (★167, updated 6 years ago)
- A Neural QA Model for DBpedia using Neural SPARQL Machines. (★85, updated last year)
- Numeric fused-head identification and resolution (★33, updated 5 years ago)
- Post-processing OCR errors with seq2seq models (★28, updated 4 years ago)
- Resources to go with the Indic NLP Library (★73, updated 3 years ago)