mhagiwara / nanigonetLinks
NanigoNet — Language detector for code-mixed input supporting 150+19 human+programming languages using deep neural networks
☆71Updated 2 years ago
Alternatives and similar repositories for nanigonet
Users that are interested in nanigonet are comparing it to the libraries listed below
Sorting:
- numeric fused-head identification and resolution☆33Updated 5 years ago
- 🚀 A demonstration of hyperparameter optimization using Optuna for models implemented with AllenNLP.☆16Updated 4 years ago
- An Interactive Tool for Scalable and Reproducible Error Analysis.☆108Updated 4 years ago
- A collection of selected of models built with AllenNLP.☆25Updated 5 years ago
- Code and data for segmentation experiments.☆20Updated 10 years ago
- A simple neural truecaser written in pytorch and allennlp.☆33Updated last year
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"☆40Updated 6 years ago
- Code for bidirectional sequence generation (BiSon) for generating from BERT pre-trained models.☆51Updated 5 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated 10 months ago
- BERT models for many languages created from Wikipedia texts☆33Updated 5 years ago
- LM Pretraining with PyTorch/TPU☆136Updated 5 years ago
- A Python implementation of the SimString, a simple and efficient algorithm for approximate string matching.☆124Updated last year
- Decoding platform for machine translation research☆55Updated 6 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".☆64Updated 5 years ago
- pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference☆62Updated 2 years ago
- Implementation of unsupervised smoothed inverse frequency (Best Paper, Repl4NLP @ ACL 2018)☆78Updated 6 years ago
- Automatic extraction of edited sentences from text edition histories.☆83Updated 3 years ago
- A Corpus for Multilingual Document Classification in Eight Languages.☆152Updated 3 years ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer☆39Updated 5 years ago
- Train transformer-based models.☆28Updated this week
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 4 years ago
- Incremental learning of word embeddings with context informativeness.☆94Updated 2 years ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆127Updated 4 years ago
- A dataset of atomic wikipedia edits containing insertions and deletions of a contiguous chunk of text in a sentence. This dataset contai…☆105Updated 6 years ago
- COMBO is jointly trained tagger, lemmatizer and dependency parser.☆35Updated 2 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆82Updated last year
- ☆17Updated 2 years ago
- A simple demo server for AllenNLP models.☆27Updated 2 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆137Updated 2 years ago
- Framework for weakly supervised deep sequence taggers, focused on named entity recognition☆78Updated 2 years ago