mhagiwara / nanigonet
NanigoNet — Language detector for code-mixed input supporting 150+19 human+programming languages using deep neural networks
☆72Updated last year
Alternatives and similar repositories for nanigonet:
Users that are interested in nanigonet are comparing it to the libraries listed below
- GrammarTagger — A Neural Multilingual Grammar Profiler for Language Learning☆27Updated 3 years ago
- ☆64Updated 2 years ago
- numeric fused-head identification and resolution☆33Updated 5 years ago
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"☆40Updated 6 years ago
- 🚀 A demonstration of hyperparameter optimization using Optuna for models implemented with AllenNLP.☆16Updated 4 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆80Updated 8 months ago
- ⚡️ AllenNLP plugin for adding subcommands to use Optuna, making hyperparameter optimization easy☆33Updated 3 years ago
- Code and data for segmentation experiments.☆22Updated 10 years ago
- A dataset of atomic wikipedia edits containing insertions and deletions of a contiguous chunk of text in a sentence. This dataset contai…☆106Updated 5 years ago
- BERT models for many languages created from Wikipedia texts☆33Updated 4 years ago
- MILES is a multilingual text simplifier inspired by LSBert - A BERT-based lexical simplification approach proposed in 2018. Unlike LSBert…☆49Updated 3 years ago
- Converter from UD-trees to BART representation☆36Updated 11 months ago
- Topic Inference with Zeroshot models☆61Updated last year
- ☯️ AllenNLP training configurations for promising models on Named Entity Recognition. (BiLSTM-CRF, BiLSTM-CNN-CRF, BERT, BERT-CRF)☆15Updated 4 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".☆63Updated 4 years ago
- Robsut Wrod Reocginiton via semi-Character Recurrent Neural Network☆21Updated 7 years ago
- Tool for parsing and converting various span encoding schemes.☆22Updated last year
- An Interactive Tool for Scalable and Reproducible Error Analysis.☆106Updated 3 years ago
- doccano auto labeling pipeline helps doccano to annotate a document automatically.☆42Updated last year
- On Generating Extended Summaries of Long Documents☆78Updated 4 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆86Updated last month
- Source code for our AAAI 2020 paper P-SIF: Document Embeddings using Partition Averaging☆34Updated 4 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated 2 months ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆65Updated 2 years ago
- A Corpus for Multilingual Document Classification in Eight Languages.☆151Updated 2 years ago
- Easy-to-use text representations extraction library based on the Transformers library.☆32Updated 2 years ago
- ☆17Updated last year
- 🧪 Cutting-edge experimental spaCy components and features☆96Updated 10 months ago
- ☆33Updated 3 years ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆126Updated 4 years ago