mhagiwara / nanigonet
NanigoNet — Language detector for code-mixed input supporting 150+19 human+programming languages using deep neural networks
☆71Updated last year
Related projects: ⓘ
- GrammarTagger — A Neural Multilingual Grammar Profiler for Language Learning☆27Updated 3 years ago
- 🚀 A demonstration of hyperparameter optimization using Optuna for models implemented with AllenNLP.☆16Updated 3 years ago
- numeric fused-head identification and resolution☆33Updated 4 years ago
- Automatic extraction of edited sentences from text edition histories.☆80Updated 2 years ago
- ⚡️ AllenNLP plugin for adding subcommands to use Optuna, making hyperparameter optimization easy☆32Updated 2 years ago
- ☆73Updated 3 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆49Updated 3 years ago
- An Interactive Tool for Scalable and Reproducible Error Analysis.☆105Updated 3 years ago
- c++ mosestokenizer☆16Updated 6 months ago
- Robsut Wrod Reocginiton via semi-Character Recurrent Neural Network☆21Updated 6 years ago
- Doing things with embeddings☆64Updated 2 years ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆125Updated 3 years ago
- A simple neural truecaser written in pytorch and allennlp.☆31Updated 3 months ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆72Updated 2 months ago
- MILES is a multilingual text simplifier inspired by LSBert - A BERT-based lexical simplification approach proposed in 2018. Unlike LSBert…☆48Updated 3 years ago
- SDK for TEASPN, a framework and a protocol for integrated writing assistance environments☆61Updated last year
- Build a dialog dataset from online books in many languages☆71Updated last year
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of …☆61Updated 4 years ago
- Code and data for segmentation experiments.☆21Updated 9 years ago
- Viewer for the 🤗 datasets library.☆83Updated 3 years ago
- Language Modelling Makes Sense - WSD (and more) with Contextual Embeddings☆94Updated last year
- LM Pretraining with PyTorch/TPU☆131Updated 4 years ago
- Source code accompanying the KONVENS 2019 paper "Does BERT Make Any Sense? Interpretable Word Sense Disambiguation with Contextualized Em…☆61Updated 4 years ago
- ☆34Updated 3 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆131Updated last year
- BERT models for many languages created from Wikipedia texts☆34Updated 4 years ago
- jiant-dev☆28Updated 3 years ago
- ☆64Updated last year
- A dataset of atomic wikipedia edits containing insertions and deletions of a contiguous chunk of text in a sentence. This dataset contai…☆106Updated 5 years ago
- A collection of selected of models built with AllenNLP.☆25Updated 4 years ago