sai-prasanna / lmproof
Language model powered proof reader for correcting contextual errors in natural language.
☆24Updated last year
Related projects: ⓘ
- numeric fused-head identification and resolution☆33Updated 4 years ago
- Implementation of Nested Named Entity Recognition using Flair☆24Updated 2 years ago
- ☆17Updated last year
- Data Programming by Demonstration (DPBD) for Document Classification☆36Updated 3 years ago
- Generate BERT vocabularies and pretraining examples from Wikipedias☆18Updated 4 years ago
- Statistics on multilingual datasets☆17Updated 2 years ago
- Bayesian Assessment of Hypotheses☆24Updated last year
- BERT models for many languages created from Wikipedia texts☆34Updated 4 years ago
- A simple neural truecaser written in pytorch and allennlp.☆31Updated 3 months ago
- ☆73Updated 3 years ago
- 🚀 A demonstration of hyperparameter optimization using Optuna for models implemented with AllenNLP.☆16Updated 3 years ago
- The NLPStatTest project☆11Updated 2 years ago
- Leaderboards are widely used in NLP and push the field forward. While leaderboards are a straightforward ranking of NLP models, this simp…☆16Updated 2 years ago
- c++ mosestokenizer☆16Updated 6 months ago
- Tool for parsing and converting various span encoding schemes.☆20Updated 8 months ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆49Updated 3 years ago
- Implementation of the paper 'Sentence Bottleneck Autoencoders from Transformer Language Models'☆17Updated 2 years ago
- NanigoNet — Language detector for code-mixed input supporting 150+19 human+programming languages using deep neural networks☆71Updated last year
- A collection of selected of models built with AllenNLP.☆25Updated 4 years ago
- Learning BPE embeddings by first learning a segmentation model and then training word2vec☆19Updated last year
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆43Updated 4 months ago
- Train transformer-based models.☆28Updated this week
- Differnable Readability Measure Regularizer for Neural Network Automatic Text Simplification☆24Updated last year
- Converter from UD-trees to BART representation☆37Updated 6 months ago
- Build a dialog dataset from online books in many languages☆71Updated last year
- ☆15Updated last year
- 💫 A spaCy package for Yohei Tamura's Rust tokenizations library☆27Updated 10 months ago
- Automatically detect errors in annotated corpora.☆45Updated last year
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languages☆11Updated 7 months ago
- Implementation of the paper "Fine-Tuning Transformers: Vocabulary Transfer" https://arxiv.org/pdf/2112.14569.pdf☆20Updated 2 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆63Updated last year