lyeoni / prenlp
Preprocessing Library for Natural Language Processing
☆160Updated last year
Related projects ⓘ
Alternatives and complementary repositories for prenlp
- A tutorial of pertaining Bert on your own dataset using google TPU☆44Updated 4 years ago
- Package for controllable summarization☆78Updated last year
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆49Updated 4 years ago
- architectures and pre-trained models for long document classification.☆154Updated 3 years ago
- A collection of resources on using BERT (https://arxiv.org/abs/1810.04805 ) and related Language Models in production environments.☆94Updated 3 years ago
- Minimal Interactive Attention Visualization☆138Updated 4 years ago
- Named Entity Recognition on CoNLL dataset using BiLSTM+CRF implemented with Pytorch☆42Updated 5 years ago
- Self-supervised NER prototype - updated version (69 entity types - 17 broad entity groups). Uses pretrained BERT models with no fine tuni…☆80Updated 2 years ago
- Semantic search using Transformers and others☆110Updated 4 years ago
- Exploring the simple sentence similarity measurements using word embeddings☆100Updated 2 months ago
- A Corpus for Multilingual Document Classification in Eight Languages.☆152Updated 2 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks☆193Updated 2 years ago
- Simple State-of-the-Art BERT-Based Sentence Classification with Keras / TensorFlow 2. Built with HuggingFace's Transformers.☆198Updated 5 months ago
- a Fairseq fork for sequence tagging/labeling tasks☆31Updated 4 years ago
- ☆73Updated 6 years ago
- Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers☆159Updated 4 years ago
- Scripts to train a bidirectional LSTM with knowledge distillation from BERT☆157Updated 4 years ago
- This repository contains various ways to calculate sentence vector similarity using NLP models☆200Updated 4 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆363Updated 2 years ago
- ⛔ [NOT MAINTAINED] A web-based annotator for closed-domain question answering datasets with SQuAD format.☆88Updated last year
- Massively Multilingual Transfer for NER☆85Updated 3 years ago
- LM Pretraining with PyTorch/TPU☆132Updated 5 years ago
- Viewer for the 🤗 datasets library.☆83Updated 3 years ago
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of …☆61Updated 4 years ago
- reference pytorch code for named entity tagging☆86Updated 3 weeks ago
- A collection/reading-list of awesome Natural Language Processing papers sorted by date.☆33Updated 5 years ago
- Datasets I have created for scientific summarization, and a trained BertSum model☆114Updated 5 years ago
- Tensorflow and Keras implementation of the state of the art researches in Dialog System NLU☆98Updated 3 years ago
- Implementation of NeurIPS 19 paper: Paraphrase Generation with Latent Bag of Words☆123Updated 3 years ago
- Evaluation script for named entity recognition (NER) systems based on entity-level F1 score.☆71Updated 3 years ago