openredact / nerwhalLinks
This is a prototype of a multi-lingual suite for named-entity recognition in Python.
☆21Updated last year
Alternatives and similar repositories for nerwhal
Users that are interested in nerwhal are comparing it to the libraries listed below
Sorting:
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated last year
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆45Updated last year
- Combining encoder-based language models☆11Updated 4 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆21Updated 2 years ago
- ☆30Updated 3 years ago
- Learning BPE embeddings by first learning a segmentation model and then training word2vec☆19Updated 3 years ago
- A text augmentation tool for named entity recognition.☆54Updated 4 years ago
- spaCy match and replace, maintaining conjugation☆36Updated 3 years ago
- Custom Natural Language Processing with big and small models 🌲🌱☆66Updated 4 years ago
- Data Programming by Demonstration (DPBD) for Document Classification☆35Updated 4 years ago
- ☆34Updated 2 years ago
- ☆43Updated 2 years ago
- Model for predicting categories of entities by its mentions☆31Updated 4 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 3 years ago
- BERT models for many languages created from Wikipedia texts☆33Updated 5 years ago
- Implementation of Nested Named Entity Recognition using Flair☆24Updated 4 years ago
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of …☆62Updated 5 years ago
- Hyperparameter search for AllenNLP - powered by Ray TUNE☆28Updated 10 months ago
- Repository for the paper "Named Entity Recognition for Entity Linking: What Works and What's Next" (EMNLP 2021).☆75Updated 3 years ago
- Source code for our AAAI 2020 paper P-SIF: Document Embeddings using Partition Averaging☆35Updated 5 years ago
- Data programming by demonstration for information extraction and span annotation☆34Updated 4 years ago
- ☆19Updated 6 years ago
- Keras Implementation of Flair's Contextualized Embeddings☆26Updated 4 years ago
- Source code and data for Like a Good Nearest Neighbor☆30Updated last year
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆35Updated last year
- ☆69Updated 4 years ago
- Open source library for few shot NLP☆78Updated 2 years ago
- Tower Parse: Low-Resource Dependency Parsing via Hierarchical Source Selection☆15Updated 4 years ago
- An open-source NLP library: fast text cleaning and preprocessing☆23Updated 4 years ago
- FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction☆24Updated 3 years ago