openredact / nerwhal
This is a prototype of a multi-lingual suite for named-entity recognition in Python.
☆21Updated 11 months ago
Alternatives and similar repositories for nerwhal:
Users that are interested in nerwhal are comparing it to the libraries listed below
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆44Updated 11 months ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- Combining encoder-based language models☆11Updated 3 years ago
- ☆30Updated 2 years ago
- A few-shot learning method based on siamese networks.☆28Updated 2 years ago
- Learning BPE embeddings by first learning a segmentation model and then training word2vec☆19Updated 2 years ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated 3 months ago
- Implementation, trained models and result data for the paper "Pairwise Multi-Class Document Classification for Semantic Relations between…☆31Updated last year
- Neural-IR-Explorer: A Content-Focused Tool to Explore Neural Re-Ranking Results☆33Updated 5 years ago
- Large-scale query-focused multi-document Summarization dataset☆10Updated 3 years ago
- Data Programming by Demonstration (DPBD) for Document Classification☆35Updated 3 years ago
- An open-source NLP library: fast text cleaning and preprocessing☆23Updated 3 years ago
- Code for Paper "Target-oriented Fine-tuning for Zero-Resource Named Entity Recognition"☆21Updated 2 years ago
- Code and data for the paper "Soft Gazetteers for Low-resource Named Entity Recognition"☆19Updated 4 years ago
- A PyPI package for easy text annotation in a Jupyter Notebook.☆28Updated 3 years ago
- Source code for our AAAI 2020 paper P-SIF: Document Embeddings using Partition Averaging☆34Updated 4 years ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 3 years ago
- Implementation of Nested Named Entity Recognition using Flair☆24Updated 3 years ago
- ☆22Updated 3 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- ☆19Updated 5 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Updated 3 years ago
- Hyperparameter search for AllenNLP - powered by Ray TUNE☆28Updated last month
- Converter from UD-trees to BART representation☆36Updated last year
- ☆16Updated last year
- ☆34Updated last year
- This repository contains the DFKI Product Corpus, a dataset of 174 documents annotated for product and company named entities, and the re…☆12Updated 7 months ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated 4 months ago
- BERT models for many languages created from Wikipedia texts☆33Updated 4 years ago
- code for the paper "Cluster & Tune: Boost Cold Start Performance in Text Classification" for ACL2022☆28Updated 2 years ago