Kaleidophon / token2index
A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and TensorFlow.
☆51 Updated 2 months ago
Alternatives and similar repositories for token2index:
Users who are interested in token2index are comparing it to the libraries listed below:
- ☆74 Updated 3 years ago
- Hyperparameter search for AllenNLP - powered by Ray Tune ☆28 Updated last month
- Converter from UD-trees to BART representation ☆36 Updated 11 months ago
- BERT models for many languages created from Wikipedia texts ☆33 Updated 4 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly. ☆46 Updated 2 years ago
- Fine-tune transformers with pytorch-lightning ☆44 Updated 2 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering ☆38 Updated 3 years ago
- Numeric fused-head identification and resolution ☆33 Updated 5 years ago
- On Generating Extended Summaries of Long Documents ☆78 Updated 4 years ago
- A program to choose transfer languages for cross-lingual learning ☆72 Updated last year
- A simple neural truecaser written in PyTorch and AllenNLP. ☆33 Updated 8 months ago
- Data programming by demonstration for information extraction and span annotation ☆35 Updated 3 years ago
- An embeddable annotation tool for end-to-end cross-document coreference ☆41 Updated last year
- The accompanying code for "Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understandin… ☆21 Updated 5 years ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer ☆39 Updated 4 years ago
- Stacked Denoising BERT for Noisy Text Classification (Neural Networks 2020) ☆32 Updated 2 years ago
- Implementation of Nested Named Entity Recognition using Flair ☆24 Updated 3 years ago
- Self-supervised NER prototype - updated version (69 entity types - 17 broad entity groups). Uses pretrained BERT models with no fine tuni… ☆79 Updated 2 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations". ☆63 Updated 4 years ago
- Codebase for the Text-based NP Enrichment (TNE) paper ☆20 Updated 11 months ago
- Learning BPE embeddings by first learning a segmentation model and then training word2vec ☆19 Updated 2 years ago
- ☆17 Updated 2 years ago
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization. ☆44 Updated 9 months ago
- Reference PyTorch code for intent classification ☆44 Updated 4 months ago
- Format converter from one type of QA task dataset to another ☆39 Updated 6 years ago
- A Python tool for building large-scale Wikipedia-based Information Retrieval datasets ☆46 Updated 3 years ago
- Data Programming by Demonstration (DPBD) for Document Classification ☆35 Updated 3 years ago
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation" ☆40 Updated 6 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries ☆19 Updated 3 years ago
- This repository contains code to replicate the no-longer publicly available Toronto BookCorpus dataset ☆49 Updated 2 years ago