crux82 / squad-it
A large-scale dataset for Question Answering in Italian
☆27Updated 6 years ago
Alternatives and similar repositories for squad-it
Users who are interested in squad-it are comparing it to the repositories listed below
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆80Updated 11 months ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆65Updated 2 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Updated 3 years ago
- AlBERTo: the first Italian BERT model for Twitter language understanding☆72Updated 4 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆155Updated last year
- A High-level Library for Named Entity Recognition in Python.☆23Updated last year
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated 6 months ago
- BERT and ELECTRA models trained on Europeana Newspapers☆38Updated 3 years ago
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020…☆33Updated 4 years ago
- spaCy + UDPipe☆161Updated 3 years ago
- GilBERTo: A pretrained language model based on RoBERTa for Italian☆73Updated 5 years ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆69Updated 3 years ago
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹☆30Updated 11 months ago
- A Python truecasing utility that restores case information for texts☆88Updated 2 years ago
- IR-BERT at TREC 2020: Leveraging BERT for Semantic Search in Background Linking☆14Updated 3 years ago
- UmBERTo: an Italian Language Model trained with Whole Word Masking.☆105Updated 2 years ago
- Sentence transformers models for spaCy☆107Updated 2 years ago
- On Generating Extended Summaries of Long Documents☆78Updated 4 years ago
- NLP @ TU Wien☆18Updated 5 months ago
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization.☆34Updated 2 years ago
- A coreference evaluation package for the CoNLL and ARRAU datasets☆40Updated 4 years ago
- A Python tool for building large-scale Wikipedia-based Information Retrieval datasets☆46Updated 4 years ago
- Tool for parsing and converting various span encoding schemes.☆23Updated last year
- Reimplementation of a BERT-based model (Shi et al., 2019), currently the state-of-the-art for English SRL. This model also implements pred…☆70Updated 3 years ago
- BERT models for many languages created from Wikipedia texts☆33Updated 5 years ago