crux82 / squad-itLinks
A large scale dataset for Question Answering in Italian
☆27Updated 6 years ago
Alternatives and similar repositories for squad-it
Users that are interested in squad-it are comparing it to the libraries listed below
Sorting:
- GilBERTo: A pretrained language model based on RoBERTa for Italian☆73Updated 5 years ago
- UmBERTo: an Italian Language Model trained with Whole Word Masking.☆106Updated 2 years ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆69Updated 3 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆81Updated last year
- 🇮🇹 Italian BERT and ELECTRA models (incl. evaluation)☆18Updated 2 years ago
- ☆44Updated 3 years ago
- ☆64Updated 2 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆38Updated 3 years ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆156Updated 2 years ago
- AlBERTo the first italian BERT model for Twitter languange understanding☆72Updated 5 years ago
- A python true casing utility that restores case information for texts☆89Updated 2 years ago
- spaCy + UDPipe☆162Updated 3 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆65Updated 2 years ago
- A minimal, pure Python library to interface with CoNLL-U format files.☆151Updated 2 years ago
- A High-level Library for Named Entity Recognition in Python.☆24Updated last year
- A spaCy custom component that extracts and normalizes temporal expressions☆55Updated 2 years ago
- Creating super-parallel corpora of more than 1500+ unique languages for NLP research☆34Updated 2 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- ☆110Updated last year
- Data for the HIPE 2022 shared task.☆20Updated last year
- This repository contains all manually labeled data from the GermEval-2018 shared task.☆29Updated 6 years ago
- A french sequence to sequence pretrained model☆62Updated 2 years ago
- ☆15Updated 4 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Updated 4 years ago
- A simple neural truecaser written in pytorch and allennlp.☆33Updated last year
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them☆38Updated 3 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156Updated last year
- This is a german ELMo deep contextualized word representation. It is trained on a special German Wikipedia Text Corpus.☆28Updated 5 years ago
- MT Evaluation in Many Languages via Zero-Shot Paraphrasing☆101Updated last year
- ☆25Updated 5 years ago