crux82 / squad-itLinks
A large scale dataset for Question Answering in Italian
☆27Updated 7 years ago
Alternatives and similar repositories for squad-it
Users that are interested in squad-it are comparing it to the libraries listed below
Sorting:
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆69Updated 4 years ago
- UmBERTo: an Italian Language Model trained with Whole Word Masking.☆110Updated 2 years ago
- 🇮🇹 Italian BERT and ELECTRA models (incl. evaluation)☆18Updated 3 years ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆156Updated 3 years ago
- GilBERTo: A pretrained language model based on RoBERTa for Italian☆73Updated 5 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆81Updated last year
- ☆45Updated 3 years ago
- AlBERTo the first italian BERT model for Twitter languange understanding☆72Updated 5 years ago
- A python true casing utility that restores case information for texts☆89Updated 3 years ago
- ☆64Updated 2 years ago
- Fast computation of Krippendorff's alpha agreement measure in Python.☆153Updated last week
- ☆15Updated 4 years ago
- ☆50Updated last year
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆120Updated 4 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆64Updated 3 years ago
- An easy-to-use library to extract indices from texts.☆29Updated 4 years ago
- spaCy + UDPipe☆163Updated 3 years ago
- LASER multilingual sentence embeddings as a pip package☆225Updated 2 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 3 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Updated 4 years ago
- A minimal, pure Python library to interface with CoNLL-U format files.☆153Updated last week
- ☆35Updated 3 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆96Updated 8 months ago
- BERT fine-tuning for POS tagging task (Keras)☆79Updated 6 years ago
- Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data☆160Updated 2 years ago
- Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/☆193Updated 2 years ago
- MT Evaluation in Many Languages via Zero-Shot Paraphrasing☆102Updated last year
- An initiative to collect and distribute resources for co-reference resolution in a unified standard.☆25Updated last year
- Text Extraction Formulation + Feedback Loop for state-of-the-art WSD (EMNLP 2021)☆54Updated 3 years ago
- Easier Automatic Sentence Simplification Evaluation☆165Updated 2 years ago