crux82 / squad-it
A large scale dataset for Question Answering in Italian
โ26Updated 6 years ago
Alternatives and similar repositories for squad-it:
Users that are interested in squad-it are comparing it to the libraries listed below
- AlBERTo the first italian BERT model for Twitter languange understandingโ72Updated 4 years ago
- ๐ฎ๐น Italian BERT and ELECTRA models (incl. evaluation)โ18Updated 2 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doโฆโ80Updated 8 months ago
- UmBERTo: an Italian Language Model trained with Whole Word Masking.โ104Updated 2 years ago
- GilBERTo: A pretrained language model based on RoBERTa for Italianโ72Updated 5 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.โ151Updated 10 months ago
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" ๐ฎ๐นโ30Updated 9 months ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)โ48Updated 3 years ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.โ68Updated 3 years ago
- BERT models for many languages created from Wikipedia textsโ33Updated 4 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.โ102Updated 2 years ago
- spaCy + UDPipeโ161Updated 2 years ago
- Advanced NLP Workshop: word-sense disambiguation with RoBERTa and text summarization with BART (Machine Learning Milan)โ27Updated 4 years ago
- Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Taggingโ66Updated 2 years ago
- negate_sentence(A Python module that doesn't negate sentences.)โ30Updated 5 months ago
- Visualise, evaluate, and manage annotated dataโ33Updated 2 years ago
- Reimplementation of a BERT based model (Shi et al, 2019), currently the state-of-the-art for English SRL. This model implements also predโฆโ70Updated 3 years ago
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset"โ54Updated 3 years ago
- A simple neural truecaser written in pytorch and allennlp.โ33Updated 9 months ago
- spaCy match and replace, maintaining conjugationโ35Updated 2 years ago
- โ64Updated 2 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.โ59Updated 2 years ago
- โ35Updated 3 years ago
- The Italian NLP Toolโ70Updated last year
- Fine-tune transformers with pytorch-lightningโ44Updated 3 years ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.โ105Updated 11 months ago
- โ [NOT MAINTAINED] A web-based annotator for closed-domain question answering datasets with SQuAD format.โ88Updated 2 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer modelsโ65Updated 2 years ago
- ๐ธ KERMIT - A lightweight library to encode and interpret Universal Syntactic Embeddingsโ58Updated 2 years ago
- REMERGE - Multi-Word Expression discovery algorithmโ14Updated 2 years ago