SapienzaNLP / ita-bench
A collection of Italian benchmarks for LLM evaluation
☆20 Updated last week
Related projects
Alternatives and complementary repositories for ita-bench
- Sentiment analysis and emotion classification for Italian using BERT (fine-tuning). Published at the WASSA workshop (EACL 2021). ☆24 Updated 4 months ago
- A Python package to run inference with HuggingFace language and vision-language checkpoints, wrapping many convenient features. ☆25 Updated last month
- This repository hosts materials from the CLiC-IT 2023 tutorial. ☆27 Updated 5 months ago
- Evaluation of language models on mono- or multilingual tasks. ☆74 Updated this week
- UmBERTo: an Italian Language Model trained with Whole Word Masking. ☆104 Updated last year
- ☆15 Updated 3 years ago
- The Word Sense Linking model is designed to identify spans of text and disambiguate them to their most suitable senses from a reference inventory. ☆11 Updated 2 months ago
- The 🌟ANITA project🌟 *(Advanced Natural-based interaction for the ITAlian language)* aims to provide Italian NLP researchers with an im… ☆13 Updated 2 months ago
- ☆37 Updated 10 months ago
- This repository provides the source code used to automatically generate the book summarization datasets described in the paper titled "Ec… ☆11 Updated last year
- A Python package for benchmarking interpretability techniques on Transformers. ☆211 Updated last month
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy. ☆103 Updated 6 months ago
- GilBERTo: A pretrained language model based on RoBERTa for Italian ☆73 Updated 4 years ago
- A spaCy custom component that extracts and normalizes temporal expressions ☆52 Updated last year
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹 ☆30 Updated 4 months ago
- Data and code for "Nibbling at the Hard Core of Word Sense Disambiguation" (ACL 2022). ☆15 Updated 2 years ago
- A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021. ☆20 Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆151 Updated 5 months ago
- Sentiment Corpus for Swedish 🇸🇪 Norwegian 🇳🇴 Danish 🇩🇰 Finnish 🇫🇮 (and English 🏴) ☆15 Updated 3 years ago
- Compass-aligned Distributional Embeddings. Align embeddings from different corpora ☆38 Updated last year
- A Scandinavian Benchmark for sentence embeddings ☆27 Updated last month
- Python for Natural Language Processing ☆19 Updated last week
- Get ready to meet Fauno - the Italian language model crafted by the RSTLess Research Group from the Sapienza University of Rome. ☆79 Updated last year
- ☆147 Updated 4 months ago
- Creating class-based TF-IDF matrices ☆82 Updated 2 years ago
- Camoscio: An Italian instruction-tuned language model based on LLaMA ☆126 Updated 11 months ago
- quica is a tool to run inter coder agreement pipelines in an easy and effective way. Multiple measures are run and results are collected… ☆23 Updated 4 years ago
- E3C is a freely available multilingual corpus (Italian, English, French, Spanish, and Basque) of semantically annotated clinical narrativ… ☆24 Updated 9 months ago
- SpanMarker for Named Entity Recognition ☆398 Updated 3 months ago
- Data for the HIPE 2022 shared task. ☆15 Updated 11 months ago