SapienzaNLP / ita-bench
A collection of Italian benchmarks for LLM evaluation
☆30Updated last week
Alternatives and similar repositories for ita-bench:
Users that are interested in ita-bench are comparing it to the libraries listed below
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.☆27Updated 7 months ago
- Sentiment analysis and emotion classification for Italian using BERT (fine-tuning). Published at the WASSA workshop (EACL2021).☆26Updated 9 months ago
- This repository hosts materials from the CLiC-IT 2023 tutorial☆30Updated 10 months ago
- Attribute statements generated by LLMs to preceding tokens using attention weights.☆12Updated this week
- ☆15Updated 4 years ago
- UmBERTo: an Italian Language Model trained with Whole Word Masking.☆105Updated 2 years ago
- Word Sense Linking model is designed to identify and disambiguate spans of text to their most suitable senses from a reference inventory.☆11Updated 8 months ago
- FENICE (Factuality Evaluation of Summarization based on Natural Language Inference and Claim Extraction) is a factuality-oriented metric …☆18Updated 4 months ago
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹☆30Updated 10 months ago
- Find informative examples to efficiently (human)-evaluate NLG models.☆10Updated last month
- This repository provides the source code used to automatically generate the book summarization datasets described in the paper titled "Ec…☆11Updated last week
- Code and models for the COLING2020 paper "Bridging the Gap in Multilingual Semantic Role Labeling: a Language-Agnostic Approach".☆12Updated 2 years ago
- A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.☆21Updated 2 weeks ago
- ☆37Updated last year
- Public repository for SemEval 2023 - Task 10 - Explainable Detection of Online Sexism (EDOS)☆22Updated 2 years ago
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"☆48Updated 2 years ago
- ☆35Updated 3 years ago
- SciRepEval benchmark training and evaluation scripts☆73Updated 11 months ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.☆52Updated last year
- A python package for benchmarking interpretability techniques on Transformers.☆212Updated 6 months ago
- Resources for cultural NLP research☆92Updated this week
- A lightweight Python library for constructing, processing, and visualizing constituent trees.☆66Updated 3 months ago
- Camoscio: An Italian instruction-tuned language model based on LLaMA☆127Updated last year
- ☆8Updated 2 years ago
- Data and code for "Nibbling at the Hard Core of Word Sense Disambiguation" (ACL 2022).☆15Updated 3 years ago
- ☆49Updated last week
- Data for the HIPE 2022 shared task.☆17Updated last year
- Dataset containing scroll interactions of 598 partcipants reading advanced and elementary texts from the OneStopEnglish corpus☆16Updated 3 years ago
- Package to extract connotation frames☆84Updated last year
- Utility for behavioral and representational analyses of Language Models☆138Updated this week