RiTA-nlp / ITALIC
ITALIC: An ITALian Intent Classification Dataset
☆11Updated last year
Alternatives and similar repositories for ITALIC:
Users that are interested in ITALIC are comparing it to the libraries listed below
- This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…☆14Updated 2 years ago
- A merged version of multiple open-source German speech datasets.☆31Updated 9 months ago
- ☆11Updated 2 years ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆20Updated 5 months ago
- Official implementation of QATCH: Benchmarking SQL-centric tasks with Table Representation Learning Models on Your Data☆25Updated last month
- ☆13Updated 11 months ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆77Updated 4 months ago
- A python package for benchmarking interpretability techniques on Transformers.☆213Updated 4 months ago
- This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.☆20Updated last year
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"☆47Updated 2 years ago
- Reduce the size of pretrained Hugging Face models via vocabulary trimming.☆43Updated 2 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 2 years ago
- A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O…☆64Updated 11 months ago
- ☆11Updated last year
- Repository containing the open source code of works published at the FBK MT unit.☆42Updated 2 weeks ago
- Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆112Updated 2 months ago
- A PyTorch Lightning Callback for pushing models to the Hugging Face Hub 🤗⚡️☆36Updated 2 years ago
- ☆35Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆73Updated 3 years ago
- Generating artificial disfluencies from fluent text easily and promptly☆13Updated 2 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆31Updated 3 years ago
- ☆41Updated 2 years ago
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆71Updated last year
- Multi-task modelling extensions for huggingface transformers☆13Updated last year
- ☆56Updated 2 years ago
- A survey of corpora for Germanic low-resource languages and dialects☆24Updated 2 months ago
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.☆25Updated 4 months ago
- ☆42Updated 3 years ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆20Updated 10 months ago
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹☆30Updated 7 months ago