RiTA-nlp / ITALICLinks

ITALIC: An ITALian Intent Classification Dataset

☆14

Alternatives and similar repositories for ITALIC

Users that are interested in ITALIC are comparing it to the libraries listed below

Sorting:

hlt-mt / Speech-MASSIVE
Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…
☆22Updated 10 months ago
aixplain / NoRefER
☆16Updated last year
german-asr / megs
A merged version of multiple open-source German speech datasets.
☆31Updated last year
mt-upc / SHAS
SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
☆38Updated 2 years ago
Splend1d / T5lephone
Code for T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5
☆19Updated 2 years ago
hlt-mt / FBK-fairseq
Repository containing the open source code of works published at the FBK MT unit.
☆46Updated last week
HLasse / multidiagnosis-speech
☆11Updated 2 years ago
besacier / ASR2022
☆56Updated 2 years ago
huangruizhe / ConEC
☆12Updated last year
koudounasalkis / voc2vec
This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.
☆34Updated 2 months ago
sigmorphon / 2022SegmentationST
SIGMORPHON 2022 Shared Task on Morpheme Segmentation
☆26Updated 2 years ago
asappresearch / slue-toolkit
A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O…
☆65Updated last year
tuanct1997 / Federated-Learning-ASR-based-on-wav2vec-2.0
☆19Updated last year
xinjli / phonepiece
phone inventory library
☆16Updated 2 years ago
qcri / e-wer
Word Error Rate Estimation
☆13Updated 4 years ago
Speech-Lab-IITM / CCC-wav2vec-2.0
Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…
☆21Updated last year
xinjli / ucla-phonetic-corpus
Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION
☆42Updated 2 years ago
dayanavivolab / s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
☆10Updated last year
amritkromana / disfluency_detection_from_audio
☆22Updated 10 months ago
umbertocappellazzo / Llama-AVSR
[ICASSP 2025] Official Pytorch implementation of "Large Language Models are Strong Audio-Visual Speech Recognition Learners".
☆25Updated 3 months ago
IamAdiSri / hf-trim
Reduce the size of pretrained Hugging Face models via vocabulary trimming.
☆45Updated 2 years ago
farisalasmary / wav2vec2-kenlm
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
☆75Updated 3 years ago
Labbeti / aac-metrics
Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.
☆51Updated 2 months ago
RuABraun / texterrors
☆37Updated 2 months ago
asappresearch / wav2seq
Official code for Wav2Seq
☆96Updated 2 years ago
patrickvonplaten / Wav2Vec2_PyCTCDecode
Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode
☆111Updated 2 years ago
xinjli / asr2k
asr2k
☆50Updated last year
ashi-ta / speechGLUE
SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.
☆13Updated 2 years ago
pzelasko / kaldialign
Python wrappers for Kaldi Levenshtein's distance and alignment code.
☆67Updated last month
voidful / SpeechMix
Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together
☆47Updated 2 years ago