dbdmg / llm
Repository for the LLM course
☆11Updated this week
Related projects ⓘ
Alternatives and complementary repositories for llm
- ITALIC: An ITALian Intent Classification Dataset☆11Updated 11 months ago
- Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation☆12Updated last year
- PACE (Podcast AI for Chapters and Episodes) is a semantic search engine that helps you find the information you need, inter- and intra-po…☆13Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆19Updated 2 months ago
- A merged version of multiple open-source German speech datasets.☆30Updated 6 months ago
- Pre-training BART model for the Italian Language☆15Updated last year
- African accented clinical and general domain TTS☆9Updated 5 months ago
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆10Updated 4 months ago
- ☆11Updated 2 years ago
- ☆14Updated last year
- ☆37Updated 11 months ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆134Updated 10 months ago
- Collection of scripts from mHuBERT-147.☆22Updated this week
- ☆54Updated this week
- Library for pruning experts per language pair in NLLB-200☆30Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated 9 months ago
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹☆30Updated 5 months ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆12Updated this week
- babyLM WhisBERT code☆17Updated 5 months ago
- scipts for working with open.bible data☆23Updated 2 years ago
- The official code for the SALMon🍣 benchmark☆40Updated 2 months ago
- Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities☆80Updated 3 months ago
- EMO-SUPERB submission☆28Updated 2 months ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆13Updated last year
- ☆38Updated 2 years ago
- ☆11Updated last year
- ☆85Updated 7 months ago
- This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.☆18Updated 11 months ago
- ☆40Updated 2 years ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆53Updated 3 months ago