dbdmg / llm
Repository for the LLM course
☆11Updated last week
Related projects ⓘ
Alternatives and complementary repositories for llm
- ITALIC: An ITALian Intent Classification Dataset☆11Updated 11 months ago
- PACE (Podcast AI for Chapters and Episodes) is a semantic search engine that helps you find the information you need, inter- and intra-po…☆13Updated last year
- Pre-training BART model for the Italian Language☆15Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆19Updated 2 months ago
- Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑☆12Updated 6 months ago
- babyLM WhisBERT code☆17Updated 5 months ago
- ☆11Updated 2 years ago
- Repository containing the open source code of works published at the FBK MT unit.☆42Updated 4 months ago
- ☆14Updated last year
- ☆52Updated 2 weeks ago
- Generating artificial disfluencies from fluent text easily and promptly☆12Updated 2 years ago
- ☆37Updated 10 months ago
- Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation☆12Updated last year
- 🇮🇹 Italian BERT and ELECTRA models (incl. evaluation)☆17Updated 2 years ago
- African accented clinical and general domain TTS☆9Updated 5 months ago
- ☆11Updated 6 months ago
- Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities☆77Updated 3 months ago
- The official code for the SALMon🍣 benchmark☆40Updated last month
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆132Updated 9 months ago
- This repository implements SummaryMixing, a simpler, faster and much cheaper replacement to self-attention for automatic speech recogniti…☆111Updated last month
- This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.☆16Updated 10 months ago
- Collection of scripts from mHuBERT-147.☆22Updated 4 months ago
- Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆89Updated last week
- ☆12Updated 8 months ago
- ☆84Updated 7 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated 8 months ago
- A HuggingFace compatible Small Language Model trainer.☆73Updated 3 weeks ago
- Audio tokenization, in the fastest way possible!☆45Updated 2 months ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆37Updated last year
- ☆40Updated last year