ALM-LAB / PACE
PACE (Podcast AI for Chapters and Episodes) is a semantic search engine that helps you find the information you need, inter- and intra-podcasts (Project for the AssemblyAI Winter 2022 Hackathon).
☆13Updated last year
Related projects ⓘ
Alternatives and complementary repositories for PACE
- Repository for the LLM course☆11Updated this week
- ITALIC: An ITALian Intent Classification Dataset☆11Updated 11 months ago
- Pre-training BART model for the Italian Language☆15Updated last year
- Joint speech-language model - respond directly to audio!☆30Updated 6 months ago
- ☆61Updated 3 months ago
- (WACV 2025) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, Hindi, B…☆81Updated 2 months ago
- ☆54Updated this week
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆134Updated 10 months ago
- Speaker Diarization with Transformers☆59Updated 6 months ago
- babyLM WhisBERT code☆17Updated 5 months ago
- ☆16Updated last month
- Code for Zero-Shot Tokenizer Transfer☆115Updated last month
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆53Updated 3 months ago
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆141Updated last year
- [Interspeech 2024] Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation☆80Updated this week
- Repository contains code to fine-tune WhisperASR model☆23Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆133Updated last year
- ☆40Updated 2 years ago
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023☆47Updated last year
- GlotCC Dataset and Pipline -- NeurIPS 2024☆16Updated 3 weeks ago
- Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆92Updated 3 weeks ago
- Collection of scripts from mHuBERT-147.☆22Updated this week
- ☆152Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆19Updated 2 months ago
- Suite for phonetic word embeddings, especially their evaluation and baseline models.☆24Updated 3 weeks ago
- ☆347Updated 8 months ago
- ☆87Updated 10 months ago
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…☆11Updated 8 months ago
- A repository containing the code for translating popular LLM benchmarks to German.☆24Updated last year
- Repository containing the open source code of works published at the FBK MT unit.☆42Updated 4 months ago