ALM-LAB / PACE
PACE (Podcast AI for Chapters and Episodes) is a semantic search engine that helps you find the information you need, inter- and intra-podcasts (Project for the AssemblyAI Winter 2022 Hackathon).
☆14Updated 2 years ago
Alternatives and similar repositories for PACE:
Users that are interested in PACE are comparing it to the libraries listed below
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆140Updated last year
- Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023☆50Updated last year
- ☆153Updated last year
- ITALIC: An ITALian Intent Classification Dataset☆11Updated last year
- Joint speech-language model - respond directly to audio!☆30Updated 8 months ago
- Speaker Diarization with Transformers☆64Updated 8 months ago
- Repository contains code to fine-tune WhisperASR model☆23Updated 2 years ago
- ☆62Updated 6 months ago
- ☆348Updated 10 months ago
- [Interspeech 2024] Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation☆105Updated last week
- ☆65Updated 2 months ago
- (WACV 2025) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, Hindi, B…☆81Updated 4 months ago
- Pre-training BART model for the Italian Language☆15Updated 2 years ago
- ☆18Updated 2 years ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆20Updated 5 months ago
- ☆13Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆135Updated last year
- Simple Diarization model☆46Updated last year
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"☆55Updated 3 months ago
- Collection of scripts from mHuBERT-147.☆24Updated 2 months ago
- Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities☆102Updated last month
- Repository for the LLM course☆14Updated last month
- ☆41Updated 2 years ago
- A simple, consistent and extendable toolkit for IndicTrans2☆21Updated 2 weeks ago
- Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆111Updated 2 months ago
- A python package for whisper normalizer☆46Updated 2 months ago
- ☆16Updated 8 months ago
- ☆269Updated 7 months ago
- Speaker diarization model☆23Updated last year
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages☆98Updated 3 months ago