ALM-LAB / PACELinks
PACE (Podcast AI for Chapters and Episodes) is a semantic search engine that helps you find the information you need, inter- and intra-podcasts (Project for the AssemblyAI Winter 2022 Hackathon).
☆17Updated 3 years ago
Alternatives and similar repositories for PACE
Users that are interested in PACE are comparing it to the libraries listed below
Sorting:
- Joint speech-language model - respond directly to audio!☆30Updated last year
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆150Updated 2 years ago
- ☆62Updated last year
- ☆357Updated last year
- Speaker Diarization with Transformers☆70Updated 7 months ago
- ☆157Updated 2 years ago
- Official Repo for the Paper "AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution o…☆23Updated last year
- Speaker diarization model☆32Updated 2 years ago
- Joint speech-language model - respond directly to audio!☆372Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated 2 years ago
- ☆34Updated 2 years ago
- TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog☆63Updated last year
- A simple, consistent and extendable toolkit for IndicTrans2. (Pypi: https://pypi.org/project/indictranstoolkit)☆37Updated 6 months ago
- Collection of Open Source Speech Data☆164Updated 4 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆70Updated 3 months ago
- ☆323Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆216Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100Updated last year
- Audio search using Azure Cognitive Search☆26Updated 2 months ago
- vad☆25Updated 2 years ago
- Synthetic Dialog Generation and Analysis with LLMs☆124Updated this week
- Repository contains code to fine-tune WhisperASR model☆23Updated 3 years ago
- An Education Tutoring Chatbot based on Learning Science Principles powered by Large Language Models☆55Updated last year
- A streaming whisper server for on-prem transcription☆23Updated last year
- Transcription with speaker diarization pipeline☆98Updated 2 years ago
- QueryGod lets you interact with any API or database using natural language. Writing simple prompts you can chain together the execution o…☆14Updated 3 years ago
- Open TTS models, built for streaming on the edge☆45Updated 10 months ago
- proof of concept conversation orchestrator with a speech-language model☆20Updated last year
- Speaker diarization service☆25Updated 7 months ago
- (WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H…☆84Updated 6 months ago