vad
☆26Apr 3, 2023Updated 3 years ago
Alternatives and similar repositories for vap_turn_taking
Users that are interested in vap_turn_taking are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Datasets for turn-taking research☆19Dec 21, 2023Updated 2 years ago
- Voice Activity Projection Models: Self-supervised learning of Turn-taking Events☆103May 29, 2024Updated 2 years ago
- ☆16Aug 19, 2023Updated 2 years ago
- TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog☆69May 18, 2024Updated 2 years ago
- Online Detection of Action Start in Untrimmed, Streaming Videos☆12Sep 1, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆22Sep 3, 2018Updated 7 years ago
- Kanji Converter to Hiragana, Katakana, Roman alphabet.☆20Oct 30, 2025Updated 7 months ago
- Official Repo for the Paper "AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution o…☆26Jan 12, 2025Updated last year
- ☆18Apr 28, 2023Updated 3 years ago
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- This is an official PyTorch implementation of "Gesture2Vec: Clustering Gestures using Representation Learning Methods for Co-speech Gestu…☆27Feb 9, 2024Updated 2 years ago
- This repository contains the implementation of the paper "MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models".☆26May 28, 2025Updated last year
- Core repository of the retico framework providing the basic functionality of incremental processing.☆12May 18, 2026Updated last month
- [EMNLP 2025] The official implementation of "Zero-shot Multimodal Document Retrieval via Cross-Modal Question Generation"☆15Aug 26, 2025Updated 9 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆49Jan 31, 2024Updated 2 years ago
- ☆20Aug 20, 2025Updated 9 months ago
- The Gridspace-Stanford Harper Valley speech dataset. Created in support of CS224S.☆52Mar 12, 2021Updated 5 years ago
- ☆25Aug 29, 2025Updated 9 months ago
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆12Jul 5, 2022Updated 3 years ago
- ☆23Apr 18, 2022Updated 4 years ago
- Awesome paper lists for "A Desideratum for Conversational Agents: Capabilities, Challenges, and Future Directions""☆34Apr 25, 2025Updated last year
- The program ranked first in Audio-only track of DCASE2024 Challenge task3.☆22Mar 2, 2026Updated 3 months ago
- [INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by In…☆45Mar 25, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆12Aug 8, 2023Updated 2 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Sep 23, 2020Updated 5 years ago
- Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.☆39May 5, 2026Updated last month
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆19Jul 16, 2024Updated last year
- INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues☆61May 29, 2023Updated 3 years ago
- ☆33Dec 30, 2025Updated 5 months ago
- Unofficial implementation of miipher☆136Apr 19, 2024Updated 2 years ago
- ☆14Jul 5, 2024Updated last year
- This is the public repository for SALSA-Lite features for polyphonic sound event localization and detection using microphone arrays.☆15Dec 3, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆21Nov 24, 2022Updated 3 years ago
- Code for paper 'Zero-Shot Scene Graph Generation via Triplet Calibration and Reduction' (TOMM 2023)☆10Sep 6, 2025Updated 9 months ago
- Code used to run experiments for the ICLR 2023 paper "Computational Language Acquisition with Theory of Mind".☆15Apr 27, 2023Updated 3 years ago
- ☆12Aug 25, 2023Updated 2 years ago
- ☆14Apr 29, 2025Updated last year
- ☆51Nov 24, 2022Updated 3 years ago
- A repository for the EMNLP 2021 paper "Is Information Density Uniform in Task-Oriented Dialogues?" and for the CoNLL 2021 paper "Analysin…☆10Jun 17, 2024Updated 2 years ago