vad
☆25 Apr 3, 2023 Updated 2 years ago
Alternatives and similar repositories for vap_turn_taking
Users who are interested in vap_turn_taking are comparing it to the libraries listed below.
- ☆14 Feb 9, 2023 Updated 3 years ago
- Datasets for turn-taking research ☆19 Dec 21, 2023 Updated 2 years ago
- ☆12 Feb 16, 2024 Updated 2 years ago
- ☆15 Aug 19, 2023 Updated 2 years ago
- TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog ☆65 May 18, 2024 Updated last year
- A real-time implementation of Voice Activity Projection (VAP) is aimed at controlling behaviors of spoken dialogue systems, such as turn-… ☆96 Jul 24, 2025 Updated 8 months ago
- Online Detection of Action Start in Untrimmed, Streaming Videos ☆12 Sep 1, 2018 Updated 7 years ago
- Official Repo for the Paper "AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution o… ☆24 Jan 12, 2025 Updated last year
- ☆19 Apr 28, 2023 Updated 2 years ago
- Universal differential equations for ecologists ☆14 Mar 2, 2026 Updated 3 weeks ago
- ☆16 Oct 7, 2022 Updated 3 years ago
- VAD (Voice Activity Detection) tool ☆15 Sep 22, 2021 Updated 4 years ago
- 7DRL 2023 ☆15 May 27, 2023 Updated 2 years ago
- ☆49 Jan 31, 2024 Updated 2 years ago
- The Gridspace-Stanford Harper Valley speech dataset. Created in support of CS224S. ☆50 Mar 12, 2021 Updated 5 years ago
- ☆25 Aug 29, 2025 Updated 7 months ago
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization ☆12 Jul 5, 2022 Updated 3 years ago
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models. ☆19 Feb 6, 2025 Updated last year
- The program ranked first in Audio-only track of DCASE2024 Challenge task3. ☆20 Mar 2, 2026 Updated 3 weeks ago
- INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues ☆58 May 29, 2023 Updated 2 years ago
- Target Agnostic Attack on Deep Models: Exploiting Security Vulnerabilities of Transfer Learning ☆10 Jul 2, 2019 Updated 6 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition ☆26 Sep 23, 2020 Updated 5 years ago
- Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features. ☆39 Mar 4, 2024 Updated 2 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition ☆19 Jul 16, 2024 Updated last year
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee… ☆17 Sep 19, 2023 Updated 2 years ago
- ☆28 Dec 30, 2025 Updated 3 months ago
- This repository will contain links to the most famous available books of ML that are online ☆12 Oct 15, 2024 Updated last year
- Unofficial implementation of miipher ☆135 Apr 19, 2024 Updated last year
- ☆21 Nov 24, 2022 Updated 3 years ago
- Code for paper 'Zero-Shot Scene Graph Generation via Triplet Calibration and Reduction' (TOMM 2023) ☆10 Sep 6, 2025 Updated 6 months ago
- Official Implementation of implicit reference attack ☆11 Oct 16, 2024 Updated last year
- ☆14 Apr 29, 2025 Updated 11 months ago
- ☆12 Aug 25, 2023 Updated 2 years ago
- Code used to run experiments for the ICLR 2023 paper "Computational Language Acquisition with Theory of Mind". ☆15 Apr 27, 2023 Updated 2 years ago
- ☆50 Nov 24, 2022 Updated 3 years ago
- This repository is for the paper Incorporating External POS Tagger for Punctuation Restoration. Proc. Interspeech 2021, 1987-1991, doi: 1… ☆11 Jul 5, 2023 Updated 2 years ago
- Ecoacoustic analysis platform empowering conservationists to analyze acoustic data and to derive insights about the ecosystem at scale ☆17 Mar 17, 2026 Updated last week
- Menagerie of video models trained on various video datasets ☆10 Oct 13, 2024 Updated last year
- This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one … ☆40 Mar 13, 2024 Updated 2 years ago