biaofuxmu / wav2vec-SLinks
Code for ACL 2024 findings paper "wav2vec-S: Adapting Pre-trained Speech Models for Streaming"
☆10Updated 8 months ago
Alternatives and similar repositories for wav2vec-S
Users that are interested in wav2vec-S are comparing it to the libraries listed below
Sorting:
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated 2 years ago
- This is a repository dedicated for pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment.☆20Updated last year
- Zero-Shot Foreign Accent Conversion without a Native Reference☆36Updated last year
- ☆14Updated 5 months ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Updated 2 years ago
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆25Updated last year
- ☆58Updated last year
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 4 years ago
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated last year
- ☆23Updated 2 years ago
- Code for Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction (ACL24))☆48Updated last year
- ☆20Updated last year
- Datasets for turn-taking research☆17Updated 2 years ago
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"☆54Updated 3 years ago
- Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".☆25Updated last year
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 3 years ago
- Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis☆40Updated 2 years ago
- ☆11Updated 2 years ago
- Code for paper "Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition"☆28Updated 2 years ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Updated 2 months ago
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)☆14Updated 2 years ago
- A collection of all our phonemeizers for dataset construction and inference☆27Updated 10 months ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Updated last year
- Unsupervised video dubbing project☆40Updated 5 years ago
- speaker-disentangled speech linguistic content quantizer☆24Updated 9 months ago
- Official PyTorch implementation of TTS Style Transfer☆25Updated 3 years ago
- ☆14Updated 2 years ago
- ☆15Updated last year
- A simple voice conversion tool☆19Updated 3 years ago
- Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".☆63Updated last year