lwang114 / GraphUnsupASR
☆9Updated last year
Alternatives and similar repositories for GraphUnsupASR
Users who are interested in GraphUnsupASR are comparing it to the libraries listed below.
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated 2 years ago
- ☆13Updated 2 weeks ago
- [ACM MM 2023] Official PyTorch implementation of "Emo-DNA: Emotion Decoupling and Alignment Learning for Cross-Corpus Speech Emotion Reco…☆11Updated 2 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆17Updated 10 months ago
- ☆13Updated 11 months ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆32Updated last year
- ☆20Updated last year
- ☆14Updated last year
- ☆14Updated last year
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M…☆19Updated 2 years ago
- ☆21Updated last year
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆25Updated 11 months ago
- Collection of scripts from mHuBERT-147.☆29Updated 8 months ago
- Visual Speech Recognition☆18Updated 7 months ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆33Updated last year
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆14Updated 8 months ago
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆26Updated last year
- ☆11Updated last year
- ☆34Updated 4 years ago
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆37Updated last year
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Updated 4 years ago
- ☆9Updated 5 years ago
- Temporary anonymous version☆22Updated last year
- ☆11Updated last year
- GPT for FACodec☆13Updated last year
- The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral)☆47Updated 3 months ago
- ☆15Updated 4 years ago
- An official implementation of Style-Talker for Spoken Dialogue Generation☆21Updated 6 months ago
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆21Updated 2 months ago
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".☆12Updated last year