dreamtheater123 / VoxEvalLinks
Github repository for ACL 2025 paper: VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models
☆9Updated last week
Alternatives and similar repositories for VoxEval
Users that are interested in VoxEval are comparing it to the libraries listed below
Sorting:
- ☆11Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- ☆15Updated 2 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated 2 years ago
- text to speech☆10Updated last year
- ☆13Updated 10 months ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆17Updated 7 months ago
- Code for ICML25 Paper "Overcoming Non-monotonicity in Transducer-based Streaming Generation"☆11Updated last month
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- ☆35Updated last year
- ☆16Updated 3 weeks ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆13Updated 6 months ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆25Updated last year
- A TTS Trained on Universal Audio.☆34Updated 2 weeks ago
- Text-to-Speech Latency Benchmark☆14Updated this week
- ☆14Updated 2 years ago
- ☆10Updated 7 months ago
- ☆19Updated last year
- Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".☆24Updated 11 months ago
- ☆12Updated 4 months ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆17Updated 11 months ago
- Production-ready vocoder using BigVSAN☆11Updated last year
- ☆10Updated 7 months ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆17Updated 8 months ago
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".☆13Updated last year
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated last month
- ☆28Updated 4 months ago
- Collection of scripts from mHuBERT-147.☆27Updated 7 months ago
- ☆12Updated last year
- ☆13Updated 7 months ago