qiuqiangkong / materials_for_students
☆12Updated last year
Alternatives and similar repositories for materials_for_students:
Users that are interested in materials_for_students are comparing it to the libraries listed below
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.☆32Updated 11 months ago
- ☆17Updated this week
- Official repo of ICASSP 2024 paper - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆48Updated last week
- For students who would like to apply for RA, PhD, postdoc in audio research.☆24Updated 2 months ago
- ☆63Updated last year
- ARCH: Audio Representations benCHmark☆39Updated 4 months ago
- Speech Human Evaluation Estimation Toolkit (SHEET)☆47Updated 2 months ago
- Query-conditioned target sound extraction model☆18Updated 2 months ago
- ☆48Updated 2 months ago
- ☆23Updated this week
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆34Updated last year
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).☆23Updated 3 months ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆76Updated 3 weeks ago
- ☆45Updated last month
- Implementation of SpatialCodec.☆55Updated last year
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations"☆33Updated 7 months ago
- ☆32Updated 3 weeks ago
- Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"☆49Updated last week
- This repository follows papers and reports on discrete speech representation learning and speech tokenization methods for speech language…☆15Updated last year
- ☆28Updated last month
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆14Updated last year
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆58Updated last month
- ☆14Updated 6 months ago
- Codec for paper: LLaSA: Scaling Train Time and Test Time Compute for LLaMA based Speech Synthesis.☆63Updated this week
- ☆17Updated last year
- A toolkit for researchers in the multimodal sound separation.☆16Updated last year
- ☆21Updated last year
- ☆48Updated last year
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline☆111Updated last month