qiuqiangkong / materials_for_students
☆13Updated 10 months ago
Related projects
Alternatives and complementary repositories for materials_for_students
- A 6-million Audio-Caption Paired Dataset Built with an LLM- and ALM-based Automatic Pipeline☆66Updated 2 weeks ago
- ☆10Updated last month
- This is the official train-dev-test release of the Interspeech 2024 Discrete Speech Representation Challenge.☆32Updated 9 months ago
- ☆55Updated 11 months ago
- For students who would like to apply for RA, PhD, postdoc in audio research.☆24Updated 3 weeks ago
- ARCH: Audio Representations benCHmark☆37Updated 2 months ago
- ☆47Updated last week
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆84Updated 2 months ago
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆32Updated last year
- A toolkit dedicated to speech evaluation.☆18Updated last month
- Query-conditioned target sound extraction model☆17Updated 3 weeks ago
- This repository follows papers and reports on discrete speech representation learning and speech tokenization methods for speech language…☆15Updated 11 months ago
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations"☆31Updated 5 months ago
- Implementation of SpatialCodec.☆55Updated last year
- ☆51Updated last year
- Spherical residual vector quantization (SRVQ)☆26Updated 2 months ago
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆14Updated last year
- ☆47Updated 3 weeks ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆22Updated 7 months ago
- ☆20Updated 10 months ago
- A toolkit for researchers in multimodal sound separation.☆16Updated last year
- ☆27Updated 2 weeks ago
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).☆22Updated last month
- ☆14Updated 4 months ago
- Code for CVSSP submission to DCASE 2021 Task 6☆35Updated 2 years ago
- ☆27Updated last year
- experiments about AudioSet☆43Updated last year
- 🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)☆32Updated last month
- ICASSP 2025: Dynamic Embedding Causal Target Speech Extraction☆29Updated last month
- Learning differentiable temporal resolution on time-series data.☆33Updated 2 years ago