☆14Jun 9, 2021Updated 4 years ago
Alternatives and similar repositories for dcase2021_task1a_baseline
Users that are interested in dcase2021_task1a_baseline are comparing it to the libraries listed below
Sorting:
- Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network☆13Sep 18, 2020Updated 5 years ago
- Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal F…☆16Aug 9, 2021Updated 4 years ago
- ☆15Oct 15, 2020Updated 5 years ago
- Source code of the DCASE 2020 SELD submission "Audio Event Detection and Localization with Multitask Regression Network"☆16Jul 8, 2020Updated 5 years ago
- Keras/Pytorch neural network size, operations and parameters counter☆16Mar 23, 2023Updated 2 years ago
- Simple baseline model for the HEAR benchmark☆23Feb 17, 2026Updated 2 weeks ago
- ☆16Apr 11, 2019Updated 6 years ago
- Baseline of DCASE 2020 task 4☆43Oct 24, 2022Updated 3 years ago
- ☆54Jun 3, 2020Updated 5 years ago
- TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks☆22Jan 19, 2026Updated last month
- arxiv daily for speech translation, legal. Ref: Vincentqyw/cv-arxiv-daily☆15Jan 6, 2025Updated last year
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆34Jun 25, 2021Updated 4 years ago
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- Paderborn Sound Event Detection☆78Jul 18, 2023Updated 2 years ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Dec 22, 2022Updated 3 years ago
- audioLIME: Listenable Explanations Using Source Separation☆37Jul 22, 2021Updated 4 years ago
- 本项目使用中文人声的数据集,在Speech Denoising with Deep Feature Losses网络的基础上fine-tune,得到对中文音频有更好去噪效果的结果☆30Nov 19, 2019Updated 6 years ago
- Visual Relationship Understanding☆10Oct 2, 2021Updated 4 years ago
- The active learning algorithm, mismatch-first farthest-traversal. Implementation and visualization.☆12Dec 25, 2021Updated 4 years ago
- [AAAI 2026 Oral] The official GitHub page of "PosterVerse: A Full-Workflow Framework for Commercial-Grade Poster Generation with HTML-Bas…☆40Jan 30, 2026Updated last month
- Accompanying code for the paper Sub-Cluster AdaCos: Learning Representations for Anomalous Sound Detection.☆10Jun 7, 2022Updated 3 years ago
- WaveNet auto-ancoders for ZeroSpeech challenge 2020☆37Apr 7, 2022Updated 3 years ago
- rewrite python scipy.signal.lfilter in c code☆11Aug 13, 2019Updated 6 years ago
- Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations"☆11Sep 20, 2021Updated 4 years ago
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming☆15Aug 20, 2024Updated last year
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols☆17Nov 19, 2025Updated 3 months ago
- WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models☆27Feb 13, 2026Updated 2 weeks ago
- Simple distance sampling analysis☆12Oct 17, 2025Updated 4 months ago
- Code for DCASE 2020 task 1a and task 1b.☆88Jan 20, 2022Updated 4 years ago
- Baseline method for sound event localization task of DCASE 2021 challenge☆42Jun 15, 2021Updated 4 years ago
- Toolkit for downloading and processing Google's AudioSet dataset.☆176Aug 22, 2025Updated 6 months ago
- COLA contrastive pre-training method implemented in PyTorch☆43Jan 27, 2021Updated 5 years ago
- Sound2Synth Plug-Ins☆13Jul 28, 2022Updated 3 years ago
- Tools to build knowledge graphs from multi-modal extractions☆12Apr 2, 2020Updated 5 years ago
- keras implementation of A Discriminative Feature Learning Approach for Deep Face Recognition based on MNIST☆10Mar 1, 2019Updated 7 years ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 3 months ago
- Tools to convert sigsep mus dataset from STEMS <-> WAV☆11Jul 15, 2020Updated 5 years ago
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 7 months ago
- YoloV6 for a bare Raspberry Pi using ncnn.☆11Jun 12, 2024Updated last year