NTU-CCA/EE6401

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/NTU-CCA/EE6401)

NTU-CCA / EE6401

EE6401 Advanced Digital Signal Processing

☆18

Alternatives and similar repositories for EE6401

Users that are interested in EE6401 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sarulab-speech / SpatialCLAP
View on GitHub
☆19Oct 9, 2025Updated 9 months ago
yweweler / single-speaker-tts
View on GitHub
This is a single-speaker neural text-to-speech (TTS) system capable of training in a end-to-end fashion. It is inspired by the Tacotron a…
☆12Dec 28, 2018Updated 7 years ago
tikikun / f5-tts-mlx-quantized
View on GitHub
Implementation of F5-TTS in MLX
☆14Dec 13, 2024Updated last year
sakshamsingh1 / sound_distance_estimation
View on GitHub
Official implementation of "sound distance estimation" WASPAA 23
☆20Dec 31, 2023Updated 2 years ago
yumy-wang / NTU_ml2021spring
View on GitHub
NTU2021春季机器学习课程笔记和代码
☆12Aug 17, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
TomohikoNakamura / asteroid_jaCappella
View on GitHub
☆14Jul 28, 2023Updated 2 years ago
Jinbo-Hu / SELD-Data-Generator
View on GitHub
Data generator for sound event localization and detection clips, including 4-ch microphone-array-format signals and first-order-ambisonic…
☆22Nov 13, 2024Updated last year
JorisCos / VCTK-2Mix
View on GitHub
☆19Jul 12, 2020Updated 6 years ago
spatialaudio / python-sofa
View on GitHub
A python API for reading and writing SOFA files (https://www.sofaconventions.org/)
☆28Mar 8, 2021Updated 5 years ago
donghoney0416 / DeepASA
View on GitHub
Official page of "DeepASA: An Object-Oriented Multi-Purpose Network for Auditory Scene Analysis"
☆26Apr 15, 2026Updated 3 months ago
bfshi / ARML_Auxiliary_Task_Reweighting
View on GitHub
Code for our paper "Auxiliary Task Reweighting for Minimum-data Learning" (NeurIPS 2020)
☆18Dec 21, 2020Updated 5 years ago
essencevc / cyoa
View on GitHub
Choose your own adventure with LLMs
☆23May 27, 2025Updated last year
kuleshov-group / proseco
View on GitHub
Learn from Your Mistakes: Self-Correcting Masked Diffusion Models
☆15Jun 25, 2026Updated 3 weeks ago
yusunnny / CST-former
View on GitHub
CST-former: Transformer with Channel-Spectro-Temporal Attention for Sound Event Localization and Detection (ICASSP 2024)
☆38May 20, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Qinyu-Allen-Zhao / Arinar
View on GitHub
☆43May 30, 2025Updated last year
inoueakimitsu / ExcelAgentTemplate
View on GitHub
Sample Excel add-in and Python script code to run an agent using LLM from an Excel function
☆20Jul 16, 2024Updated 2 years ago
partha2409 / DCASE2025_seld_baseline
View on GitHub
☆27May 27, 2025Updated last year
sfcompute / tinynarrations
View on GitHub
A synthetic story narration dataset to study small audio LMs.
☆31Jan 21, 2024Updated 2 years ago
axeber01 / ngcc-seld
View on GitHub
Sound Event Localization and Detection using Neural Generalized Cross-Correlations
☆35Feb 11, 2025Updated last year
michaelneri / audio-distance-estimation
View on GitHub
Official repository of the work "Speaker Distance Estimation in Enclosures from Single-Channel Audio" published to IEEE/ACM Transactions …
☆40Jun 29, 2026Updated 3 weeks ago
xuyu0010 / ARID_v1
View on GitHub
A baseline demo for ARID Dataset
☆27Nov 8, 2021Updated 4 years ago
Jasson-Chen / Add_noise_and_rir_to_speech
View on GitHub
The purpose of this code base is to add a specified signal-to-noise ratio noise from MUSAN dataset to a pure speech signal and to generat…
☆31Sep 21, 2021Updated 4 years ago
Jinbo-Hu / PSELDNets
View on GitHub
PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection
☆46Sep 17, 2025Updated 10 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
cantabile-kwok / vec2wav2.0
View on GitHub
Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995
☆79Dec 3, 2024Updated last year
dianwen-ng / Keyword-Spotting-ConvMixer
View on GitHub
☆33Aug 10, 2022Updated 3 years ago
e-c-k-e-r / vall-e
View on GitHub
An unofficial PyTorch implementation of VALL-E
☆88Aug 3, 2025Updated 11 months ago
shkim0116 / KLASS
View on GitHub
[NeurIPS 2025 Spotlight] Implementation of "KLASS: KL-Guided Fast Inference in Masked Diffusion Models"
☆33Jan 3, 2026Updated 6 months ago
robatwilliams / openai-excel-functions
View on GitHub
Create OpenAI chat completions from Excel formulas
☆44Jan 22, 2024Updated 2 years ago
chentuochao / Sound_Bubble
View on GitHub
Project for speech bubble
☆66Aug 15, 2025Updated 11 months ago
marl / SpatialScaper
View on GitHub
☆75Aug 7, 2025Updated 11 months ago
dmis-lab / PerceiverCPI
View on GitHub
Bioinformatics'2022 PerceiverCPI: A nested cross-attention network for compound-protein interaction prediction
☆39Nov 9, 2023Updated 2 years ago
Kinyugo / torch_mdct
View on GitHub
A PyTorch implementation of the Modified Discrete Cosine Transform (MDCT) and its inverse for audio processing.
☆33Dec 17, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
kyungmnlee / dmf
View on GitHub
Official implementation of Decoupled MeanFlow
☆44Oct 28, 2025Updated 8 months ago
PVIT-official / PVIT
View on GitHub
Repository of paper: Position-Enhanced Visual Instruction Tuning for Multimodal Large Language Models
☆37Sep 19, 2023Updated 2 years ago
SpeechColab / GigaSpeech2
View on GitHub
An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement
☆197Apr 28, 2026Updated 2 months ago
sp-uhh / ears_benchmark
View on GitHub
Generation scripts for EARS-WHAM and EARS-Reverb
☆48Jul 4, 2025Updated last year
gsig / actor-observer
View on GitHub
ActorObserverNet code in PyTorch from "Actor and Observer: Joint Modeling of First and Third-Person Videos", CVPR 2018
☆84Mar 8, 2019Updated 7 years ago
chuanyang-Zheng / DAPE
View on GitHub
The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"
☆41Oct 11, 2024Updated last year
mvp-ai-lab / RAVEN
View on GitHub
Implementation of our paper "RAVEN: Real-time Autoregressive Video Extrapolation with Consistency-model GRPO"
☆52Jul 11, 2026Updated last week