tandav / pitch-detectors
A collection of pitch (f0, fundamental frequency) detection algorithms with a unified interface.
☆24 · Nov 25, 2024 · Updated last year
Alternatives and similar repositories for pitch-detectors
Users who are interested in pitch-detectors are comparing it to the libraries listed below.
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz ☆24 · Jan 2, 2024 · Updated 2 years ago
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in … ☆11 · Mar 29, 2021 · Updated 4 years ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX. ☆15 · Dec 23, 2024 · Updated last year
- The power-law compressed phase-aware asymmetric (PLCPA-ASYM) loss ☆14 · Sep 4, 2023 · Updated 2 years ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction ☆13 · Jul 22, 2024 · Updated last year
- speech enhancement algorithms for microphone arrays ☆15 · May 12, 2020 · Updated 5 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon! ☆11 · May 24, 2023 · Updated 2 years ago
- Multispeaker Community Vocoder Model for DiffSinger ☆39 · Aug 11, 2025 · Updated 6 months ago
- phase reconstruction from magnitude terms of an STFT ☆13 · May 18, 2025 · Updated 8 months ago
- bin2bin, a Time-Frequency Generative Adversarial based method for Audio Packet Loss Concealment ☆16 · Dec 29, 2023 · Updated 2 years ago
- SMILE: A Multimodal Dataset for Understanding Laughter ☆13 · Jun 15, 2023 · Updated 2 years ago
- ☆13 · Jul 20, 2024 · Updated last year
- ☆18 · Jan 30, 2023 · Updated 3 years ago
- Papez: Resource-Efficient Speech Separation with Auditory Working Memory (ICASSP 2023) ☆20 · Jun 25, 2023 · Updated 2 years ago
- Official This-Is-My Dataset published in CVPR 2023 ☆16 · Jul 18, 2024 · Updated last year
- ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation) ☆16 · Jan 18, 2024 · Updated 2 years ago
- DistantSpeech ☆21 · Oct 9, 2023 · Updated 2 years ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w… ☆93 · Sep 2, 2025 · Updated 5 months ago
- Pitch Controllable DDSP Vocoders ☆78 · Nov 9, 2024 · Updated last year
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION ☆80 · Aug 20, 2024 · Updated last year
- Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation ☆23 · Nov 12, 2025 · Updated 3 months ago
- A pip installable package for optimal transport inspired loss functions in the spectral domain. Can be used for audio applications such a… ☆29 · Dec 5, 2025 · Updated 2 months ago
- ☆21 · Jul 15, 2024 · Updated last year
- A Python Library for Fundamental Frequency Estimation in Music Recordings ☆54 · Jan 16, 2026 · Updated last month
- Self-supervised learning for real-time pitch estimation ☆275 · Oct 15, 2025 · Updated 4 months ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation… ☆22 · Nov 8, 2023 · Updated 2 years ago
- ☆29 · Apr 17, 2023 · Updated 2 years ago
- Set of tools to work with scales, modes, modulations, chord progressions, voice leading, rhythm and more ☆18 · Jan 19, 2025 · Updated last year
- ActMAD: Activation Matching to Align Distributions for Test-Time-Training (CVPR 2023) ☆21 · Jun 27, 2023 · Updated 2 years ago
- TAPE: An End-to-End Timbre-Aware Pitch Estimator ☆23 · Nov 25, 2023 · Updated 2 years ago
- Standalone real time dynamic vocal harmonizer ☆25 · Nov 28, 2023 · Updated 2 years ago
- ☆57 · Dec 2, 2024 · Updated last year
- Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024). ☆78 · Jun 8, 2025 · Updated 8 months ago
- ☆54 · Mar 2, 2023 · Updated 2 years ago
- Unofficial implementation of NANSY++ in Pytorch Lightning ☆50 · Mar 11, 2024 · Updated last year
- ONNX deployment of the CREPE pitch tracker ☆26 · Oct 27, 2022 · Updated 3 years ago
- Streaming Vocos ☆29 · Jun 10, 2025 · Updated 8 months ago
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters ☆36 · Jun 20, 2023 · Updated 2 years ago
- [CVPR 2023] Egocentric Audio-Visual Object Localization ☆26 · Jan 6, 2024 · Updated 2 years ago