SpectroMap is a peak detection algorithm that computes the constellation map for a given signal
☆33Jun 19, 2024Updated last year
Alternatives and similar repositories for SpectroMap
Users that are interested in SpectroMap are comparing it to the libraries listed below
Sorting:
- ☆22Oct 12, 2023Updated 2 years ago
- A simple audio fingerprinting system☆34Aug 27, 2022Updated 3 years ago
- ☆23Aug 30, 2022Updated 3 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆34Aug 27, 2023Updated 2 years ago
- automatic audio labelling with laion-clap☆21Jun 20, 2024Updated last year
- lyrics-to-audio-alignement system. Initially done using HTK for rapid prototyping☆14Mar 14, 2018Updated 8 years ago
- TAPE: An End-to-End Timbre-Aware Pitch Estimator☆24Nov 25, 2023Updated 2 years ago
- Some Demo Code for the MPA Exercise.☆10Dec 4, 2017Updated 8 years ago
- ☆28Jul 31, 2025Updated 7 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆13Oct 28, 2023Updated 2 years ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- ☆16Jan 16, 2025Updated last year
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆45Updated this week
- An implementation of Compositional Attention: Disentangling Search and Retrieval by MILA☆14Jun 1, 2022Updated 3 years ago
- ☆14Aug 16, 2023Updated 2 years ago
- Survey of available speech datasets for Polish ASR development☆17Jan 1, 2025Updated last year
- Starter template for an online book or docs site made with Markdown and mdBook 🦀 📙☆13Nov 14, 2022Updated 3 years ago
- Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals☆18Aug 8, 2024Updated last year
- 🎉 TrustJudge is accepted to ICLR 2026!☆38Sep 27, 2025Updated 5 months ago
- Training code and trained checkpoints for ASGAN.☆62Dec 27, 2023Updated 2 years ago
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆24Sep 9, 2024Updated last year
- STOI loss functions in PyTorch (mirror of https://github.com/mpariente/pytorch_stoi)☆15Aug 6, 2020Updated 5 years ago
- A Medical / Clinical Note Taking Demo Application using Deepgram Voice Agent API☆14Jul 9, 2025Updated 8 months ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- This repository includes the code to reproduce our paper Partially-Connected Differentiable Architecture Search for Deepfake and Spoofing…☆18Apr 30, 2022Updated 3 years ago
- ☆74Apr 4, 2024Updated last year
- [ICML 2024] UGrid: An Efficient-And-Rigorous Neural Multigrid Solver for Linear PDEs☆10Aug 7, 2025Updated 7 months ago
- BenchBench is a Python package to evaluate multi-task benchmarks.☆18Oct 12, 2025Updated 5 months ago
- SUpDEq - Spatial Upsampling by Directional Equalization☆33May 6, 2025Updated 10 months ago
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding☆24Jul 16, 2024Updated last year
- A repository for my MSc thesis in Data Science & Machine Learning @ NTUA. A deep learning approach to audio fingerprinting for recognizin…☆50Nov 12, 2024Updated last year
- ☆43Jan 13, 2025Updated last year
- Official implementation of Neural Audio Fingerprint (ICASSP 2021)☆203Aug 21, 2025Updated 7 months ago
- ☆19Dec 8, 2020Updated 5 years ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 4 months ago
- This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish☆14Nov 24, 2023Updated 2 years ago
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆47Nov 19, 2024Updated last year
- Tiny UTF-8 ANSI/VT102 terminal abstraction in C☆20Aug 19, 2014Updated 11 years ago