Speech-Interaction-Technology-Aalto-U / itspView external linksLinks
Introduction to Speech Processing
☆113Oct 31, 2025Updated 3 months ago
Alternatives and similar repositories for itsp
Users that are interested in itsp are comparing it to the libraries listed below
Sorting:
- I wanted guided tutorials on digital signal processing so I decided to create them. The result is this ebook: "Digital Signal Processing …☆12Feb 5, 2024Updated 2 years ago
- A Jupyter book accompanying the ISMIR 2023 tutorial Introduction to DIfferentiable Audio Synthesiser Programming☆62Jun 30, 2025Updated 7 months ago
- Neural IIR Filter Field for HRTF Upsampling and Personalization☆26Feb 26, 2024Updated last year
- Temporary anonymous version☆22Mar 20, 2024Updated last year
- Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)☆163Aug 5, 2022Updated 3 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 2 years ago
- Official implementation of INTERSPECCH 2022 Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals☆16Sep 19, 2025Updated 4 months ago
- A dataset of 173 progressive metal songs, in both GuitarPro and token formats, as per the specifications in DadaGP.☆16Nov 19, 2024Updated last year
- SongDriver uses a parallel mechanism of prediction and arrangement phases to achieve zero logical latency in real-time accompaniment gene…☆14Jan 5, 2026Updated last month
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- A Python Library for Fundamental Frequency Estimation in Music Recordings☆54Jan 16, 2026Updated last month
- List of Podcast Feeds using iTunes API and script to download 6,000,000~ hours of English speech.☆31Apr 13, 2023Updated 2 years ago
- Codebase for ICLR' 23 paper- ''wav2tok: Deep Sequence Tokenizer for Audio Retrieval"☆36Feb 10, 2026Updated last week
- ☆13Jan 14, 2025Updated last year
- The LAP Challenge aims at advancing spatial audio technologies through the personalization of HRTFs.☆16Aug 12, 2025Updated 6 months ago
- Sound field reconstruction using neural processes with dynamic kernels☆15Mar 25, 2025Updated 10 months ago
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆12Jul 5, 2022Updated 3 years ago
- Cover Song Detection System☆10Mar 29, 2019Updated 6 years ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆59Jul 1, 2025Updated 7 months ago
- Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023☆27Apr 27, 2023Updated 2 years ago
- ☆29Jun 8, 2023Updated 2 years ago
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024☆16Nov 19, 2024Updated last year
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 5, 2025Updated last year
- Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models☆15Sep 10, 2025Updated 5 months ago
- Phoneme segmentation using pre-trained speech models☆55Nov 4, 2022Updated 3 years ago
- ☆61Oct 28, 2024Updated last year
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆90Oct 11, 2024Updated last year
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆32Apr 22, 2024Updated last year
- Fully Quantized Neural Networks For Speech Enhancement☆63Feb 15, 2024Updated 2 years ago
- Praat script for automatic formant optimization☆15Jan 27, 2023Updated 3 years ago
- Optimizing speaker verification and spoofing countermeasure systems together with REINFORCE☆13Mar 31, 2021Updated 4 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆174Jun 9, 2023Updated 2 years ago
- Trainable algorithm for automatic measurement of voice onset time☆67Jul 26, 2023Updated 2 years ago
- Praat-based tools for spectral analysis☆34Feb 7, 2026Updated last week
- [ACII 2023] PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Spe…☆60Jul 1, 2024Updated last year
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆37Aug 29, 2023Updated 2 years ago
- JATTS: A modern, research-oriented Japanese Text-to-speech Open-sourced Toolkit☆44May 26, 2025Updated 8 months ago
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- A Chinese version of A Neural Parametric Singing Synthesizer☆13Feb 12, 2022Updated 4 years ago