Speech-Interaction-Technology-Aalto-U/itsp

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Speech-Interaction-Technology-Aalto-U/itsp)

Speech-Interaction-Technology-Aalto-U / itsp

Introduction to Speech Processing

☆122

Alternatives and similar repositories for itsp

Users that are interested in itsp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

megseekosh / dsp_tutorials
View on GitHub
I wanted guided tutorials on digital signal processing so I decided to create them. The result is this ebook: "Digital Signal Processing …
☆12Feb 5, 2024Updated 2 years ago
dayanavivolab / s3prl
View on GitHub
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
☆10Feb 29, 2024Updated 2 years ago
Chutlhu / EUSIPCO25_PIML_tutorial
View on GitHub
☆18Sep 8, 2025Updated 10 months ago
violet-liang / soundfield-reconstruction-np
View on GitHub
Sound field reconstruction using neural processes with dynamic kernels
☆16Mar 25, 2025Updated last year
sivannavis / samo
View on GitHub
SAMO: SPEAKER ATTRACTOR MULTI-CENTER ONE-CLASS LEARNING FOR VOICE ANTI-SPOOFING
☆42Apr 5, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
b-sigpro / neural-fcasa
View on GitHub
This is a repository of neural full-rank spatial covariance analysis with speaker activity (neural FCASA).
☆40Mar 12, 2025Updated last year
finkelbert / ProPer_Projekt
View on GitHub
A workflow for acoustic analysis of speech prosody based on continuous measurements of periodic energy and F0 (requires Praat and R). Pro…
☆15Apr 29, 2026Updated 2 months ago
xmos / fwk_voice
View on GitHub
Voice Framework
☆18Jan 21, 2026Updated 6 months ago
chdh / klatt-syn
View on GitHub
Klatt formant synthesizer
☆76Jun 26, 2026Updated 3 weeks ago
Audio-Experience-Design / LAPChallenge
View on GitHub
The LAP Challenge aims at advancing spatial audio technologies through the personalization of HRTFs.
☆16Aug 12, 2025Updated 11 months ago
linjac / GenDARA
View on GitHub
☆13Jan 14, 2025Updated last year
sh01k / imp_tsp
View on GitHub
Measuring impulse response with time-stretched pulse (TSP) signal
☆14Jul 3, 2019Updated 7 years ago
kirbyj / praatsauce
View on GitHub
Praat-based tools for spectral analysis
☆37May 28, 2026Updated last month
merlresearch / neural-IIR-field
View on GitHub
Neural IIR Filter Field for HRTF Upsampling and Personalization
☆29Feb 26, 2024Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
scottishfold0621 / ACMID
View on GitHub
☆26Apr 30, 2026Updated 2 months ago
soskuthy / gamm_strategies
View on GitHub
Supplementary materials for "Evaluating generalised additive mixed modelling strategies for dynamic speech analysis"
☆10Jan 25, 2021Updated 5 years ago
kirbyj / praatdet
View on GitHub
Praat-based tools for EGG analysis
☆20Sep 21, 2023Updated 2 years ago
multimedia-berkeley / deep_hashing_coverSongDetection
View on GitHub
Cover Song Detection System
☆10Mar 29, 2019Updated 7 years ago
chaufanglin / Normal2Whisper
View on GitHub
Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"
☆14Oct 31, 2024Updated last year
taishi-n / torchrir
View on GitHub
PyTorch-based room impulse response (RIR) simulation toolkit with dynamic scenes, GPU acceleration.
☆22Feb 18, 2026Updated 5 months ago
Sreyan88 / Toxicity-Detection-in-Spoken-Utterances
View on GitHub
This repository contains the code for the paper: "DeToxy: A Large-Scale Multimodal Dataset for Toxicity Classification in Spoken Utteranc…
☆21Oct 13, 2022Updated 3 years ago
yukara-ikemiya / floss-torch
View on GitHub
PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind
☆96Nov 24, 2025Updated 7 months ago
ChristopherCarignan / formant-optimization
View on GitHub
Praat script for automatic formant optimization
☆15Jan 27, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
lstrgar / ss-phoneme-seg
View on GitHub
Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology…
☆55Nov 4, 2022Updated 3 years ago
ahmedshah1494 / speech_robust_bench
View on GitHub
☆18Apr 24, 2025Updated last year
zqwang7 / CausalityCheck
View on GitHub
Causality Check in Frame-online Speech Separation
☆51Dec 11, 2022Updated 3 years ago
maxrmorrison / clpcnet
View on GitHub
Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)
☆166Aug 5, 2022Updated 3 years ago
mlml / autovot
View on GitHub
Trainable algorithm for automatic measurement of voice onset time
☆69Jul 26, 2023Updated 2 years ago
unilight / jatts
View on GitHub
JATTS: A modern, research-oriented Japanese Text-to-speech Open-sourced Toolkit
☆43Mar 13, 2026Updated 4 months ago
intro2ddsp / intro2ddsp.github.io
View on GitHub
A Jupyter book accompanying the ISMIR 2023 tutorial Introduction to DIfferentiable Audio Synthesiser Programming
☆62Jun 30, 2025Updated last year
drfeinberg / Parselmouth-Guides
View on GitHub
These are Jupyter Notebooks to help guide people to learn how to use Praat-Parselmouth
☆43Sep 29, 2021Updated 4 years ago
ZhongshuHou / LSA
View on GitHub
Ablation study of local spectral attention (LSA) for full-band speech enhancement (SE)
☆28Sep 16, 2023Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
alexandergwm / Classical-Sound-Source-Localization-Algorithms-in-Spherical-Domain
View on GitHub
Here is a repository stored the classical sound source localization algorithms in spherical domain, namely, PWD, DAS, SHMUSIC, SHMVDR, S…
☆23Nov 16, 2023Updated 2 years ago
fakufaku / diffusion-separation
View on GitHub
Single channel speech source separation by diffusion process (ICASSP 2023)
☆126Mar 15, 2024Updated 2 years ago
AlanBaade / SyllableLM
View on GitHub
Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models
☆63Jul 1, 2025Updated last year
lucidrains / rvq-vae-gpt
View on GitHub
My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation
☆90Oct 11, 2024Updated last year
Legisign / Praat-textgrids
View on GitHub
Praat textgrid manipulation in Python
☆55Apr 3, 2025Updated last year
zerong7777-boop / gtcrn-light
View on GitHub
Operator-level compressed GTCRN with ERB-CRM pipeline preserved and DPGRNN intact, ready for edge deployment.
☆22Feb 11, 2026Updated 5 months ago
jiemojiemo / rubberband_pitch_shift_plugin
View on GitHub
A Pitch shifter plugin implementation using JUCE and rubberband
☆19Jan 29, 2023Updated 3 years ago