koudounasalkis/Audio-Speech-Tutorial

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/koudounasalkis/Audio-Speech-Tutorial)

koudounasalkis / Audio-Speech-Tutorial

This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.

☆19

Alternatives and similar repositories for Audio-Speech-Tutorial

Users that are interested in Audio-Speech-Tutorial are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

RiTA-nlp / ITALIC
View on GitHub
ITALIC: An ITALian Intent Classification Dataset
☆14Nov 24, 2023Updated 2 years ago
eleonorapoeta / benchmarking-KAN
View on GitHub
This repository contains the official implementation of "A Benchmarking Study of Kolmogorov-Arnold Networks on Tabular Data" (under revie…
☆17Jul 10, 2024Updated 2 years ago
spapicchio / QATCH
View on GitHub
Official implementation of QATCH: Benchmarking SQL-centric tasks with Table Representation Learning Models on Your Data
☆33Jul 17, 2025Updated last year
gallipoligiuseppe / TST-CycleGAN
View on GitHub
This repository contains the code for the paper "Self-supervised Text Style Transfer using Cycle-Consistent Adversarial Networks".
☆11Dec 2, 2024Updated last year
K-STMLab / SSL4PR
View on GitHub
This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr…
☆12Dec 19, 2025Updated 7 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Sreyan88 / RECAP
View on GitHub
Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning
☆16Jun 23, 2024Updated 2 years ago
abikaki / awesome-speech-emotion-recognition
View on GitHub
😎 Awesome lists about Speech Emotion Recognition
☆101Dec 24, 2024Updated last year
dbdmg / llm
View on GitHub
Repository for the LLM course
☆31Jan 4, 2026Updated 6 months ago
koudounasalkis / AI4Voice
View on GitHub
This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024
☆15Jun 11, 2024Updated 2 years ago
zqlsnr / DPCRN
View on GitHub
real-time speech enhance
☆18Jan 23, 2024Updated 2 years ago
eipm / bridge2ai-redcap
View on GitHub
Bridge2AI Voice | REDCap
☆16Updated this week
hlt-mt / Speech-MASSIVE
View on GitHub
Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…
☆25Oct 8, 2025Updated 9 months ago
skit-ai / Map-Mix
View on GitHub
The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…
☆18Feb 17, 2023Updated 3 years ago
wbbeyourself / DTE
View on GitHub
Detect-Then-Explain Framework for Text-to-SQL task
☆10Dec 6, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
andi611 / Mockingjay-Speech-Representation
View on GitHub
Official Implementation of Mockingjay in Pytorch
☆55Jul 6, 2023Updated 3 years ago
Jiaxin-Ye / Emo-DNA
View on GitHub
[ACM MM 2023] Official PyTorch implementation of "Emo-DNA: Emotion Decoupling and Alignment Learning for Cross-Corpus Speech Emotion Reco…
☆12Aug 4, 2023Updated 2 years ago
g8a9 / ear
View on GitHub
Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"
☆50May 31, 2022Updated 4 years ago
JusperLee / awesome-speech-enhancement
View on GitHub
speech enhancement\speech seperation\sound source localization
☆15Apr 22, 2020Updated 6 years ago
AdityaDutt / Audio-Classification-Using-Wavelet-Transform
View on GitHub
Classifying audio using Wavelet transform and deep learning
☆35Sep 5, 2021Updated 4 years ago
adlnlp / form_nlu
View on GitHub
☆19Nov 1, 2024Updated last year
koudounasalkis / voc2vec
View on GitHub
This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.
☆57Apr 14, 2025Updated last year
DarthReca / crop-field-segmentation-ukan
View on GitHub
☆37May 27, 2025Updated last year
charlesliucn / LanMIT
View on GitHub
📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.
☆22Jul 12, 2019Updated 7 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
PacktPublishing / Mastering-Unity-2D-Game-Development-Second-Edition
View on GitHub
Code repository for Mastering Unity 2D Game Development Second Edition, by Packt
☆29Jan 14, 2021Updated 5 years ago
TheDataStation / solo
View on GitHub
☆13Jan 8, 2025Updated last year
sumansamui / ECE715_Machine_Learning
View on GitHub
This repository contains all the content related to the machine learning course (ECE715) conducted at Dept. of ECE, NIT Durgapur
☆10Nov 15, 2024Updated last year
saparina / ambrosia
View on GitHub
𝔸𝕄𝔹ℝ𝕆𝕊𝕀𝔸: A Benchmark for Parsing Ambiguous Questions into Database Queries
☆16Oct 31, 2024Updated last year
skakouros / s3prl_attentive_correlation
View on GitHub
Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit
☆13Nov 18, 2022Updated 3 years ago
facebookresearch / TimelineQA
View on GitHub
This is the repository for TimelineQA, a benchmark for querying lifelogs.
☆27Jul 5, 2023Updated 3 years ago
GeWanying / shap-anti-spoofing
View on GitHub
This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A…
☆12Jan 24, 2024Updated 2 years ago
deep-privacy / SA-toolkit
View on GitHub
SA-toolkit: Speaker speech anonymization toolkit in python
☆33Sep 18, 2025Updated 10 months ago
madelonhulsebos / neural-table-representations-tutorial-2023
View on GitHub
Repository with an overview of the tutorial on Models and Practice of Neural Table Representations and up to date material for the hands-…
☆21Jun 29, 2023Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
zafarrafii / CQHC-Python
View on GitHub
Constant-Q harmonic coefficients (CQHCs), a timbre feature designed for music signals.
☆29Sep 13, 2025Updated 10 months ago
bill9800 / Speech-denoise-Autoencoder
View on GitHub
Speech denoiser model using Keras
☆20Jan 23, 2019Updated 7 years ago
facebookresearch / PostText
View on GitHub
PostText is a QA system for querying your text data. When appropriate structured views are in place, PostText is good at answering querie…
☆32Jun 14, 2023Updated 3 years ago
saiful9379 / BanglaASR
View on GitHub
Fine-tune Bangla ASR model which was trained Bangla Mozilla Common Voice Dataset
☆12Apr 16, 2024Updated 2 years ago
alirezamshi / RQUGE
View on GitHub
The implementation of "RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question" [ACL 2023]
☆17Apr 7, 2024Updated 2 years ago
muhdhuz / Audio_NeuralStyle
View on GitHub
An implementation of Neural Style Transfer for Audio using Pytorch.
☆11Dec 14, 2017Updated 8 years ago
IliaZenkov / sklearn-audio-classification
View on GitHub
An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, …
☆79Nov 5, 2020Updated 5 years ago