itmo-mbss-lab/sr_labs_book

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/itmo-mbss-lab/sr_labs_book)

itmo-mbss-lab / sr_labs_book

The project is related to the development of labs for the ITMO Speaker Recognition Course.

☆16

Alternatives and similar repositories for sr_labs_book

Users that are interested in sr_labs_book are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Speech-Lab-IITM / Hindi-ASR-Challenge
View on GitHub
🎯 Speech Recognition Challenge by Speech Lab - IIT Madras
☆10Nov 5, 2020Updated 5 years ago
hlt-bme-hu / hunspeech
View on GitHub
☆14Jan 24, 2017Updated 9 years ago
radinshayanfar / speaker-verification
View on GitHub
Speaker verification task with ECAPA-TDNN model (trained on Persian dataset)
☆12Sep 15, 2022Updated 3 years ago
v-iashin / VoxCeleb
View on GitHub
An attempt to replicate the results of [1706.08612] VoxCeleb: a large-scale speaker identification dataset
☆12Dec 11, 2019Updated 6 years ago
gaochangw / DeltaRNN
View on GitHub
Latest PyTorch Implementation of DeltaGRU & DeltaLSTM that Exploits Temporal Sparsity in Sequential Data
☆18Sep 30, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
LoopPerfect / satori
View on GitHub
An HTTP server library in C++
☆16Jan 10, 2019Updated 7 years ago
aalto-speech / interspeech2019_karhila_et_al
View on GitHub
Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…
☆25May 6, 2019Updated 7 years ago
xudonmao / improved_LSGAN
View on GitHub
☆16Mar 9, 2018Updated 8 years ago
kuz / Introduction-to-GPU-Computing
View on GitHub
Slides and example code for the seminar presentation about general purpose computations on GPU
☆12Jan 3, 2015Updated 11 years ago
zengchang233 / asv_neural_network
View on GitHub
neural network and loss for asv implemented by PyTorch. (Triplet loss, LMCL, Angular Loss, Softmax)
☆21Oct 23, 2019Updated 6 years ago
ainnotate / StreamingSpeakerDiarization
View on GitHub
Lightweight python library for speaker diarization in real time implemented in pytorch
☆12Oct 12, 2022Updated 3 years ago
wikiscript / countries.json
View on GitHub
World Country Profiles Sourced from Wikipedia's Country Page Infoboxes Converted into JSON - Free Open Public Domain Data
☆14Dec 10, 2020Updated 5 years ago
aws-samples / content-based-item-recommender
View on GitHub
☆10Apr 2, 2024Updated 2 years ago
ASPP / testing_debugging_profiling
View on GitHub
Material for the class "Testing, debugging, profiling -- Python tools for building software"
☆14Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
manishpandit / speaker-recognition
View on GitHub
Text independent speaker recognition algorithm based on CNN
☆24Aug 30, 2025Updated 10 months ago
juanmc2005 / SpeakerEmbeddingLossComparison
View on GitHub
Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…
☆61Oct 7, 2020Updated 5 years ago
stas6626 / IDRnd
View on GitHub
ID R&D Voice Antispoofing Challenge Solution
☆11Jul 27, 2019Updated 7 years ago
emckiernan / electrophys
View on GitHub
Electrophysiology practicals for undergraduate students
☆13Mar 8, 2021Updated 5 years ago
asogaard / Wavenet
View on GitHub
C++ package for learning optimal wavelet bases using a neural network approach.
☆14Dec 2, 2016Updated 9 years ago
er537 / whisper_interpretability
View on GitHub
A repo to do interpretability of pre-trained acoustic models
☆15Oct 15, 2023Updated 2 years ago
stevedem / FormRecognizerAccelerator
View on GitHub
Solution Accelerator: Using Logic Apps & Form Recognizer
☆15Sep 22, 2023Updated 2 years ago
Sreyan88 / Disfluency-Detection-with-Span-Classification
View on GitHub
This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…
☆14Jun 6, 2023Updated 3 years ago
lvrysis / Audio-DNN-Classification
View on GitHub
Deep Neural Networks for audio classification
☆10Apr 11, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Isminoula / TextNormSeq2Seq
View on GitHub
Code and model files for paper: I. Lourentzou et al., Adapting Sequence to Sequence models for Text Normalization in Social Media", ICWSM…
☆38Jun 5, 2021Updated 5 years ago
vgaurav3011 / Statistics-for-Machine-Learning
View on GitHub
☆10Aug 13, 2020Updated 5 years ago
nsu-ai-team / conv1d-text-vae
View on GitHub
A variational autoencoder for text processing using 1D convolutions and the FastText word embeddings
☆12Dec 11, 2022Updated 3 years ago
giakoumoglou / rrd
View on GitHub
PyTorch implementation of RRD: https://arxiv.org/abs/2407.12073
☆15Dec 2, 2025Updated 7 months ago
parvatijay2901 / Hindi-ASR-and-TTS
View on GitHub
EC499: Major Project
☆11Jun 25, 2023Updated 3 years ago
arijitx / Amazon-Satelite-Image-Labeling
View on GitHub
This is my CS 763 Computer Vision Course Project , Here we try to label Amazon Satelite Images. Here we try to implement the Show and Tel…
☆12May 10, 2018Updated 8 years ago
rodosingh / Intro-NLP-IIITH
View on GitHub
Course Materials (along with assignments) for Intro to NLP, done as a part for requirement of the course "Introduction to NLP" (course-co…
☆10Jan 2, 2023Updated 3 years ago
primepake / F5-TTS-meanflow-multilingual
View on GitHub
Meanflow and multilingual for F5-TTS model
☆16Aug 23, 2025Updated 11 months ago
ferrinweb / voice-input-button
View on GitHub
A vue voice input button component, based on iFLYTEK speech api. 一个基于讯飞语音听写 api （旧版接口）的语音输入按钮 vue 组件。基于讯飞新版语音听写（流式版）api 的语音输入按钮 vue 组件请查看…
☆18Aug 27, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ranchlai / awesome-speaker-embedding
View on GitHub
A curated list of speaker-embedding speaker-verification, speaker-identification resources.
☆52Aug 12, 2021Updated 4 years ago
tashapiro / predicting-song-music-genre
View on GitHub
What part of a song is better at determining it's music genre - the music (audio features) or the lyrics (NLP) ?
☆14Jan 2, 2023Updated 3 years ago
ameya1995 / Constrictor
View on GitHub
An agent-first dependency and blast-radius explorer for Python codebases. Generates structured, machine-readable dependency graphs that A…
☆18Mar 18, 2026Updated 4 months ago
ZurichNLP / domain-robustness
View on GitHub
☆13Dec 11, 2020Updated 5 years ago
ZQSong1997 / AVMFN-For-Person-Verification
View on GitHub
Bimodal Adaptive Feature Fusion Network for Person Verification
☆20Jul 30, 2022Updated 3 years ago
dylanmikesell / PhaseWeightedStacking
View on GitHub
☆15Mar 21, 2015Updated 11 years ago
titu1994 / keras_novograd
View on GitHub
Keras implementation of NovoGrad
☆20Aug 21, 2020Updated 5 years ago