The project is related to the development of labs for the ITMO Speaker Recognition Course.
β15Mar 23, 2026Updated this week
Alternatives and similar repositories for sr_labs_book
Users that are interested in sr_labs_book are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π― Speech Recognition Challenge by Speech Lab - IIT Madrasβ10Nov 5, 2020Updated 5 years ago
- β15Jan 24, 2017Updated 9 years ago
- Speaker verification task with ECAPA-TDNN model (trained on Persian dataset)β12Sep 15, 2022Updated 3 years ago
- An attempt to replicate the results of [1706.08612] VoxCeleb: a large-scale speaker identification datasetβ12Dec 11, 2019Updated 6 years ago
- Latest PyTorch Implementation of DeltaGRU & DeltaLSTM that Exploits Temporal Sparsity in Sequential Dataβ16Sep 30, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- superfast text to speech in any voiceβ61Feb 16, 2026Updated last month
- β‘ Building applications with LLMs through composability β‘β17Jan 30, 2024Updated 2 years ago
- Lightweight python library for speaker diarization in real time implemented in pytorchβ10Oct 12, 2022Updated 3 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Yβ¦β25May 6, 2019Updated 6 years ago
- neural network and loss for asv implemented by PyTorch. (Triplet loss, LMCL, Angular Loss, Softmax)β21Oct 23, 2019Updated 6 years ago
- β16Mar 9, 2018Updated 8 years ago
- Slides and example code for the seminar presentation about general purpose computations on GPUβ12Jan 3, 2015Updated 11 years ago
- An HTTP server library in C++β16Jan 10, 2019Updated 7 years ago
- World Country Profiles Sourced from Wikipedia's Country Page Infoboxes Converted into JSON - Free Open Public Domain Dataβ14Dec 10, 2020Updated 5 years ago
- NordVPN Special Discount Offer β’ AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- β10Apr 2, 2024Updated last year
- ID R&D Voice Antispoofing Challenge Solutionβ11Jul 27, 2019Updated 6 years ago
- Electrophysiology practicals for undergraduate studentsβ13Mar 8, 2021Updated 5 years ago
- Solution Accelerator: Using Logic Apps & Form Recognizerβ15Sep 22, 2023Updated 2 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSPβ¦β61Oct 7, 2020Updated 5 years ago
- A repo to do interpretability of pre-trained acoustic modelsβ15Oct 15, 2023Updated 2 years ago
- C++ package for learning optimal wavelet bases using a neural network approach.β14Dec 2, 2016Updated 9 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Spβ¦β15Jun 6, 2023Updated 2 years ago
- Text independent speaker recognition algorithm based on CNNβ24Aug 30, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Deep Neural Networks for audio classificationβ11Apr 11, 2024Updated last year
- Code and model files for paper: I. Lourentzou et al., Adapting Sequence to Sequence models for Text Normalization in Social Media", ICWSMβ¦β38Jun 5, 2021Updated 4 years ago
- Material for the class "Testing, debugging, profiling -- Python tools for building software"β14Nov 7, 2025Updated 4 months ago
- A variational autoencoder for text processing using 1D convolutions and the FastText word embeddingsβ12Dec 11, 2022Updated 3 years ago
- [arXiv 2024] PyTorch implementation of RRD: https://arxiv.org/abs/2407.12073β15Dec 2, 2025Updated 3 months ago
- EC499: Major Projectβ10Jun 25, 2023Updated 2 years ago
- This is my CS 763 Computer Vision Course Project , Here we try to label Amazon Satelite Images. Here we try to implement the Show and Telβ¦β13May 10, 2018Updated 7 years ago
- Bimodal Adaptive Feature Fusion Network for Person Verificationβ20Jul 30, 2022Updated 3 years ago
- Cognitive memory for AI agents. Pure Rust, <1ms recall, 2.7MB, zero cloud. Patent Pending.β55Mar 17, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- What part of a song is better at determining it's music genre - the music (audio features) or the lyrics (NLP) ?β14Jan 2, 2023Updated 3 years ago
- Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"β44Oct 30, 2025Updated 4 months ago
- A curated list of speaker-embedding speaker-verification, speaker-identification resources.β52Aug 12, 2021Updated 4 years ago
- β13Dec 11, 2020Updated 5 years ago
- Dan's repository of OpenFst (manually created by downloading certain versions of OpenFst), created to track certain patches.β13Mar 8, 2016Updated 10 years ago
- This is a curated list of awesome ASV(Automatic Speaker Verification) Anti-Spoofing papers, libraries, datasets, and other resources.β22May 21, 2021Updated 4 years ago
- β15Mar 21, 2015Updated 11 years ago