The project is related to the development of labs for the ITMO Speaker Recognition Course.
β15Apr 1, 2026Updated 2 weeks ago
Alternatives and similar repositories for sr_labs_book
Users that are interested in sr_labs_book are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π― Speech Recognition Challenge by Speech Lab - IIT Madrasβ10Nov 5, 2020Updated 5 years ago
- β15Jan 24, 2017Updated 9 years ago
- Speaker verification task with ECAPA-TDNN model (trained on Persian dataset)β12Sep 15, 2022Updated 3 years ago
- An attempt to replicate the results of [1706.08612] VoxCeleb: a large-scale speaker identification datasetβ12Dec 11, 2019Updated 6 years ago
- Latest PyTorch Implementation of DeltaGRU & DeltaLSTM that Exploits Temporal Sparsity in Sequential Dataβ17Sep 30, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- superfast text to speech in any voiceβ62Feb 16, 2026Updated 2 months ago
- β‘ Building applications with LLMs through composability β‘β19Jan 30, 2024Updated 2 years ago
- Lightweight python library for speaker diarization in real time implemented in pytorchβ11Oct 12, 2022Updated 3 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Yβ¦β25May 6, 2019Updated 6 years ago
- neural network and loss for asv implemented by PyTorch. (Triplet loss, LMCL, Angular Loss, Softmax)β21Oct 23, 2019Updated 6 years ago
- β16Mar 9, 2018Updated 8 years ago
- Slides and example code for the seminar presentation about general purpose computations on GPUβ12Jan 3, 2015Updated 11 years ago
- An HTTP server library in C++β16Jan 10, 2019Updated 7 years ago
- World Country Profiles Sourced from Wikipedia's Country Page Infoboxes Converted into JSON - Free Open Public Domain Dataβ14Dec 10, 2020Updated 5 years ago
- Deploy open-source AI quickly and easily - Bonus Offer β’ AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- β10Apr 2, 2024Updated 2 years ago
- ID R&D Voice Antispoofing Challenge Solutionβ11Jul 27, 2019Updated 6 years ago
- Electrophysiology practicals for undergraduate studentsβ13Mar 8, 2021Updated 5 years ago
- Solution Accelerator: Using Logic Apps & Form Recognizerβ15Sep 22, 2023Updated 2 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSPβ¦β61Oct 7, 2020Updated 5 years ago
- A repo to do interpretability of pre-trained acoustic modelsβ15Oct 15, 2023Updated 2 years ago
- C++ package for learning optimal wavelet bases using a neural network approach.β14Dec 2, 2016Updated 9 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Spβ¦β15Jun 6, 2023Updated 2 years ago
- Text independent speaker recognition algorithm based on CNNβ24Aug 30, 2025Updated 7 months ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Deep Neural Networks for audio classificationβ11Apr 11, 2024Updated 2 years ago
- Code and model files for paper: I. Lourentzou et al., Adapting Sequence to Sequence models for Text Normalization in Social Media", ICWSMβ¦β38Jun 5, 2021Updated 4 years ago
- Material for the class "Testing, debugging, profiling -- Python tools for building software"β14Nov 7, 2025Updated 5 months ago
- A variational autoencoder for text processing using 1D convolutions and the FastText word embeddingsβ12Dec 11, 2022Updated 3 years ago
- [arXiv 2024] PyTorch implementation of RRD: https://arxiv.org/abs/2407.12073β15Dec 2, 2025Updated 4 months ago
- EC499: Major Projectβ10Jun 25, 2023Updated 2 years ago
- This is my CS 763 Computer Vision Course Project , Here we try to label Amazon Satelite Images. Here we try to implement the Show and Telβ¦β12May 10, 2018Updated 7 years ago
- Bimodal Adaptive Feature Fusion Network for Person Verificationβ20Jul 30, 2022Updated 3 years ago
- What part of a song is better at determining it's music genre - the music (audio features) or the lyrics (NLP) ?β14Jan 2, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"β45Oct 30, 2025Updated 5 months ago
- A curated list of speaker-embedding speaker-verification, speaker-identification resources.β52Aug 12, 2021Updated 4 years ago
- Dan's repository of OpenFst (manually created by downloading certain versions of OpenFst), created to track certain patches.β13Mar 8, 2016Updated 10 years ago
- β13Dec 11, 2020Updated 5 years ago
- This is a curated list of awesome ASV(Automatic Speaker Verification) Anti-Spoofing papers, libraries, datasets, and other resources.β22May 21, 2021Updated 4 years ago
- β15Mar 21, 2015Updated 11 years ago
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguisβ¦β15Apr 3, 2022Updated 4 years ago