The project is related to the development of labs for the ITMO Speaker Recognition Course.
β16Apr 1, 2026Updated last month
Alternatives and similar repositories for sr_labs_book
Users that are interested in sr_labs_book are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π― Speech Recognition Challenge by Speech Lab - IIT Madrasβ10Nov 5, 2020Updated 5 years ago
- β15Jan 24, 2017Updated 9 years ago
- Speaker verification task with ECAPA-TDNN model (trained on Persian dataset)β12Sep 15, 2022Updated 3 years ago
- An attempt to replicate the results of [1706.08612] VoxCeleb: a large-scale speaker identification datasetβ12Dec 11, 2019Updated 6 years ago
- Latest PyTorch Implementation of DeltaGRU & DeltaLSTM that Exploits Temporal Sparsity in Sequential Dataβ17Sep 30, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- β‘ Building applications with LLMs through composability β‘β19Jan 30, 2024Updated 2 years ago
- Lightweight python library for speaker diarization in real time implemented in pytorchβ11Oct 12, 2022Updated 3 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Yβ¦β25May 6, 2019Updated 7 years ago
- β16Mar 9, 2018Updated 8 years ago
- neural network and loss for asv implemented by PyTorch. (Triplet loss, LMCL, Angular Loss, Softmax)β21Oct 23, 2019Updated 6 years ago
- Slides and example code for the seminar presentation about general purpose computations on GPUβ12Jan 3, 2015Updated 11 years ago
- An HTTP server library in C++β16Jan 10, 2019Updated 7 years ago
- World Country Profiles Sourced from Wikipedia's Country Page Infoboxes Converted into JSON - Free Open Public Domain Dataβ14Dec 10, 2020Updated 5 years ago
- β10Apr 2, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ID R&D Voice Antispoofing Challenge Solutionβ11Jul 27, 2019Updated 6 years ago
- Electrophysiology practicals for undergraduate studentsβ13Mar 8, 2021Updated 5 years ago
- superfast text to speech in any voiceβ63Feb 16, 2026Updated 2 months ago
- Solution Accelerator: Using Logic Apps & Form Recognizerβ15Sep 22, 2023Updated 2 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSPβ¦β61Oct 7, 2020Updated 5 years ago
- A repo to do interpretability of pre-trained acoustic modelsβ15Oct 15, 2023Updated 2 years ago
- C++ package for learning optimal wavelet bases using a neural network approach.β14Dec 2, 2016Updated 9 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Spβ¦β15Jun 6, 2023Updated 2 years ago
- Text independent speaker recognition algorithm based on CNNβ24Aug 30, 2025Updated 8 months ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Deep Neural Networks for audio classificationβ11Apr 11, 2024Updated 2 years ago
- Code and model files for paper: I. Lourentzou et al., Adapting Sequence to Sequence models for Text Normalization in Social Media", ICWSMβ¦β38Jun 5, 2021Updated 4 years ago
- Material for the class "Testing, debugging, profiling -- Python tools for building software"β14Nov 7, 2025Updated 6 months ago
- A variational autoencoder for text processing using 1D convolutions and the FastText word embeddingsβ12Dec 11, 2022Updated 3 years ago
- PyTorch implementation of RRD: https://arxiv.org/abs/2407.12073β15Dec 2, 2025Updated 5 months ago
- EC499: Major Projectβ10Jun 25, 2023Updated 2 years ago
- This is my CS 763 Computer Vision Course Project , Here we try to label Amazon Satelite Images. Here we try to implement the Show and Telβ¦β12May 10, 2018Updated 7 years ago
- Bimodal Adaptive Feature Fusion Network for Person Verificationβ20Jul 30, 2022Updated 3 years ago
- What part of a song is better at determining it's music genre - the music (audio features) or the lyrics (NLP) ?β14Jan 2, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"β45Oct 30, 2025Updated 6 months ago
- A curated list of speaker-embedding speaker-verification, speaker-identification resources.β52Aug 12, 2021Updated 4 years ago
- Dan's repository of OpenFst (manually created by downloading certain versions of OpenFst), created to track certain patches.β13Mar 8, 2016Updated 10 years ago
- β13Dec 11, 2020Updated 5 years ago
- This is a curated list of awesome ASV(Automatic Speaker Verification) Anti-Spoofing papers, libraries, datasets, and other resources.β22May 21, 2021Updated 4 years ago
- β15Mar 21, 2015Updated 11 years ago
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguisβ¦β15Apr 3, 2022Updated 4 years ago