SincNet is a neural architecture for efficiently processing raw audio samples.
☆1,233Apr 28, 2021Updated 4 years ago
Alternatives and similar repositories for SincNet
Users that are interested in SincNet are comparing it to the libraries listed below
Sorting:
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…☆2,396Mar 14, 2022Updated 3 years ago
- Problem Agnostic Speech Encoder☆447Jul 6, 2023Updated 2 years ago
- Official repository for RawNet, RawNet2, and RawNet3☆397Mar 21, 2024Updated last year
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆75May 18, 2021Updated 4 years ago
- Utterance-level Aggregation For Speaker Recognition In The Wild☆372Mar 24, 2023Updated 2 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆320Nov 11, 2020Updated 5 years ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.☆597Jan 20, 2022Updated 4 years ago
- Deep Speaker: an End-to-End Neural Speaker Embedding System.☆939Apr 13, 2024Updated last year
- In defence of metric learning for speaker recognition☆1,165Mar 26, 2024Updated last year
- A library for speech data augmentation in time-domain☆683Aug 30, 2021Updated 4 years ago
- An Open Source Tools for Speaker Recognition☆635Aug 5, 2024Updated last year
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆103Mar 18, 2019Updated 6 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆378Jun 16, 2023Updated 2 years ago
- Speaker embedding(verification and recognition) using Pytorch☆369Jul 24, 2020Updated 5 years ago
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain☆656Apr 5, 2022Updated 3 years ago
- This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su…☆1,588Sep 25, 2024Updated last year
- Self-Supervised Speech Pre-training and Representation Learning Toolkit☆2,533Jun 13, 2025Updated 8 months ago
- Audio processing by using pytorch 1D convolution network☆1,117Dec 7, 2025Updated 2 months ago
- The PyTorch-based audio source separation toolkit for researchers☆2,544Oct 6, 2025Updated 4 months ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆1,135Nov 24, 2025Updated 3 months ago
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆498Jul 1, 2021Updated 4 years ago
- Unofficial PyTorch implementation of Google AI's VoiceFilter system☆1,192Jul 25, 2024Updated last year
- A Python wrapper for Kaldi☆1,030Nov 30, 2025Updated 3 months ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆1,848Jul 22, 2025Updated 7 months ago
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.☆2,232Dec 27, 2025Updated 2 months ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆440Aug 12, 2025Updated 6 months ago
- Tensorflow implementation of "Generalized End-to-End Loss for Speaker Verification"☆369Oct 9, 2021Updated 4 years ago
- A PyTorch-based Speech Toolkit☆11,243Feb 11, 2026Updated 2 weeks ago
- LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks…☆520Mar 1, 2022Updated 4 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆46Oct 3, 2023Updated 2 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆212Jul 17, 2020Updated 5 years ago
- Neural speaker recognition/verification system based on Kaldi and Tensorflow☆32Jun 30, 2020Updated 5 years ago
- End-to-End Speech Processing Toolkit☆9,747Updated this week
- This library provides common speech features for ASR including MFCCs and filterbank energies.☆2,422Oct 20, 2021Updated 4 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆2,833Updated this week
- VGGVox models for Speaker Identification and Verification trained on the VoxCeleb (1 & 2) datasets☆399Feb 4, 2019Updated 7 years ago
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning☆191Jan 29, 2020Updated 6 years ago
- [InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei …☆209Dec 8, 2022Updated 3 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Jan 27, 2020Updated 6 years ago