SincNet is a neural architecture for efficiently processing raw audio samples.
☆1,240Apr 28, 2021Updated 5 years ago
Alternatives and similar repositories for SincNet
Users that are interested in SincNet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…☆2,398Mar 14, 2022Updated 4 years ago
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆75May 18, 2021Updated 5 years ago
- Problem Agnostic Speech Encoder☆447Jul 6, 2023Updated 2 years ago
- Official repository for RawNet, RawNet2, and RawNet3☆403Mar 21, 2024Updated 2 years ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.☆599Jan 20, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Utterance-level Aggregation For Speaker Recognition In The Wild☆372Mar 24, 2023Updated 3 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆321Nov 11, 2020Updated 5 years ago
- In defence of metric learning for speaker recognition☆1,166Apr 22, 2026Updated 3 weeks ago
- Deep Speaker: an End-to-End Neural Speaker Embedding System.☆941Apr 13, 2024Updated 2 years ago
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆103Mar 18, 2019Updated 7 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆378Jun 16, 2023Updated 2 years ago
- An Open Source Tools for Speaker Recognition☆636Aug 5, 2024Updated last year
- Speaker embedding(verification and recognition) using Pytorch☆369Jul 24, 2020Updated 5 years ago
- This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su…☆1,588Sep 25, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A library for speech data augmentation in time-domain☆687Aug 30, 2021Updated 4 years ago
- Tensorflow implementation of "Generalized End-to-End Loss for Speaker Verification"☆367Oct 9, 2021Updated 4 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆213Jul 17, 2020Updated 5 years ago
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain☆655Apr 5, 2022Updated 4 years ago
- A PyTorch-based Speech Toolkit☆11,548Updated this week
- A Python wrapper for Kaldi☆1,035Nov 30, 2025Updated 5 months ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit☆2,554Mar 12, 2026Updated 2 months ago
- Neural speaker recognition/verification system based on Kaldi and Tensorflow☆31Jun 30, 2020Updated 5 years ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆1,867Jul 22, 2025Updated 9 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆499Jul 1, 2021Updated 4 years ago
- The PyTorch-based audio source separation toolkit for researchers☆2,568Updated this week
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆46Oct 3, 2023Updated 2 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Jan 27, 2020Updated 6 years ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆1,147Nov 24, 2025Updated 5 months ago
- Audio processing by using pytorch 1D convolution network☆1,123Dec 7, 2025Updated 5 months ago
- VGGVox models for Speaker Identification and Verification trained on the VoxCeleb (1 & 2) datasets☆401Feb 4, 2019Updated 7 years ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆449Aug 12, 2025Updated 9 months ago
- Unofficial PyTorch implementation of Google AI's VoiceFilter system☆1,205Jul 25, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks…☆526Mar 1, 2022Updated 4 years ago
- [InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei …☆208Dec 8, 2022Updated 3 years ago
- End-to-End Speech Processing Toolkit☆9,836Updated this week
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.☆2,270Apr 13, 2026Updated last month
- This library provides common speech features for ASR including MFCCs and filterbank energies.☆2,423Oct 20, 2021Updated 4 years ago
- Deep Learning & 3D Convolutional Neural Networks for Speaker Verification☆790Mar 3, 2020Updated 6 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆2,872Updated this week