joaoantoniocn / AM-MobileNet1D
The Additive Margin MobileNet1D is a new lightweight deep learning model for speaker recognition, based on the MobileNetV2 architecture and the Additive Margin Softmax (AM-Softmax) loss function.
☆30Oct 3, 2023Updated 2 years ago
Alternatives and similar repositories for AM-MobileNet1D
Users who are interested in AM-MobileNet1D are comparing it to the libraries listed below.
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based on the neural network architec…☆46Oct 3, 2023Updated 2 years ago
- ☆17Jun 30, 2020Updated 5 years ago
- This is the home directory for the speaker diarization module being developed for Heterogeneous News data in RedHen Labs as a GSOC Project☆10Sep 11, 2015Updated 10 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- This is an extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Keras implementation of SincNet (https://github.com/mravanelli/SincNet, https://arxiv.org/abs/1808.00158)☆12Aug 5, 2018Updated 7 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- An Android app that listens to conversations and determines who was speaking at any point in the conversation - a task known as speech di…☆14Apr 12, 2021Updated 4 years ago
- Zero-Resource Speech Discovery, Search, and Evaluation Tools☆29Aug 6, 2015Updated 10 years ago
- A fork of Idiap Research Institute's DiarTk diarization toolkit☆16Feb 20, 2016Updated 9 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Mar 6, 2023Updated 2 years ago
- Phone-level evaluation of L2 speakers (GOP algorithm)☆27Mar 1, 2017Updated 8 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆50Feb 1, 2017Updated 9 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fast.☆16Jun 17, 2022Updated 3 years ago
- This repository☆30Nov 13, 2022Updated 3 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Nov 23, 2021Updated 4 years ago
- Custom decoders for Kaldi☆80Jun 10, 2019Updated 6 years ago
- DNN-based speech enhancement using Tensorflow by Haoyu Li (Tokyo univ.)☆16Aug 31, 2017Updated 8 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- ☆17Apr 8, 2016Updated 9 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi.☆18Dec 21, 2021Updated 4 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition.☆36Oct 4, 2019Updated 6 years ago
- An open-source tool for automatic speech recognition (ASR) quality estimation.☆23Dec 12, 2019Updated 6 years ago
- Time-domain Audio Separation Network☆24Aug 3, 2018Updated 7 years ago
- This is now the official location of the Kaldi project.☆10Aug 22, 2019Updated 6 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 9 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Jun 12, 2023Updated 2 years ago
- (semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean☆23Dec 17, 2017Updated 8 years ago
- ☆21Sep 24, 2018Updated 7 years ago
- "Automated Speech Recognition System" in Machine Learning and Having it Deep and Structured, Spring 2015☆21Nov 25, 2016Updated 9 years ago
- TristouNet: Triplet Loss for Speaker Turn Embedding☆122Jul 6, 2017Updated 8 years ago
- ☆13Oct 3, 2025Updated 4 months ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- ☆41Jun 25, 2018Updated 7 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Jun 14, 2018Updated 7 years ago
- A C++ library for parsing and manipulating JSGF grammar files.☆14Feb 13, 2024Updated 2 years ago