rwightman / pytorch-commands
Some PyTorch code for the Kaggle Speech Recognition Challenge
☆12 Updated 6 years ago
Alternatives and similar repositories for pytorch-commands:
Users who are interested in pytorch-commands are comparing it to the libraries listed below:
- Code for our paper "Acoustic Features Fusion using Attentive Multi-channel Deep Architecture" in Keras and TensorFlow ☆26 Updated 6 years ago
- Code and instructions for replicating the experiments in the paper "Unified Hypersphere Embedding for Speaker Recognition" ☆31 Updated 5 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based on the neural network architec… ☆43 Updated last year
- PyTorch implementation of the Feed-Forward Attention Mechanism. ☆18 Updated 6 years ago
- The History of Speech Recognition to the Year 2030 ☆12 Updated 3 years ago
- ASR project with pytorch-lightning ☆20 Updated this week
- SE-ResNet+AMSoftmax for Speaker Verification ☆47 Updated 6 years ago
- Adversarial Unsupervised Domain Adaptation for Acoustic Scene Classification ☆35 Updated 6 years ago
- A PyTorch implementation of SimSiam based on CVPR 2021 paper "Exploring Simple Siamese Representation Learning" ☆10 Updated 4 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility. ☆32 Updated last year
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity ☆12 Updated 4 years ago
- Code for the paper "Learning to Fool the Speaker Recognition" ☆10 Updated 4 years ago
- Code for ICASSP 2019 paper ☆18 Updated 6 years ago
- Coordinate-wise meta-learner for speaker adaptation of ASR models. ☆20 Updated 5 years ago
- Anonymous ICLR Submission ☆14 Updated 5 years ago
- ☆15 Updated 5 years ago
- Surrey CVSSP DCASE 2018 Task 2 system ☆19 Updated 2 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model ☆26 Updated last year
- The Additive Margin MobileNet1D is a new lightweight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi… ☆30 Updated last year
- Source code for "Towards a Deeper Understanding of Adversarial Losses under a Discriminative Adversarial Network Setting" ☆42 Updated 2 years ago
- ☆17 Updated 3 years ago
- Weakly-supervised visual instrument-playing detection ☆10 Updated 3 years ago
- ☆27 Updated 5 years ago
- Curriculum Vitae of Quan Wang ☆15 Updated 2 months ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach ☆20 Updated 3 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer ☆19 Updated 4 years ago
- ASR, End-to-End, end2end, Speech Recognition, end-to-end speech recognition ☆12 Updated 4 years ago
- Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks ☆18 Updated 5 years ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem". ☆12 Updated 3 years ago
- This repository contains the source code for SoftCTC. The original paper can be found here: https://arxiv.org/abs/2212.02135 ☆19 Updated 2 years ago