Leethony / Additive-Margin-Softmax-Loss-Pytorch
Additive margin softmax loss in pytorch
☆46Updated 5 years ago
Alternatives and similar repositories for Additive-Margin-Softmax-Loss-Pytorch:
Users that are interested in Additive-Margin-Softmax-Loss-Pytorch are comparing it to the libraries listed below
- Code for the Active Speakers in Context Paper (CVPR2020)☆53Updated 3 years ago
- Augmentation adversarial training for self-supervised speaker recognition☆77Updated 3 years ago
- Source code for models described in the paper "ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio" (https://arxiv.o…☆45Updated 3 years ago
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"☆25Updated 2 years ago
- ☆21Updated 3 years ago
- Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)☆73Updated 4 years ago
- Official implementation of FOP method as described in "Fusion and Orthogonal Projection for Improved Face-Voice Association"☆16Updated last year
- Code for DCASE 2019 Task 1a, 1b and 1c☆21Updated 7 months ago
- Metric Learning (npair loss & angular loss) on mnist and Visualizing by t_SNE☆35Updated 2 years ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆142Updated last year
- Speaker recognition ,Voiceprint recognition☆52Updated 5 years ago
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆89Updated 7 months ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆48Updated 5 years ago
- Pre-training Cross-modal Transformer for Audio-and-Language Representations☆39Updated 3 years ago
- Implementation of the paper "Attentive Statistics Pooling for Deep Speaker Embedding" in Pytorch☆43Updated 4 years ago
- The official repository for Audio ALBERT☆64Updated 3 years ago
- Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"☆111Updated 4 years ago
- ☆9Updated 3 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆66Updated 2 years ago
- Generalized cross-modal NNs; new audiovisual benchmark (IEEE TNNLS 2019)☆25Updated 4 years ago
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆42Updated last month
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆213Updated last year
- Emotion recognition library for PyTorch☆21Updated 4 years ago
- Code on selecting an action based on multimodal inputs. Here in this case inputs are voice and text.☆69Updated 3 years ago
- Pytorch code for "Rethinking CNN Models for Audio Classification"☆126Updated 3 years ago
- A minimal pytorch package implementing a gradient reversal layer.☆157Updated 3 months ago
- Code for Discriminative Sounding Objects Localization (NeurIPS 2020)☆57Updated 3 years ago
- Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…☆12Updated 4 years ago
- Script for converting the pretrained VGGish model provided with AudioSet from TensorFlow to PyTorch, along with a basic smoke test.☆85Updated 5 years ago