Leethony / Additive-Margin-Softmax-Loss-Pytorch
Additive margin softmax loss in PyTorch
☆46 Updated 5 years ago
Alternatives and similar repositories for Additive-Margin-Softmax-Loss-Pytorch:
Users who are interested in Additive-Margin-Softmax-Loss-Pytorch are comparing it to the libraries listed below.
- A minimal PyTorch package implementing a gradient reversal layer. ☆158 Updated 5 months ago
- Metric learning (N-pair loss and angular loss) on MNIST, with t-SNE visualization ☆35 Updated 2 years ago
- Pre-training Cross-modal Transformer for Audio-and-Language Representations ☆39 Updated 4 years ago
- PyTorch implementation of the paper "Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training" ☆42 Updated 4 months ago
- Code for Discriminative Sounding Objects Localization (NeurIPS 2020) ☆57 Updated 3 years ago
- Source code for models described in the paper "ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio" (https://arxiv.o… ☆46 Updated 3 years ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen ☆66 Updated 2 years ago
- Official repository for "Orthogonal Projection Loss" (ICCV'21) ☆121 Updated 3 years ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021. ☆93 Updated 2 years ago
- ☆21 Updated 4 years ago
- COLA contrastive pre-training method implemented in PyTorch ☆43 Updated 4 years ago
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations ☆90 Updated 10 months ago
- Implementation of the paper "Attentive Statistics Pooling for Deep Speaker Embedding" in PyTorch ☆43 Updated 4 years ago
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition" ☆25 Updated 2 years ago
- Augmentation adversarial training for self-supervised speaker recognition ☆77 Updated 3 years ago
- The official repository for Audio ALBERT ☆65 Updated 3 years ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation". ☆144 Updated last year
- Acoustic Scene Classification Using Deep Residual Networks with Late Fusion of Separated High and Low Frequency Paths - McDonnell and Gao… ☆21 Updated 9 months ago
- Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video" ☆113 Updated 4 years ago
- A PyTorch implementation of the paper "SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification" ☆31 Updated 3 years ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer ☆88 Updated 2 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based on the neural network architec… ☆44 Updated last year
- Official implementation of FOP method as described in "Fusion and Orthogonal Projection for Improved Face-Voice Association" ☆18 Updated last year
- Source code for the paper 'Audio Captioning Transformer' ☆54 Updated 3 years ago
- Implementation of our paper 'On Metric Learning For Audio-Text Cross-Modal Retrieval' ☆43 Updated 2 years ago
- Implementing VGGVox for Speaker Identification on VoxCeleb1 dataset in PyTorch. ☆25 Updated 4 years ago
- Self-Supervised Learning by Cross-Modal Audio-Video Clustering (NeurIPS 2020) ☆90 Updated 2 years ago
- Code for the Active Speakers in Context Paper (CVPR2020) ☆54 Updated 3 years ago
- PyTorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020) ☆73 Updated 4 years ago
- Generalized cross-modal NNs; new audiovisual benchmark (IEEE TNNLS 2019) ☆26 Updated 5 years ago