Leethony / Additive-Margin-Softmax-Loss-Pytorch
Additive margin softmax loss in pytorch
☆48 Updated 6 years ago
Alternatives and similar repositories for Additive-Margin-Softmax-Loss-Pytorch
Users who are interested in Additive-Margin-Softmax-Loss-Pytorch are comparing it to the libraries listed below.
- Generalized cross-modal NNs; new audiovisual benchmark (IEEE TNNLS 2019) ☆30 Updated 5 years ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation". ☆149 Updated 2 years ago
- A minimal pytorch package implementing a gradient reversal layer. ☆158 Updated last year
- PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI ☆184 Updated 2 years ago
- Source code for models described in the paper "ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio" (https://arxiv.o… ☆47 Updated 4 years ago
- Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using… ☆14 Updated 5 years ago
- Implementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch ☆73 Updated 4 years ago
- Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video" ☆115 Updated 5 years ago
- Code on selecting an action based on multimodal inputs. Here in this case inputs are voice and text. ☆73 Updated 4 years ago
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition" ☆26 Updated 3 years ago
- Gradient Reversal Layer for Domain Adaptation ☆131 Updated 3 years ago
- Metric Learning (npair loss & angular loss) on mnist and Visualizing by t_SNE ☆35 Updated 2 years ago
- Source code for models described in the paper "ESResNet: Environmental Sound Classification Based on Visual Domain Models" (https://arxiv… ☆34 Updated 2 years ago
- PyTorch samplers that output roughly balanced batches with support for multilabel datasets ☆56 Updated last year
- Code for the Active Speakers in Context Paper (CVPR2020) ☆56 Updated 4 years ago
- Pytorch code for "Rethinking CNN Models for Audio Classification" ☆129 Updated 4 years ago
- Angular penalty loss functions in Pytorch (ArcFace, SphereFace, Additive Margin, CosFace) ☆498 Updated 2 years ago
- Acoustic Scene Classification Using Deep Residual Networks with Late Fusion of Separated High and Low Frequency Paths - McDonnell and Gao… ☆22 Updated last year
- Script for converting the pretrained VGGish model provided with AudioSet from TensorFlow to PyTorch, along with a basic smoke test. ☆87 Updated 6 years ago
- PyTorch implementation of some learning rate schedulers for deep learning researcher. ☆91 Updated 3 years ago
- Pytorch port of Google Research's VGGish model used for extracting audio features. ☆408 Updated 4 years ago
- Official PyTorch implementation of paper Leveraging Unimodal Self Supervised Learning for Multimodal Audio-Visual Speech Recognition (ACL… ☆68 Updated 3 years ago
- PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021) ☆20 Updated 3 years ago
- my codes for learning attention mechanism ☆51 Updated 5 years ago
- Cross-model active contrastive coding ☆22 Updated 4 years ago
- Implementation of the convolutional module from the Conformer paper, for use in Transformers ☆433 Updated 2 years ago
- ☆12 Updated 4 years ago
- Code for Discriminative Sounding Objects Localization (NeurIPS 2020) ☆59 Updated 4 years ago
- VGGSound: A Large-scale Audio-Visual Dataset ☆350 Updated 4 years ago
- Self-Supervised Learning by Cross-Modal Audio-Video Clustering (NeurIPS 2020) ☆91 Updated 3 years ago