vladimirstarygin / Subcenter-ArcFace-PytorchLinks
Train and filter data using Subcenter ArcFace model in Pytorch
☆15Updated 3 years ago
Alternatives and similar repositories for Subcenter-ArcFace-Pytorch
Users that are interested in Subcenter-ArcFace-Pytorch are comparing it to the libraries listed below
Sorting:
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆149Updated 2 years ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen☆69Updated 3 years ago
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆42Updated 8 months ago
- SpeechYOLO Interspeech 2019☆44Updated 3 years ago
- Source code for models described in the paper "ESResNet: Environmental Sound Classification Based on Visual Domain Models" (https://arxiv…☆33Updated 2 years ago
- Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection (ECCV 2022)☆65Updated last year
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"☆26Updated 3 years ago
- Source code for models described in the paper "ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio" (https://arxiv.o…☆46Updated 4 years ago
- ☆19Updated 8 months ago
- Audio-Visual Active Speaker Detection with PyTorch on AVA-ActiveSpeaker dataset☆66Updated 3 years ago
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.☆42Updated 2 years ago
- PyTorch transcribed audioset classifier, including VGGish and YAMNet, along with utils to manipulate autioset category ontology.☆86Updated 4 months ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 4 years ago
- ☆87Updated 2 years ago
- ☆16Updated 4 years ago
- Adapting a ConvNeXt model to audio classification on AudioSet☆25Updated 6 months ago
- Implementations of Recent Papers in Computer Vision☆38Updated 2 years ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated 2 years ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Updated 2 years ago
- ☆28Updated 2 years ago
- Active Speaker Detection☆19Updated 5 years ago
- PyTorch reimplementation of per-channel energy normalization for audio.☆101Updated 6 years ago
- SELD-TCN: Sound Event Detection & Localization via Temporal Convolutional Network | Python w/ Tensorflow☆65Updated 4 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated last year
- BirdCLEF 2021 - Birdcall Identification 4th place solution☆50Updated 4 years ago
- Official PyTorch implementation of "Attention-Free Keyword Spotting", Mashrur. M. Morshed & Ahmad Omar Ahsan, PML4DC @ ICLR 2022.☆15Updated 2 years ago
- A Pytorch (support batch and channel) implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech…☆13Updated last year
- INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues☆54Updated 2 years ago
- Pytorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.☆18Updated 3 years ago
- ☆93Updated 2 years ago