Implementing VGGVox for Speaker Identification on the VoxCeleb1 dataset in PyTorch.
☆25 · Oct 15, 2020 · Updated 5 years ago
Alternatives and similar repositories for VGGVox-PyTorch
Users who are interested in VGGVox-PyTorch are comparing it to the libraries listed below.
- Implementation of the VGGVox network using TensorFlow. ☆16 · Sep 1, 2019 · Updated 6 years ago
- acnn for text-independent speaker recognition ☆10 · Feb 8, 2022 · Updated 4 years ago
- ☆17 · Jan 26, 2021 · Updated 5 years ago
- ☆21 · Apr 6, 2021 · Updated 4 years ago
- SVHF-Net for Cross-modal binary matching ☆32 · Aug 22, 2018 · Updated 7 years ago
- In defence of metric learning for speaker recognition ☆1,165 · Mar 26, 2024 · Updated last year
- ☆11 · Sep 4, 2023 · Updated 2 years ago
- Speech Emotion Recognition using Deep Learning ☆12 · May 24, 2021 · Updated 4 years ago
- Source code for paper "Trajectory of Alternating Direction Method of Multipliers and Adaptive Acceleration" of NeurIPS 2019 ☆10 · Jan 25, 2024 · Updated 2 years ago
- Augmentation adversarial training for self-supervised speaker recognition ☆78 · Aug 15, 2021 · Updated 4 years ago
- ☆15 · Apr 4, 2023 · Updated 2 years ago
- Predicting Political Instability and Social Conflicts Using Multimodal Data ☆10 · Jun 6, 2016 · Updated 9 years ago
- TaiYiXLCheckpointLoader: an unofficial node supporting the Taiyi-Diffusion-XL (Taiyi-XL) Chinese-English bilingual language model ☆11 · Sep 1, 2024 · Updated last year
- PyTorch re-implementation of some papers on image captioning ☆14 · Apr 22, 2021 · Updated 4 years ago
- [AAAI 2024] SAAS - Official PyTorch Implementation ☆11 · Mar 28, 2024 · Updated last year
- ☆12 · Apr 26, 2022 · Updated 3 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation ☆12 · Jan 27, 2023 · Updated 3 years ago
- PyTorch implementation of "Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal T… ☆12 · Mar 9, 2024 · Updated last year
- ☆10 · Feb 19, 2021 · Updated 5 years ago
- Implementation of "Face detection in untrained deep neural networks" (Baek et al., Nature Communications, 2021) ☆10 · Nov 2, 2021 · Updated 4 years ago
- An experimental custom seq-2-seq model with both layer-wise (inter-layer), and intra-layer attention (attention to previous hidden states… ☆10 · Nov 30, 2017 · Updated 8 years ago
- Network architecture for "Volumetric ConvNets with Mixed Residual Connections for Automated Prostate Segmentation from 3D MR Images" ☆10 · Feb 28, 2017 · Updated 9 years ago
- A Neat Litho-Resist Simulator ☆22 · Oct 13, 2025 · Updated 4 months ago
- A model combining Deep Neural Networks and (Stochastic) Random Forests. ☆14 · Jun 5, 2018 · Updated 7 years ago
- Bayesian modelling approach for detecting RNA flexibility changes in high-throughput structure probing data under different conditions, b… ☆11 · Dec 8, 2022 · Updated 3 years ago
- Tools to isolate speaker and transcribe unstructured audio clips ☆11 · Dec 4, 2022 · Updated 3 years ago
- ☆10 · Apr 17, 2021 · Updated 4 years ago
- DeepFovea++: Reconstruction and Super-Resolution for Natural Foveated Rendered Videos (PyTorch). ☆10 · Mar 28, 2022 · Updated 3 years ago
- PyTorch implementation of A Neural Algorithm of Artistic Style ☆10 · Dec 20, 2019 · Updated 6 years ago
- Official implementation of BPA (CVPR 2022) ☆13 · Jun 17, 2022 · Updated 3 years ago
- Real-time MelGAN on CPU ☆13 · Dec 3, 2019 · Updated 6 years ago
- ☆21 · Oct 7, 2025 · Updated 5 months ago
- RAdam + Lookahead implemented in TensorFlow ☆11 · Oct 14, 2019 · Updated 6 years ago
- This is an intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that use… ☆10 · Jan 25, 2021 · Updated 5 years ago
- Official PyTorch implementation for "AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction in the Wild… ☆12 · Jun 26, 2025 · Updated 8 months ago
- ☆11 · Mar 11, 2025 · Updated 11 months ago
- Create speaker voiceprints from a few seconds of audio, and identify individuals in real-time streaming or recorded conversations. ☆14 · Feb 4, 2019 · Updated 7 years ago
- Human Pose Estimation in Real-World Metric Coordinates ☆12 · Jul 6, 2023 · Updated 2 years ago
- ☆12 · Apr 8, 2024 · Updated last year