☆30Feb 21, 2019Updated 7 years ago
Alternatives and similar repositories for pytorch-soundnet
Users that are interested in pytorch-soundnet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- converting the pretrained tensorflow SoundNet model to pytorch☆14Jun 15, 2022Updated 3 years ago
- This repository contains code for classification of sound using spectrograms. We train a CNN to classify the sounds after converting to s…☆10Dec 14, 2018Updated 7 years ago
- TensorFlow implementation of "Attentive Modality Hopping for Speech Emotion Recognition," ICASSP-20☆33Aug 10, 2020Updated 5 years ago
- SoundNet, built in Keras with pre-trained 8-layer model.☆29Oct 15, 2019Updated 6 years ago
- SoundNet: Learning Sound Representations from Unlabeled Video. NIPS 2016☆464Oct 7, 2017Updated 8 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- TensorFlow implementation of "SoundNet".☆145Mar 26, 2018Updated 8 years ago
- Pytorch implementation of DSR-RL for Video Summarization Task☆12Aug 30, 2021Updated 4 years ago
- This repository contains the code for our ICASSP paper `Speech Emotion Recognition using Semantic Information` https://arxiv.org/pdf/2103…☆27Mar 18, 2021Updated 5 years ago
- Kervolutional neural networks☆16May 8, 2019Updated 6 years ago
- acnn for text-independent speaker recognition☆10Feb 8, 2022Updated 4 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆26Oct 5, 2022Updated 3 years ago
- ☆11Sep 29, 2020Updated 5 years ago
- ☆12Oct 2, 2020Updated 5 years ago
- Portable TLauncher Minecraft Launcher☆11Dec 29, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Analysis and Recognition of Voluntary Facial Expression Mimicry Based on Depression Patients☆15Feb 17, 2023Updated 3 years ago
- Added TCP reliability features over UDP. Implemented slow start, congestion avoidance, flow-control, fast retransmit and fast recovery me…☆11Jan 31, 2013Updated 13 years ago
- Vanilar-CNN face landmark☆23Aug 1, 2018Updated 7 years ago
- ☆14Oct 9, 2019Updated 6 years ago
- Multi-modal fusion framework based on Transformer Encoder☆16Dec 20, 2020Updated 5 years ago
- "MULTIMODAL EMOTION RECOGNITION BASED ON DEEP TEMPORAL FEATURES USING CROSS-MODAL TRANSFORMER AND SELF-ATTENTION" ICASSP'23☆23Feb 26, 2023Updated 3 years ago
- Art2Mus is a system that generates music based on digitized artworks and text by using the AudioLDM2 architecture with an added projectio…☆19Oct 20, 2025Updated 5 months ago
- ☆36Aug 21, 2021Updated 4 years ago
- Benchmarking for Audio-Text and Audio-Visual Generation; Supports FAD, FD_VGG, FD_PANNs, FD_PaSST, IS_PaSST, IS_PANNs, KL_PaSST, KL_PANNs…☆64Feb 14, 2026Updated last month
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Online Spatial Concept and Lexical Acquisition with Simultaneous Localization and Mapping☆10Sep 11, 2020Updated 5 years ago
- This repository shows how to implement a basic model for multimodal entailment.☆10Aug 17, 2021Updated 4 years ago
- OpenVINO Post-Training Optimization Toolkit Tutorial☆16Sep 28, 2020Updated 5 years ago
- The Official Code Repo for EgoOrientBench [CVPR25]☆15Nov 24, 2025Updated 4 months ago
- Using GMMs, WMV, ViBe and template match to detect moving object in various backgroud☆13Jun 20, 2018Updated 7 years ago
- Adversarial Auto-encoders for Speech Based Emotion Recogntion☆15Sep 22, 2018Updated 7 years ago
- Repository for the ACL 2023 conference website☆11Jan 9, 2024Updated 2 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Sep 13, 2023Updated 2 years ago
- Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research☆15May 30, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Repository containg experiments with Extreme Learning Machines And Reservoir Computing, ELMARC.☆20May 1, 2018Updated 7 years ago
- Code for UAI 2019 paper "Domain Generalization via Multidomain Discriminant Analysis"☆14Aug 28, 2019Updated 6 years ago
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- Enables inference and deployment of InnerEye-DeepLearning (https://github.com/microsoft/InnerEye-deeplearning) models as an async REST AP…☆21Mar 21, 2024Updated 2 years ago
- Reference implementation and test synthetic data for Sorted Center Time echo density measure for acoustic impulse responses☆15Mar 18, 2020Updated 6 years ago
- ☆13Mar 25, 2021Updated 5 years ago
- Mitigating Open-Vocabulary Caption Hallucinations (EMNLP 2024)☆18Oct 18, 2024Updated last year