☆30Feb 21, 2019Updated 7 years ago
Alternatives and similar repositories for pytorch-soundnet
Users that are interested in pytorch-soundnet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- converting the pretrained tensorflow SoundNet model to pytorch☆14Jun 15, 2022Updated 3 years ago
- TensorFlow implementation of "Attentive Modality Hopping for Speech Emotion Recognition," ICASSP-20☆33Aug 10, 2020Updated 5 years ago
- SoundNet, built in Keras with pre-trained 8-layer model.☆29Oct 15, 2019Updated 6 years ago
- SoundNet: Learning Sound Representations from Unlabeled Video. NIPS 2016☆464Oct 7, 2017Updated 8 years ago
- TensorFlow implementation of "SoundNet".☆145Mar 26, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Repo accompanying the blog post "How to Deploy A State-of-the-art PyTorch Model to iOS via Core ML (Part 3)".☆16Jul 3, 2020Updated 5 years ago
- Automatic PCG classification for heart disease screening☆11Aug 2, 2018Updated 7 years ago
- ☆10Jul 24, 2019Updated 6 years ago
- Pytorch implementation of DSR-RL for Video Summarization Task☆12Aug 30, 2021Updated 4 years ago
- soundnet and localize sound source☆12Dec 7, 2020Updated 5 years ago
- This repository contains the code for our ICASSP paper `Speech Emotion Recognition using Semantic Information` https://arxiv.org/pdf/2103…☆27Mar 18, 2021Updated 5 years ago
- acnn for text-independent speaker recognition☆10Feb 8, 2022Updated 4 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆26Oct 5, 2022Updated 3 years ago
- ☆11Sep 29, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆12Oct 2, 2020Updated 5 years ago
- Portable TLauncher Minecraft Launcher☆14Dec 29, 2023Updated 2 years ago
- Multi-modal fusion framework based on Transformer Encoder☆16Dec 20, 2020Updated 5 years ago
- ICCV 2019 Tutorial: Global Optimization for Geometric Understanding with Provable Guarantees☆15Oct 20, 2022Updated 3 years ago
- Official Repository for paper "Ambisonizer: Neural Upmixing as Spherical Harmonics Generation"☆16May 27, 2024Updated last year
- "MULTIMODAL EMOTION RECOGNITION BASED ON DEEP TEMPORAL FEATURES USING CROSS-MODAL TRANSFORMER AND SELF-ATTENTION" ICASSP'23☆23Feb 26, 2023Updated 3 years ago
- Art2Mus is a system that generates music based on digitized artworks and text by using the AudioLDM2 architecture with an added projectio…☆19Oct 20, 2025Updated 6 months ago
- Online Spatial Concept and Lexical Acquisition with Simultaneous Localization and Mapping☆10Sep 11, 2020Updated 5 years ago
- This repository shows how to implement a basic model for multimodal entailment.☆10Aug 17, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- OpenVINO Post-Training Optimization Toolkit Tutorial☆16Sep 28, 2020Updated 5 years ago
- The Official Code Repo for EgoOrientBench [CVPR25]☆15Nov 24, 2025Updated 5 months ago
- Adversarial Auto-encoders for Speech Based Emotion Recogntion☆15Sep 22, 2018Updated 7 years ago
- Repository for the ACL 2023 conference website☆11Jan 9, 2024Updated 2 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Sep 13, 2023Updated 2 years ago
- [ICML 2024] Temporal Spiking Neural Networks with Synaptic Delay for Graph Reasoning☆11Jun 1, 2024Updated last year
- PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions (NeurIPS 2025 D&B track, Spotlight)☆30Apr 9, 2026Updated last month
- Code for UAI 2019 paper "Domain Generalization via Multidomain Discriminant Analysis"☆14Aug 28, 2019Updated 6 years ago
- Repository containg experiments with Extreme Learning Machines And Reservoir Computing, ELMARC.☆20May 1, 2018Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- Reference implementation and test synthetic data for Sorted Center Time echo density measure for acoustic impulse responses☆15Mar 18, 2020Updated 6 years ago
- ☆13Mar 25, 2021Updated 5 years ago
- Robust estimation of local affine maps and its applications to image matching☆16Mar 24, 2023Updated 3 years ago
- Mitigating Open-Vocabulary Caption Hallucinations (EMNLP 2024)☆18Oct 18, 2024Updated last year
- Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.☆24Sep 27, 2020Updated 5 years ago
- ☆12Jun 14, 2022Updated 3 years ago