☆30Feb 21, 2019Updated 7 years ago
Alternatives and similar repositories for pytorch-soundnet
Users that are interested in pytorch-soundnet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Jul 18, 2018Updated 7 years ago
- converting the pretrained tensorflow SoundNet model to pytorch☆14Jun 15, 2022Updated 3 years ago
- This repository contains code for classification of sound using spectrograms. We train a CNN to classify the sounds after converting to s…☆10Dec 14, 2018Updated 7 years ago
- SoundNet, built in Keras with pre-trained 8-layer model.☆29Oct 15, 2019Updated 6 years ago
- SoundNet: Learning Sound Representations from Unlabeled Video. NIPS 2016☆465Oct 7, 2017Updated 8 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Pytorch implementation of DSR-RL for Video Summarization Task☆12Aug 30, 2021Updated 4 years ago
- soundnet and localize sound source☆12Dec 7, 2020Updated 5 years ago
- Room acoustic simulator with a SOFA file loader.☆23Sep 27, 2024Updated last year
- This repository contains the code for our ICASSP paper `Speech Emotion Recognition using Semantic Information` https://arxiv.org/pdf/2103…☆27Mar 18, 2021Updated 5 years ago
- ☆11Aug 20, 2024Updated last year
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆26Oct 5, 2022Updated 3 years ago
- ☆18Dec 13, 2023Updated 2 years ago
- extensible NMEA-0183 parser/encoder for node.js☆20Mar 19, 2019Updated 7 years ago
- Added TCP reliability features over UDP. Implemented slow start, congestion avoidance, flow-control, fast retransmit and fast recovery me…☆11Jan 31, 2013Updated 13 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Vanilar-CNN face landmark☆23Aug 1, 2018Updated 7 years ago
- Collision detection; Ship domain☆13Sep 19, 2019Updated 6 years ago
- ☆15Oct 27, 2020Updated 5 years ago
- Multi-modal fusion framework based on Transformer Encoder☆16Dec 20, 2020Updated 5 years ago
- Official Repository for paper "Ambisonizer: Neural Upmixing as Spherical Harmonics Generation"☆16May 27, 2024Updated 2 years ago
- Art2Mus is a system that generates music based on digitized artworks and text by using the AudioLDM2 architecture with an added projectio…☆20Oct 20, 2025Updated 7 months ago
- Unofficial reimplementation of Dynamic Fusion with Intra- and Inter-modality Attention Flow for Visual Question Answering☆18Oct 30, 2019Updated 6 years ago
- OpenVINO Post-Training Optimization Toolkit Tutorial☆16Sep 28, 2020Updated 5 years ago
- Benchmarking for Audio-Text and Audio-Visual Generation; Supports FAD, FD_VGG, FD_PANNs, FD_PaSST, IS_PaSST, IS_PANNs, KL_PaSST, KL_PANNs…☆72Feb 14, 2026Updated 3 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Using GMMs, WMV, ViBe and template match to detect moving object in various backgroud☆13Jun 20, 2018Updated 7 years ago
- The Official Code Repo for EgoOrientBench [CVPR25]☆15Nov 24, 2025Updated 6 months ago
- Adversarial Auto-encoders for Speech Based Emotion Recogntion☆15Sep 22, 2018Updated 7 years ago
- Proximal Asynchronous SAGA☆13Nov 30, 2017Updated 8 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Sep 13, 2023Updated 2 years ago
- [ICML 2024] Temporal Spiking Neural Networks with Synaptic Delay for Graph Reasoning☆11Jun 1, 2024Updated last year
- Repository containg experiments with Extreme Learning Machines And Reservoir Computing, ELMARC.☆20May 1, 2018Updated 8 years ago
- Code for UAI 2019 paper "Domain Generalization via Multidomain Discriminant Analysis"☆14Aug 28, 2019Updated 6 years ago
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Reference implementation and test synthetic data for Sorted Center Time echo density measure for acoustic impulse responses☆15Mar 18, 2020Updated 6 years ago
- ☆13Mar 25, 2021Updated 5 years ago
- Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.☆24Sep 27, 2020Updated 5 years ago
- ☆12Jun 14, 2022Updated 3 years ago
- ☆11May 18, 2022Updated 4 years ago
- Deployed a facial emotion recognition using neural network model which predicts the emotion from faces in images, videos and live feed fr…☆11May 2, 2021Updated 5 years ago
- A CNN audio classifier via spectrogram images.☆10Jul 21, 2017Updated 8 years ago