Implementations and reviews of audio and computer vision papers in Python, using Keras and TensorFlow.
☆40 · Nov 1, 2018 · Updated 7 years ago
Alternatives and similar repositories for Audio-Vision
Users that are interested in Audio-Vision are comparing it to the libraries listed below.
- MNSS (Music Noise Segmentation on a Spectrogram) is a deep-neural network based preprocessing technique that pre-filters unnecessary nois… ☆11 · Dec 14, 2015 · Updated 10 years ago
- https://www.kaggle.com/c/flavours-of-physics ☆23 · Oct 23, 2015 · Updated 10 years ago
- A multi-channel neural network audio classifier using Keras ☆270 · Jul 29, 2021 · Updated 4 years ago
- Parallelize your computations in parallel-apply fashion. ☆33 · Jul 19, 2019 · Updated 6 years ago
- Code for CVPR 2021 paper Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing ☆24 · Dec 29, 2021 · Updated 4 years ago
- Named-Entity Recognition model to extract "food" entities - Python ☆47 · May 1, 2017 · Updated 9 years ago
- creating audio preprocessing features in TensorFlow keras layers, ☆14 · Jul 13, 2021 · Updated 4 years ago
- Neural network to classify some styles of Electronic music ☆23 · Apr 4, 2019 · Updated 7 years ago
- my notebooks ☆39 · Dec 1, 2021 · Updated 4 years ago
- A self-hosted drag-and-drop, nosql yet fully-featured file-scanning server. ☆30 · Apr 21, 2022 · Updated 4 years ago
- A json version of the OpenCyc-latest.owl Ontology ☆13 · Oct 27, 2011 · Updated 14 years ago
- audio classification using TensorFlow ☆15 · Feb 9, 2017 · Updated 9 years ago
- RaspberryPi SenseHat ☆10 · Sep 21, 2015 · Updated 10 years ago
- Whisp - Environmental Sound Classifier ☆13 · Aug 14, 2023 · Updated 2 years ago
- Implementation of an attack/decay model for piano transcription ☆11 · Feb 1, 2018 · Updated 8 years ago
- Reproducible research code for the experiments presented in our article "Kara1k: a karaoke dataset for cover song identification and sing… ☆10 · Jan 9, 2018 · Updated 8 years ago
- ☆14 · Nov 13, 2023 · Updated 2 years ago
- PANiC - PAraphrasing Noun-Compounds ☆15 · Apr 6, 2018 · Updated 8 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility. ☆32 · Sep 13, 2023 · Updated 2 years ago
- Google Speech Command Dataset Classification Neural Network, CNN, RNN ☆26 · Aug 29, 2017 · Updated 8 years ago
- Motion-conditional image animation for video editing ☆20 · Dec 2, 2023 · Updated 2 years ago
- Learning to Separate Object Sounds by Watching Unlabeled Video (ECCV 2018) ☆50 · Sep 24, 2019 · Updated 6 years ago
- Bag-of-Features Acoustic Event Detection ☆14 · Oct 5, 2016 · Updated 9 years ago
- This repository consists of all the work done regarding Heart sound classification employing ANN, CNN and other methods, Android Applicat… ☆18 · Jun 7, 2019 · Updated 7 years ago
- 📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes). ☆30 · Jun 17, 2024 · Updated 2 years ago
- A mycroft skill wrapper around a subset of aircrack-ng ☆17 · Jun 1, 2022 · Updated 4 years ago
- A streaming Speech to Text server using DeepSpeech ☆16 · May 10, 2020 · Updated 6 years ago
- Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"