Implementations and reviews of audio and computer vision papers in Python using Keras and TensorFlow.
☆40 · Nov 1, 2018 · Updated 7 years ago
Alternatives and similar repositories for Audio-Vision
Users who are interested in Audio-Vision are comparing it to the libraries listed below.
- MNSS (Music Noise Segmentation on a Spectrogram) is a deep-neural network based preprocessing technique that pre-filters unnecessary nois… ☆11 · Dec 14, 2015 · Updated 10 years ago
- python template private service ☆18 · Oct 20, 2020 · Updated 5 years ago
- https://www.kaggle.com/c/flavours-of-physics ☆23 · Oct 23, 2015 · Updated 10 years ago
- Scene Classification using Audio in the nearby Environment. ☆19 · Sep 4, 2019 · Updated 6 years ago
- A multi-channel neural network audio classifier using Keras ☆271 · Jul 29, 2021 · Updated 4 years ago
- Code for CVPR 2021 paper Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing ☆24 · Dec 29, 2021 · Updated 4 years ago
- Creating audio preprocessing features in TensorFlow Keras layers ☆14 · Jul 13, 2021 · Updated 4 years ago
- Neural network to classify some styles of Electronic music ☆23 · Apr 4, 2019 · Updated 7 years ago
- This is my Master's thesis project titled "Speaker Detection and Conversation Analysis on Mobile Devices". ☆15 · May 21, 2017 · Updated 9 years ago
- A json version of the OpenCyc-latest.owl Ontology ☆13 · Oct 27, 2011 · Updated 14 years ago
- RaspberryPi SenseHat ☆10 · Sep 21, 2015 · Updated 10 years ago
- Web framework for GeoSolver ☆14 · Feb 18, 2017 · Updated 9 years ago
- ☆11 · Mar 15, 2017 · Updated 9 years ago
- Python implementation of the Detection of Envelope Modulation On Noise (DEMON) algorithm. ☆17 · Jan 5, 2018 · Updated 8 years ago
- Implementation of an attack/decay model for piano transcription ☆11 · Feb 1, 2018 · Updated 8 years ago
- Convolutional Neural Network based implementation of Audio Event Recognition in KERAS ☆14 · Nov 28, 2017 · Updated 8 years ago
- Reproducible research code for the experiments presented in our article "Kara1k: a karaoke dataset for cover song identification and sing… ☆10 · Jan 9, 2018 · Updated 8 years ago
- ☆14 · Nov 13, 2023 · Updated 2 years ago
- Text pair classification ☆13 · Jun 24, 2017 · Updated 8 years ago
- Android sound localization and classification app. ☆14 · Jul 4, 2025 · Updated 10 months ago
- PANiC - PAraphrasing Noun-Compounds ☆15 · Apr 6, 2018 · Updated 8 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility. ☆32 · Sep 13, 2023 · Updated 2 years ago
- Mycroft Skills Kit ☆27 · Jun 2, 2022 · Updated 3 years ago
- Mycroft Skills Manager ☆14 · Feb 1, 2022 · Updated 4 years ago
- Google Speech Command Dataset Classification Neural Network, CNN, RNN ☆26 · Aug 29, 2017 · Updated 8 years ago
- Motion-conditional image animation for video editing ☆20 · Dec 2, 2023 · Updated 2 years ago
- Bag-of-Features Acoustic Event Detection ☆14 · Oct 5, 2016 · Updated 9 years ago
- 📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes). ☆30 · Jun 17, 2024 · Updated last year
- The repo for "On-the-fly Modulation for Balanced Multimodal Learning", T-PAMI 2024 ☆19 · Sep 29, 2024 · Updated last year
- A mycroft skill wrapper around a subset of aircrack-ng ☆17 · Jun 1, 2022 · Updated 3 years ago
- This code is to implement the model-free control algorithm as introduced in the paper Model-free control by Michel Fliess and Cedric Join… ☆13 · Nov 29, 2017 · Updated 8 years ago
- Meta-Music is an open-source project that lets people add metadata to their Music library using custom build audio fingerprinting and re… ☆12 · Nov 5, 2021 · Updated 4 years ago
- A streaming Speech to Text server using DeepSpeech ☆16 · May 10, 2020 · Updated 6 years ago
- Codes for "Event localization in music auto-tagging" ☆31 · Mar 15, 2017 · Updated 9 years ago
- Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework" ☆19 · Jan 18, 2026 · Updated 4 months ago
- keras project for audio deep learning ☆40 · Apr 10, 2018 · Updated 8 years ago
- SAAVN Code release for paper "Sound Adversarial Audio-Visual Navigation, ICLR 2022" (In PyTorch) ☆21 · Nov 9, 2022 · Updated 3 years ago
- SoundNet, built in Keras with pre-trained 8-layer model. ☆29 · Oct 15, 2019 · Updated 6 years ago
- The library is useful for analyzing the emotions present in any audio file (call/music/recordings) into three classes namely positive, neg… ☆32 · Jul 26, 2016 · Updated 9 years ago