Extract audio embeddings from an audio file using Python
☆13Jul 25, 2023Updated 2 years ago
Alternatives and similar repositories for audio-embedding
Users that are interested in audio-embedding are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Evaluation tool used in the BigVSAN paper☆14Mar 22, 2024Updated 2 years ago
- Multi-factor Risk Models of Asset or Portfolio Returns☆10May 4, 2021Updated 4 years ago
- ☆19May 9, 2019Updated 6 years ago
- Import of Adobe/Mozilla library for generating machine code to implement JIT compilers☆23Jan 18, 2011Updated 15 years ago
- Low resource machine translation using Transformers and Iterative Back translation☆10Apr 24, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- redis module unit tests with python (deprecated) please see RLTest☆12Sep 8, 2019Updated 6 years ago
- Conversational Agent for Twitter and Discord☆10Mar 20, 2026Updated last week
- Comparing Audio Features for Unsupervised Sound Classification☆10Jun 22, 2022Updated 3 years ago
- ☆10Aug 3, 2019Updated 6 years ago
- Dynamic bindings to the CUDA library for the D Programming Language.☆17Feb 22, 2019Updated 7 years ago
- Create geometry by revolving path around Y axis☆13Aug 27, 2025Updated 7 months ago
- The libdill tutorial.☆16Nov 3, 2019Updated 6 years ago
- Transform audio files into mel spectrograms for text-to-speech model training☆12Aug 25, 2021Updated 4 years ago
- Multi-lingual AudioCaps☆12Nov 20, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- Scripts to convert audio files to spectrograms and back☆11Nov 23, 2017Updated 8 years ago
- ☆10Dec 10, 2021Updated 4 years ago
- ☆14Aug 25, 2021Updated 4 years ago
- Google Analytics widgets for Mozaïk dashboard☆15Aug 17, 2017Updated 8 years ago
- ☆18Sep 16, 2022Updated 3 years ago
- ☆14Jul 28, 2023Updated 2 years ago
- An audio model for recognizing a whistle pattern was trained to toggle a Sonoff/Ewelink socket device connected to a room light☆11May 8, 2023Updated 2 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆19Jul 16, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Understanding A/B testing through Monte Carlo simulation☆17Feb 12, 2015Updated 11 years ago
- Encode an image to sound (WAV file) and view it as a spectrogram. Optimized Python 3 version.☆11Jan 25, 2023Updated 3 years ago
- ☆49May 3, 2020Updated 5 years ago
- Rainbowgram with Python☆13Jan 28, 2019Updated 7 years ago
- Prepare spectrograms from audio for training a Riffusion model☆16Mar 6, 2023Updated 3 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆19Dec 1, 2022Updated 3 years ago
- Hybrid GAN (HiFi-WaveGAN) applied to footsteps sound effects☆12Jul 17, 2023Updated 2 years ago
- Dimensionality reduction (UMAP, t-SNE, PCA) for ImageJ/Fiji☆12May 6, 2025Updated 10 months ago
- Generates spectrogram from images☆13Apr 26, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Grad-CAM (Gradient-weighted Class Activation Mapping)☆13Dec 20, 2019Updated 6 years ago
- A port of sebh's atmosphere model to wgpu + WGSL☆13Aug 19, 2023Updated 2 years ago
- Base Repository for the Script Language Container for user defined functions (UDF's) that can be used in the EXASOL database. You can fin…☆13Mar 20, 2026Updated last week
- ☆12May 1, 2019Updated 6 years ago
- Convert images to audio for display in a spectrogram☆12Apr 17, 2018Updated 7 years ago
- Generate random melody under specific rules.☆42Apr 12, 2018Updated 7 years ago
- Keras implementation of conditional waveGAN. Application to knocking sound effects with emotion.☆11Jun 22, 2020Updated 5 years ago