Extract audio embeddings from an audio file using Python
☆13Jul 25, 2023Updated 2 years ago
Alternatives and similar repositories for audio-embedding
Users that are interested in audio-embedding are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Evaluation tool used in the BigVSAN paper☆14Mar 22, 2024Updated 2 years ago
- ☆18Jul 22, 2024Updated last year
- Generate embedding vectors from audio files☆59Sep 17, 2025Updated 7 months ago
- Transform geometry positions with a 4x4 transformation matrix.☆13Dec 27, 2015Updated 10 years ago
- Code for the paper Data-to-Text Generation with Iterative Text Editing☆14Mar 23, 2021Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆22Jan 13, 2025Updated last year
- Efficient multi-threaded task scheduler using generic re-usable WebWorkers.☆11Jan 18, 2022Updated 4 years ago
- Conversational Agent for Twitter and Discord☆10Updated this week
- Comparing Audio Features for Unsupervised Sound Classification☆10Jun 22, 2022Updated 3 years ago
- Classify ground motion waves into earthquakes or blasts using traditional Machine Learning algorithms.☆10May 21, 2018Updated 7 years ago
- ☆10Aug 3, 2019Updated 6 years ago
- Create geometry by revolving path around Y axis☆13Aug 27, 2025Updated 8 months ago
- Transform audio files into mel spectrograms for text-to-speech model training☆12Aug 25, 2021Updated 4 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Scripts to convert audio files to spectrograms and back☆12Nov 23, 2017Updated 8 years ago
- Fuzz-Free Web Audio Scheduling☆16Jul 6, 2023Updated 2 years ago
- BigVGAN with Neural Source-Filter☆56Sep 21, 2023Updated 2 years ago
- ☆16Jan 4, 2022Updated 4 years ago
- WaveGANによる音声生成器☆13Feb 9, 2024Updated 2 years ago
- ☆10Dec 10, 2021Updated 4 years ago
- ☆14Aug 25, 2021Updated 4 years ago
- This repository contains TA sessions work for the Machine Learning course, Aug '18 - Dec '18.☆11Nov 17, 2018Updated 7 years ago
- Multi-lingual AudioCaps☆14Nov 20, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆18Sep 16, 2022Updated 3 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆19Jul 16, 2024Updated last year
- 🔮 A smol coding agent in Elixir☆43Mar 22, 2026Updated last month
- ☆49May 3, 2020Updated 6 years ago
- Rainbowgram with Python☆13Jan 28, 2019Updated 7 years ago
- Hybrid GAN (HiFi-WaveGAN) applied to footsteps sound effects☆12Jul 17, 2023Updated 2 years ago
- Prepare spectrograms from audio for training a Riffusion model☆16Mar 6, 2023Updated 3 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆19Dec 1, 2022Updated 3 years ago
- Generates spectrogram from images☆13Apr 26, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Grad-CAM (Gradient-weighted Class Activation Mapping)☆13Dec 20, 2019Updated 6 years ago
- Baseline Python Scripts for Popular Kaggle Competitions☆17Aug 20, 2022Updated 3 years ago
- ☆12May 1, 2019Updated 7 years ago
- Convert images to audio for display in a spectrogram☆12Apr 17, 2018Updated 8 years ago
- Keras implementation of conditional waveGAN. Application to knocking sound effects with emotion.☆10Jun 22, 2020Updated 5 years ago
- Using Deep Learning for singing voice separation - Project for the course DT2119 Speech and Speaker Recognition offered by KTH in 2018☆15Jun 16, 2018Updated 7 years ago
- A blender addon that lets you open individual scripts from Blender's text editor in an IDE of your choice, without having to save the fil…☆13Aug 23, 2022Updated 3 years ago