☆138May 13, 2025Updated last year
Alternatives and similar repositories for PretrainedSED
Users that are interested in PretrainedSED are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆21Jun 12, 2025Updated last year
- ☆29Oct 17, 2024Updated last year
- Prediction of sound event bounding boxes (SEBBs)☆35Aug 2, 2024Updated last year
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆167Updated this week
- Source code for Consistent ensemble distillation for audio tagging☆70Mar 20, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated last year
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated last year
- ☆42Feb 18, 2026Updated 3 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆53May 1, 2025Updated last year
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆30Mar 10, 2024Updated 2 years ago
- 5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs☆57Nov 19, 2025Updated 6 months ago
- The program ranked first in Audio-only track of DCASE2024 Challenge task3.☆22Mar 2, 2026Updated 3 months ago
- [ACL 2026 Main] MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows☆140Sep 2, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆39Jul 4, 2024Updated last year
- A library built for easier audio self-supervised training, downstream tasks evaluation☆141Sep 25, 2025Updated 8 months ago
- This repository aims to collect Transformer-based sound event detection (SED) algorithms.☆101Feb 10, 2026Updated 4 months ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆156Feb 23, 2026Updated 3 months ago
- Official code for SongEcho☆64Mar 3, 2026Updated 3 months ago
- Efficient Training of Audio Transformers with Patchout☆382Jan 12, 2024Updated 2 years ago
- This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models☆35Oct 13, 2024Updated last year
- The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi…☆66Dec 26, 2025Updated 5 months ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …☆347Nov 20, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆11Dec 28, 2023Updated 2 years ago
- [AAAI 2024] Code for CTX-vec2wav in UniCATS☆130Jun 11, 2024Updated 2 years ago
- ☆44Jan 13, 2025Updated last year
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline☆204Dec 13, 2024Updated last year
- Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"☆86Nov 7, 2025Updated 7 months ago
- EVAR ~ Evaluation package for Audio Representations☆80Feb 19, 2026Updated 3 months ago
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models☆23Jul 10, 2024Updated last year
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆92Dec 20, 2024Updated last year
- ☆33Dec 23, 2025Updated 5 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆25Jun 2, 2026Updated last week
- Repo associated to the DESED dataset, download and creation of data☆152Jul 16, 2024Updated last year
- Extract phoneme-level timestamps from speeh audio.☆142Jun 7, 2026Updated last week
- WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection☆18Nov 19, 2024Updated last year
- ☆13Jan 3, 2024Updated 2 years ago
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 10 months ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year