☆24Jul 17, 2024Updated last year
Alternatives and similar repositories for stable-audio-2-demo
Users that are interested in stable-audio-2-demo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official codebase for Reflected Flow Matching (ICML 2024)☆23Jun 19, 2024Updated last year
- ☆11May 7, 2022Updated 4 years ago
- ReaSCAN is a synthetic navigation task that requires models to reason about surroundings over syntactically difficult languages. (NeurIPS…☆19Nov 28, 2021Updated 4 years ago
- PyTorch implementation of the paper Learning Multi-Level Representations for Hierarchical Music Structure Analysis presented at ISMIR 202…☆14Jan 2, 2023Updated 3 years ago
- A Datasette instance for searching WebVid-10M☆15Sep 30, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆10Mar 20, 2025Updated last year
- Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.☆289May 12, 2026Updated 2 weeks ago
- Simple ray tracing engine written in C++ using Qt.☆12Jun 4, 2013Updated 12 years ago
- ☆10Jun 19, 2019Updated 6 years ago
- Zero-data (yet trainable) probabilistic fundamental frequency estimator.☆19Jun 9, 2018Updated 7 years ago
- ☆20Dec 8, 2024Updated last year
- Repo to contain the public code for the ACL2017 poetry paper.☆16Aug 14, 2017Updated 8 years ago
- 本项目包含一个 Python 脚本,用于分离双人(或多人)对话播客音频文件中的不同说话人语音。它利用 `pyannote.audio` 库进行说话人日志分析(Speaker Diarization),找出“谁在什么时候说话”,并将每个说话人的语音片段提取到单独的音轨中。☆15Apr 30, 2025Updated last year
- ☆18Jul 17, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆18Mar 12, 2020Updated 6 years ago
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.☆45Sep 11, 2024Updated last year
- Assistant of ZJU score.☆10Jun 25, 2025Updated 11 months ago
- ☆13Jan 5, 2018Updated 8 years ago
- neural network based speaker embedder☆25Jan 7, 2023Updated 3 years ago
- ☆193Nov 19, 2025Updated 6 months ago
- A procedural quest generator using Tracery☆10Feb 14, 2017Updated 9 years ago
- 语音合成服务☆12Mar 18, 2023Updated 3 years ago
- Guide on how to set up openai gym and mujoco for deep reinforcement learning research.☆16Jan 12, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A simple tool to guess an HuggingFace repo URL from a state dict.☆50Oct 29, 2024Updated last year
- The implementation of MDNet, which is in submission to Interspeech2022☆14May 1, 2022Updated 4 years ago
- Chinese-Handwriting-Tool☆13Nov 11, 2023Updated 2 years ago
- Extend the Conditioning of Stable Diffusion to take Audio Embeddings Instead of Text Embeddings using Wav2Vec2-BERT model☆13Sep 25, 2024Updated last year
- ☆16Jun 15, 2022Updated 3 years ago
- ☆11Feb 8, 2024Updated 2 years ago
- Real Time Reflections in OpenGL using screen space techniques☆38Apr 29, 2012Updated 14 years ago
- ☆18May 14, 2025Updated last year
- PyTorch Implementation of [AudioLCM]: a efficient and high-quality text-to-audio generation with latent consistency model.☆13Jun 15, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆12Jun 9, 2025Updated 11 months ago
- Code for "A diffusion-inspired training strategy for singing voice extraction in the waveform domain" (ISMIR 2022)☆17Feb 16, 2023Updated 3 years ago
- frame interpolation for CLIP guided videos☆15Aug 18, 2022Updated 3 years ago
- [ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model☆17Feb 13, 2025Updated last year
- Generative models for conditional audio generation☆3,731Updated this week
- Playing around with procedural generation in Unity☆17Oct 22, 2016Updated 9 years ago
- ☆11Jun 2, 2019Updated 6 years ago