lucasgattas / ComfyUI-Egregora-Audio-Super-Resolution
✨ High‑quality music audio enhancement for ComfyUI: FlashSR Super‑Resolution + Fat Llama spectral enhancement (GPU & CPU).
☆41Updated 2 months ago
Alternatives and similar repositories for ComfyUI-Egregora-Audio-Super-Resolution
Users who are interested in ComfyUI-Egregora-Audio-Super-Resolution are comparing it to the libraries listed below.
- The official code repository for SongPrep: A Preprocessing Framework and End-to-end Model for Full-song Structure Parsing and Lyrics Tran…☆124Updated 2 weeks ago
- BeltOut: An open source pitch-perfect voice-to-voice timbre transfer model based on ChatterboxVC☆78Updated 5 months ago
- JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment☆116Updated 4 months ago
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆36Updated last month
- Awesome music generation model: MG²☆165Updated 8 months ago
- ☆11Updated last year
- YuE with mp3 extend, exllama and GUI☆64Updated 10 months ago
- Singing voice conversion based on FreeVC☆21Updated 3 years ago
- Fork of ACE-Step for LoRA training with < 10 GB VRAM☆58Updated last month
- ☆48Updated 5 months ago
- RVC Onnx Infer- Upgraded and simplified-ish☆25Updated last year
- Codebase and project page for EDMSound☆35Updated 2 years ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆113Updated last year
- ☆91Updated 2 months ago
- Real-time end-to-end singing voice conversion☆22Updated last year
- ☆83Updated last year
- SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning☆49Updated 4 months ago
- ☆183Updated last month
- VI-SVC model is just VITS without MAS and DurationPredictor.☆10Updated 2 years ago
- Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS☆25Updated 3 weeks ago
- Audio-to-Audio Schrodinger Bridges is a diffusion-based audio restoration model for bandwidth extension and inpainting.☆119Updated 4 months ago
- ☆51Updated last year
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.☆53Updated last month
- Controlled audio inpainting using the SD fine-tuned model Riffusion in a ControlNet architecture☆32Updated 2 years ago
- List of Podcast Feeds using iTunes API and script to download 6,000,000~ hours of English speech.☆31Updated 2 years ago
- Training and inference code for a reproduction of Google's speech restoration model Miipher-2. Pretrained models are also released.☆30Updated 5 months ago
- [ICASSP'24] Investigating Personalization Methods in Text to Music Generation☆44Updated last year
- AudioSR-Upsampling (any -> 48kHz)☆42Updated last year
- 4 GB GPU & 10 minutes to train☆12Updated 2 years ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆87Updated 5 months ago