NVIDIA / audio-intelligenceLinks
Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with synthetic captions.
☆93Updated 2 months ago
Alternatives and similar repositories for audio-intelligence
Users that are interested in audio-intelligence are comparing it to the libraries listed below
Sorting:
- ☆45Updated last year
- ☆81Updated 5 months ago
- The open-source code of UniAudio2.0