NVIDIA / elucidated-text-to-audio

Elucidated Text-To-Audio (ETTA) is a state-of-the-art (SOTA) text-to-audio model built on a holistic understanding of the design space and trained with synthetic captions.

Alternatives and similar repositories for elucidated-text-to-audio

Users who are interested in elucidated-text-to-audio are comparing it to the libraries listed below.
