MiscellaneousStuff / PhoneLM

(R&D) Text-to-speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and EnCodec.
46 · Updated last year

Related projects: