2 results
Generating speech audio from text, with control over prosody, speaker identity, and style.
Assigning a role or identity to the model.