Results for "waveform synthesis"
Generating human-like speech from text.
Generates audio waveforms from spectrograms.
Generating speech audio from text, with control over prosody, speaker identity, and style.
Models that learn to generate samples resembling training data.
Two-network setup where generator fools a discriminator.
Changing speaker characteristics while preserving content.
Aligns transcripts with audio timestamps.
Control that remains stable under model uncertainty.