ILRDF Formosan Text-To-Speech System
The model tends to produce silences, especially on longer audio. We can manually remove silences if needed. Note that this is an experimental feature and may produce strange results. This will also increase generation time.
語速(越小越慢)
Set the number of denoising steps.
Set the duration of the cross-fade between audio clips.