Better Intelligibility: Reduced Accent Leakage in Cross-Lingual Synthesis
Given the same Italian reference prompt and the same Chinese target text, we compare the synthesized outputs across models. Our model preserves speaker identity while producing cleaner pronunciation with less accent leakage.
Reference (Italian)
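Target text (Chinese): 凯尔投资的钱与其他投资者一样人间蒸发。 (English: "The money Kyle invested vanished into thin air, just like that of the other investors.")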