Abstract: Recent work has studied text-to-audio synthesis using large amounts of paired text-audio data. However, audio recordings with high-quality text annotations can be difficult to acquire. In ...