Abstract: The increasing ability of deep learning models to produce realistic-sounding synthetic speech poses serious problems for privacy, public trust, and digital security. To counter this danger, ...
Abstract: In this paper, we propose a deep learning (DL)-based task-driven spectrum prediction framework, named DeepSPred. The DeepSPred comprises a feature encoder and a task predictor, where the ...
This project adapts the framework introduced by Carlsson et al. in On the Local Behavior of Spaces of Natural Images (2008) to the domain of audio signals. Where the original work revealed that ...