Pitch Shifting of Audio Signals Using the Constant-Q Transform

Christian Schörkhuber; Anssi Klapuri; Alois Sontacchi
DAFx-2012 - York
Pitch-scale modifications of polyphonic music are usually performed by manipulating the time-frequency representation of the input signal. Most approaches proposed in the past are thereby based on the Fourier transform although its linear frequency bin spacing is known to be inadequate to some degree for analysing and processing music signals. Recently invertible constant-Q transforms (CQT) featuring high Q-factors have been proposed exhibiting a more suitable geometrical bin spacing. In this paper a frequency domain pitch-shifting approach based on the CQT is proposed. The CQT is specifically attractive for pitch-shifting because it can be implemented by frequency translation (shifting partials along the frequency axis) as opposed to spectral stretching in the Fourier transform domain. Furthermore, the high time resolution of CQT at high frequencies improves transient preservation. Audio examples are provided to illustrate the results achieved with the proposed method.
Download