Low-Latency Pitch-Shifting with STN Decomposition
Abstract:
A new approach to low-latency pitch-shifting of audio signals is presented. The proposed solution integrates fuzzy Sines-Transients-Noise (STN) decomposition into the processing pipeline. By separating the input audio into sinusoidal, transient, and noise components, the system can apply a specialized pitch-shifting technique to each stream, addressing common pitch-shifting artifacts such as transient smearing and phasiness. The method uses a phase vocoder for the sines, preserves the transients, and processes the noise with a Noise Morphing algorithm. Modifications to the fuzzy STN and Noise Morphing algorithms that are needed for online processing are proposed. Implemented as a VST audio plug-in, the system supports pitch adjustments across a broad range of semitones with user-configurable parameters. An evaluation consisting of a blind listening test and an informal interview reveals limitations in audio quality compared with state-of-the-art commercial solutions. However, the method's alignment with emerging music trends suggests potential for artistic applications. The study concludes with recommendations for improving audio quality, reducing computational overhead, and expanding the evaluation methodology.
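As a rough illustration of two ideas named in the abstract, the sketch below converts a shift in semitones to a frequency ratio and splits a single time-frequency bin into sine, transient, and noise weights with soft ("fuzzy") masks. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the raised-sine ramp, the thresholds 0.7 and 0.8, and the use of a single `tonalness` value (with transientness taken as its complement) are all hypothetical choices made for the example.

```python
import math


def semitones_to_ratio(semitones: float) -> float:
    """Equal-tempered frequency ratio for a pitch shift given in semitones."""
    return 2.0 ** (semitones / 12.0)


def soft_mask(r: float, lo: float, hi: float) -> float:
    """Raised-sine ramp: 0 at or below lo, 1 at or above hi, smooth in between."""
    if r <= lo:
        return 0.0
    if r >= hi:
        return 1.0
    return math.sin(0.5 * math.pi * (r - lo) / (hi - lo)) ** 2


def stn_masks(tonalness: float, lo: float = 0.7, hi: float = 0.8):
    """Return (sine, transient, noise) weights for one time-frequency bin.

    Assumption: `tonalness` lies in [0, 1] and comes from some upstream
    estimator; transientness is taken to be 1 - tonalness. The thresholds
    are illustrative only.
    """
    s = soft_mask(tonalness, lo, hi)        # clearly tonal bins go to the sines stream
    t = soft_mask(1.0 - tonalness, lo, hi)  # clearly transient bins go to the transients stream
    n = 1.0 - s - t                         # whatever remains is treated as noise
    return s, t, n
```

Because the noise weight is defined as the remainder, the three weights sum to one for every bin, so the three streams add back up to the input before any per-stream processing is applied. With the default thresholds the sine and transient ramps never overlap, which keeps the noise weight non-negative.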