Contribution

Low-Latency Pitch-Shifting with STN Decomposition

Authors

* Presenting author
Day / Time: 20.03.2025, 11:00-11:40
Typ: Poster
Information: The posters will be exhibited in Hall E north from Tuesday to Thursday, sorted by thematic context in the poster island indicated in the session title. The poster session at the specified time offers the opportunity to enter into discussion with the authors.
Abstract: A new approach for low-latency pitch-shifting of audio signals is presented. The proposed solution integrates fuzzy Signal-Transient-Noise (STN) decomposition into the processing pipeline. By separating input audio into harmonic, transient, and noise components, the system can apply specialized pitch-shifting techniques for each stream, addressing common pitch-shifting artifacts such as transient smearing and phasiness. The proposed method employs a phase vocoder for sines, preserves transients and processes noise with a Noise Morphing algorithm. Changes to the Fuzzy STN and Noise Morphing algorithms needed for online processing are proposed. Implemented as a VST audio plug-in, the system supports pitch adjustments across a broad range of semitones with user-configurable parameters. Evaluation, in the form of a blind listening test and an informal interview, reveals limitations in audio quality compared to state-of-the-art commercial solutions. However, the alignment with emerging music trends suggests potential for artistic applications. The study concludes with recommendations for improving audio quality, reducing computational overhead, and expanding evaluation methodologies.