A Method for Automatic Whoosh Sound Description

A4 Conference proceedings


Publication Details

List of Authors: Eugene Cherny, Johan Lilius, Dmitry Mouromtsev
Editors: Alberto Torin, Brian Hamilton, Stefan Bilbao, Michael Newton
Place: Edinburgh
Publication year: 2017
Journal: Proceedings of the 20th International Conference on Digital Audio Effects (DAFx-17), Edinburgh, UK, September 5–9, 2017
Book title: Proceedings of the 20th International Conference on Digital Audio Effects (DAFx-17), Edinburgh, UK, September 5–9, 2017
Start page: 459
End page: 465
ISSN: 2413-6700


Abstract

Usually, a sound designer achieves artistic goals by editing and processing
pre-recorded sound samples. To assist navigation through the vast number of sounds,
the sound metadata is used: it provides small free-form textual descriptions of
the sound file content. One can search through the keywords or phrases in the
metadata to find a group of sounds that can be suitable for a task.
Unfortunately, the relativity of the sound design terms complicates the search,
making it tedious, error-prone, and by no means supportive of
the creative flow. Another way to approach the sound search problem is to use
sound analysis. In this paper we present a simple method for analyzing the
temporal evolution of the "whoosh" sound, based on the per-band piecewise
linear function approximation of the sound envelope signal. The method uses
spectral centroid and fuzzy membership functions to estimate a degree to which
the sound energy moves upwards or downwards in the frequency domain along the
audio file. We evaluated the method on a generated dataset, consisting of white
noise recordings processed with different variations of modulated bandpass
filters. The method correctly identified the centroid movement
directions in 77% of the sounds in the synthetic dataset.
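
The idea of estimating a degree of upward or downward movement can be
illustrated with a small sketch. This is not the authors' implementation: it
assumes the spectral centroid has already been computed per frame, replaces the
per-band piecewise linear approximation with a single least-squares line, and
uses a simple ramp as the fuzzy membership function. The function names and the
full_hz threshold are hypothetical, chosen only for illustration.

```python
def slope(values):
    """Least-squares slope of the values against their frame index."""
    n = len(values)
    if n < 2:
        return 0.0
    mean_x = (n - 1) / 2
    mean_y = sum(values) / n
    num = sum((i - mean_x) * (v - mean_y) for i, v in enumerate(values))
    den = sum((i - mean_x) ** 2 for i in range(n))
    return num / den


def ramp(x, lo, hi):
    """Fuzzy membership rising linearly from 0 at lo to 1 at hi."""
    if x <= lo:
        return 0.0
    if x >= hi:
        return 1.0
    return (x - lo) / (hi - lo)


def movement_degrees(centroids_hz, full_hz=2000.0):
    """Return fuzzy degrees of upward and downward centroid movement.

    centroids_hz: per-frame spectral centroid values in Hz (assumed given).
    full_hz: total centroid change treated as full membership; this
             threshold is an assumption, not a value from the paper.
    """
    # Total change of the fitted line from the first to the last frame.
    total_change = slope(centroids_hz) * (len(centroids_hz) - 1)
    return {
        "up": ramp(total_change, 0.0, full_hz),
        "down": ramp(-total_change, 0.0, full_hz),
    }


if __name__ == "__main__":
    rising = [500.0, 1000.0, 1500.0, 2000.0, 2500.0]
    print(movement_degrees(rising))
    print(movement_degrees(list(reversed(rising))))
```

With these assumptions, a centroid track that rises steadily by the full
threshold gets an "up" degree of 1 and a "down" degree of 0, while a flat track
gets 0 for both. A real whoosh typically rises and then falls, which is why a
piecewise approximation, rather than a single line, is needed in practice.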

