A Method for Automatic Whoosh Sound Description

A4 Konferenspublikationer


Interna författare/redaktörer


Publikationens författare: Eugene Cherny, Johan Lilius, Dmitry Mouromtsev
Redaktörer: Alberto Torin, Brian Hamilton, Stefan Bilbao, Michael Newton
Förlagsort: Edinburgh
Publiceringsår: 2017
Tidskrift: Proceedings of the 20th International Conference on Digital Audio Effects (DAFx-17), Edinburgh, UK, September 5–9, 2017
Moderpublikationens namn: Proceedings of the 20th International Conference on Digital Audio Effects (DAFx-17), Edinburgh, UK, September 5–9, 2017
Artikelns första sida, sidnummer: 459
Artikelns sista sida, sidnummer: 465
ISSN: 2413-6700


Abstrakt

Usually, a sound designer achieves artistic goals by editing and processing the
pre-recorded sound samples. To assist navigation in the vast amount of sounds,
the sound metadata is used: it provides small free-form textual descriptions of
the sound file content. One can search through the keywords or phrases in the
metadata to find a group of sounds that can be suitable for a task.
Unfortunately, the relativity of the sound design terms complicate the search,
making the search process tedious, prone to errors and by no means supportive of
the creative flow. Another way to approach the sound search problem is to use
sound analysis. In this paper we present a simple method for analyzing the
temporal evolution of the ``whoosh'' sound, based on the per-band piecewise
linear function approximation of the sound envelope signal. The method uses
spectral centroid and fuzzy membership functions to estimate a degree to which
the sound energy moves upwards or downwards in the frequency domain along the
audio file. We evaluated the method on a generated dataset, consisting of white
noise recordings processed with different variations of modulated bandpass
filters. The method was able to correctly identify the centroid movement
directions in 77% sounds from a synthetic dataset.


Dokument


Senast uppdaterad 2020-19-02 vid 03:14