TY - GEN
T1 - Automatic sound recognition of urban environment events
AU - Theodorou, Theodoros
AU - Mporas, Iosif
AU - Fakotakis, Nikos
PY - 2015/1/1
Y1 - 2015/1/1
N2 - The audio analysis of speaker’s surroundings has been a first step for several processing systems that enable speaker’s mobility though his daily life. These algorithms usually operate in a short-time analysis decomposing the incoming events in time and frequency domain. In this paper, an automatic sound recognizer is studied, which investigates audio events of interest from urban environment. Our experiments were conducted using a close set of audio events from which well known and commonly used audio descriptors were extracted and models were training using powerful machine learning algorithms. The best urban sound recognition performance was achieved by SVMs with accuracy equal to approximately 93%.
AB - The audio analysis of speaker’s surroundings has been a first step for several processing systems that enable speaker’s mobility though his daily life. These algorithms usually operate in a short-time analysis decomposing the incoming events in time and frequency domain. In this paper, an automatic sound recognizer is studied, which investigates audio events of interest from urban environment. Our experiments were conducted using a close set of audio events from which well known and commonly used audio descriptors were extracted and models were training using powerful machine learning algorithms. The best urban sound recognition performance was achieved by SVMs with accuracy equal to approximately 93%.
KW - Automatic sound recognition
KW - Dimensionality redundancy
KW - Urban environment
UR - http://www.scopus.com/inward/record.url?scp=84945977634&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-23132-7_16
DO - 10.1007/978-3-319-23132-7_16
M3 - Conference contribution
AN - SCOPUS:84945977634
SN - 9783319231310
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 129
EP - 136
BT - Speech and Computer - 17th International Conference, SPECOM 2015, Proceedings
A2 - Ronzhin, Andrey
A2 - Potapova, Rodmonga
A2 - Fakotakis, Nikos
PB - Springer Nature Link
T2 - 17th International Conference on Speech and Computer, SPECOM 2015
Y2 - 20 September 2015 through 24 September 2015
ER -