
Sound Event Localization and Detection Using Dual Cross-modal Attention and Parameter Sharing

크로스모달 어텐션 및 파라미터 공유 기법을 활용한 음향 이벤트 판별 및 방향 탐지

Abstract

Sound event localization and detection is a joint task that unifies sound event detection and direction-of-arrival (DOA) estimation. Combining detection and localization is reasonable because the sound of an event reaches the microphones from its source in a specific direction, so the temporal and spatial locations of the target events can be estimated together. The task has become popular enough that it was introduced as Task 3 of the Detection and Classification of Acoustic Scenes and Events (DCASE) challenge in 2019. In this thesis, we propose a method based on dual cross-modal attention (DCMA) and parameter sharing to simultaneously detect and localize sound events. Furthermore, we introduce various data augmentation methods and diverse types of acoustic features. Experimental results show that the proposed system significantly outperforms the baseline method. In addition, our model adopting the track-wise output format achieved a much higher LR_CD than the highly ranked systems in the challenge.
