AUDITORY CORTEX-INSPIRED SPECTRAL ATTENTION MODULATION FOR BINAURAL SOUND LOCALIZATION IN HRTF MISMATCH - Apprentissage pour les Systèmes Intelligents en Milieux OuVerts
Communication Dans Un Congrès Année : 2024

AUDITORY CORTEX-INSPIRED SPECTRAL ATTENTION MODULATION FOR BINAURAL SOUND LOCALIZATION IN HRTF MISMATCH

Résumé

In applications like noise cancellation and virtual reality, precise sound source localization is crucial. Existing datadriven binaural systems offer high performance in adverse conditions such as noise and reverberation but face limitations with real-time operation and performance degradation in HRTF mismatch scenarios. Our work introduces a compact Vision Transformer tailored to address these issues, with a primary focus on horizontal speech localization. Inspired by the auditory cortex, our model uniquely incorporates spectral attention mechanisms using encoded speech representations. This architecture enhances generalization on the azimuth plane under mismatched HRTFs. Our empirical results show a marked improvement over conventional DNN, CNN-based and Transformer-based models, both in noisy and noise-free environments. Significantly, the proposed model maintains high accuracy in localizing adjacent azimuths, ideal for realworld applications.
Fichier principal
Vignette du fichier
ICASSP2024_FINAL_preprint.pdf (291.11 Ko) Télécharger le fichier
Origine Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-04722068 , version 1 (04-10-2024)

Identifiants

  • HAL Id : hal-04722068 , version 1

Citer

Waradon Phokhinanan, Nicolas Obin, Sylvain Argentieri. AUDITORY CORTEX-INSPIRED SPECTRAL ATTENTION MODULATION FOR BINAURAL SOUND LOCALIZATION IN HRTF MISMATCH. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Apr 2024, Seoul (Korea), South Korea. ⟨hal-04722068⟩
15 Consultations
9 Téléchargements

Partager

More