Principled probing of foundation models in the auditory modality - Fédération Fabri de Peiresc
Communication Dans Un Congrès Année : 2024

Principled probing of foundation models in the auditory modality

Sondage motivé théoriquement des modèles de fondation dans la modalité auditive

Résumé

We leverage ecological theories of sound perception in humans and a carefully designed dataset of perceptually calibrated sounds to develop and carry out principled fine-grained probing of foundation models in relation to the auditory modality. We show that internal activations of the state-of-the-art audio foundation model BEATs correlate better with perceptual dimensions than a supervised audio classification model and a text-audio multimodal model and that all models fail to represent at least one perceptual dimension. We also report preliminary evidence suggesting that directions aligning invariantly with a perceptual dimension can be identified within the representation space at inner layers of the BEATs model. We briefly discuss future work and potential applications.
Fichier principal
Vignette du fichier
85_Principled_probing_of_found.pdf (1.06 Mo) Télécharger le fichier
Origine Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-04869548 , version 1 (07-01-2025)

Licence

Identifiants

  • HAL Id : hal-04869548 , version 1

Citer

Etienne Bost, Mitsuko Aramaki, Richard Kronland-Martinet, Sølvi Ystad, Thierry Artières, et al.. Principled probing of foundation models in the auditory modality. NeurIPS 2024 Workshop on Behavioral Machine Learning, Dec 2024, Vancouver (BC), Canada. ⟨hal-04869548⟩
0 Consultations
0 Téléchargements

Partager

More