Speech dereverberation constrained on room impulse response characteristics

Louis Bahrman; Mathieu Fontaine; Jonathan Le Roux; Gaël Richard

Communication Dans Un Congrès Année : 2024

Speech dereverberation constrained on room impulse response characteristics

(1, 2) , (1, 2) , (3) , (1, 2)

1
2
3

Louis Bahrman

Fonction : Auteur
PersonId : 1179676
IdHAL : louis-bahrman
ORCID : 0000-0002-4207-2067

Signal, Statistique et Apprentissage

Département Images, Données, Signal

Mathieu Fontaine

Fonction : Auteur
PersonId : 13405
IdHAL : mathieu-fontaine
ORCID : 0000-0002-7657-6271
IdRef : 236886681

Signal, Statistique et Apprentissage

Département Images, Données, Signal

Jonathan Le Roux

Fonction : Auteur
PersonId : 1399082

Mitsubishi Electric Research Laboratories

Gaël Richard

Fonction : Auteur
PersonId : 14146
IdHAL : gael-richard
IdRef : 094977208

Signal, Statistique et Apprentissage

Département Images, Données, Signal

Résumé

Single-channel speech dereverberation aims at extracting a dry speech signal from a recording affected by the acoustic reflections in a room. However, most current deep learning-based approaches for speech dereverberation are not interpretable for room acoustics, and can be considered as black-box systems in that regard. In this work, we address this problem by regularizing the training loss using a novel physical coherence loss which encourages the room impulse response (RIR) induced by the dereverberated output of the model to match the acoustic properties of the room in which the signal was recorded. Our investigation demonstrates the preservation of the original dereverberated signal alongside the provision of a more physically coherent RIR.

Mots clés

Speech dereverberation hybrid deep learning room acoustics acoustic matching speech processing

Domaines

Traitement du signal et de l'image [eess.SP] Intelligence artificielle [cs.AI]

Fichier principal

camera_ready.pdf (1.62 Mo)

Origine	Fichiers produits par l'(les) auteur(s)

Louis Bahrman : Connectez-vous pour contacter le contributeur

https://telecom-paris.hal.science/hal-04640068

Soumis le : mardi 9 juillet 2024-14:25:41

Dernière modification le : mercredi 23 octobre 2024-10:30:04

Dates et versions

hal-04640068 , version 1 (09-07-2024)

Identifiants

HAL Id : hal-04640068 , version 1

Citer

Louis Bahrman, Mathieu Fontaine, Jonathan Le Roux, Gaël Richard. Speech dereverberation constrained on room impulse response characteristics. INTERSPEECH, Sep 2024, Kos Island, Greece. ⟨hal-04640068⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

GENCI LTCI IDS S2A IP_PARIS INSTITUT-MINES-TELECOM

554 Consultations

147 Téléchargements

Speech dereverberation constrained on room impulse response characteristics

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager