Perceptually Controlled Reshaping of Sound Histograms

Abstract : Many audio processing algorithms have optimal performance for specific signal statistical distributions that may not be fulfilled for all signals. When the original signal is available, we propose to add an inaudible noise so that the distribution of the signal-plus-noise mixture is as close as possible to a given target distribution. The proposed generic algorithm (independent from the application) adds iteratively a low-power white noise to a flat-spectrum version of the signal, until the target distribution or the noise audibility is reached. The latter is assessed through a frequency masking model. Two implementations of this sound reshaping are described, according to the level of the targeted transformation and to the foreseen application: Histogram Global Reshaping (HGR) to change the global shape of the histogram and Histogram Local Reshaping (HLR) to locally " chisel " the histogram, but keeping the global shape unchanged. These two variants are illustrated by two applications where the inaudibility of the noise generated by the algorithm is required: " sparsification " for source separation, and low-pass filtering of the histogram for application of the quantization theorem, respectively. In both cases, the target histogram is reached or almost reached and the transformation is inaudible. The experiments show that the source separation performs better with HGR and that the HLR allows a better application of the quantization theorem.
Type de document :
Article dans une revue
Liste complète des métadonnées

Littérature citée [34 références]  Voir  Masquer  Télécharger

https://hal-descartes.archives-ouvertes.fr/hal-01828960
Contributeur : Gaël Mahé <>
Soumis le : vendredi 13 juillet 2018 - 15:36:08
Dernière modification le : jeudi 11 avril 2019 - 16:02:18
Archivage à long terme le : lundi 1 octobre 2018 - 09:16:17

Fichier

SoundHistogramReshaping_TASLP2...
Fichiers produits par l'(les) auteur(s)

Identifiants

Collections

Citation

Gaël Mahé, Mériem Jaidane. Perceptually Controlled Reshaping of Sound Histograms. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2018, 26 (9), pp.1671 - 1683. ⟨10.1109/TASLP.2018.2836143⟩. ⟨hal-01828960⟩

Partager

Métriques

Consultations de la notice

65

Téléchargements de fichiers

176