Sparse Overcomplete Representations of Audio:
A Way to Combine Signal Models with
Waveform Analysis / Synthesis.

 

Professor Laurent Daudet

Laboratoire d'Acoustique Musicale

Université Pierre et Marie Curie (Paris 6)


Date: Friday, January 12, 2007
Place: Buchanan 1930
Time: 2:00 pm 3:00 pm


Abstract:

This talk will be focused on signal modeling using sparse decompositions on overcomplete dictionaries, with a strong focus on audio signals. In such models, a signal is approximated by a small number of elementary waveforms ("atoms") taken from a large collection ("dictionary"). Sparse decompositions offer a greater flexibility over fixed orthogonal bases: for instance one is not limited by the use of a single window size, which avoids--apparently at least--uncertainty constraints. The price paid is the non-uniqueness of the decomposition. Finding the best decomposition for a given signal is in general not possible in finite time. However, greedy suboptimal techniques have been developed that provide near-optimal decompositions at a reasonable computational cost. These methods are thus applicable to multimedia data. In the work presented here, we detail techniques for including signal models (for instance structure constraints between coefficients) directly at the decomposition stage, by grouping together semantically relevant atoms into clusters ("molecules"). We will discuss possible applications for scalable coding and audio classification.

 

 

LAURENT DAUDET studied at the Ecole Normale Supérieure, Paris, France, from 1993 to 1997, where he received a degree in statistical and nonlinear physics. In 2000, he received a Ph.D. degree in mathematical modeling from the Université de Provence, Marseille, France, on audio coding and physical modeling of piano strings. In 2001 and 2002, he was a EU Marie Curie post-doctoral fellow at the Department of Electronic Engineering, Queen Mary, University of London, U.K. Since 2002, he has been a Lecturer at the Université Pierre et Marie Curie (Paris 6), Paris, France's leading science university, where he now heads a small audio signal processing team at the Laboratory for Musical Acoustics (LAM). His research interests include audio coding and indexing, time-frequency and time-scale transforms, analysis of transient signals, and sparse representations for audio.

 

 

Host: Professor Curtis Roads, Media Arts and Technology