Audio source separation and speech enhancement

Learn the technology behind hearing aids, Siri, and Echo Audio source separation and speech enhancement aim to extract one or more source signals of interest from an audio recording involving several sound sources. These technologies are among the most studied in audio signal processing today and be...

Full description

Bibliographic Details
Other Authors: Vincent, Emmanuel (Editor), Virtanen, Tuomas (Editor), Gannot, Sharon (Editor)
Format: eBook
Published: Hoboken, NJ John Wiley & Sons 2018
Online Access:
Collection: O'Reilly - Collection details see MPG.ReNa
Table of Contents:
  • Includes bibliographical references and index
  • Part I: Prerequisites: Introduction / Emmanuel Vincent, Sharon Gannot, and Tuomas Virtanen
  • Time-frequency processing : spectral properties / Tuomas Virtanen, Emmanuel Vincent, and Sharon Gannot
  • Acoustics : spatial properties / Emmanuel Vincent, Sharon Gannot, and Tuomas Virtanen
  • Multichannel source activity detection, localization, and tracking / Pasi Pertilä, Alessio Brutti, Piergiogio Svaizer, and Maurizio Omologo
  • Preface / Emmanuel Vincent, Tuomas Virtanen, Sharon Gannot
  • Part III: Multichannel Separation and Enhancement: Spatial filtering / Shmulik Markovich-Golan, Walter Kellermann, and Sharon Gannot
  • Multichannel parameter estimation / Shmulik Markovich-Golan, Walter Kellermann, and Sharon Gannot
  • Multichannel clustering and classification approaches / Michael I. Mandel, Shoko Araki, and Tomohiro Nakatani
  • Independent component and vector analysis / Hiroshi Sawada and Zbyněk Kokdovský
  • Gaussian model based multichannel separation / Alexey Ozerov and Hirokazu Kameoka
  • Dereverberation / Emanuël A.P. Habets and Patrick A. Naylor
  • Part II: Single-Channel Separation and Enhancement: Spectral masking and filtering / Timo Gerkmann and Emmanuel Vincent
  • Single-channel speech presence probability estimation and noise tracking / Rainer Martin and Israel Cohen
  • Single-channel classivication and clustering approaches / Felix Weninger, Jun Du, Erik Marchi, and Tian Gao
  • Nonnegative matrix factorization / Roland Badeau and Tuomas Virtanen
  • Temporal extensions of nonegative matrix factorization / Cédric Févotte, Paris Smaragdis, Nasser Mohammadiha, and Gautham J. Mysore
  • Part IV: Application Scenarios and Perspectives: Applying source separation to music / Bryan Pardo, Antoine Liutkus, Zhiyao Duan, and Gaël Richard
  • Application of source separation to robust speech analysis and recognition / Shinji Watanabe, Tuomas Virtanen, and Dorothea Kolossa
  • Binaural speech processing with application to hearing devices / Simon Doclo, Sharon Gannot, Daniel Marquardt, and Elior Hadad
  • Perspectives / Emmanual Vincent, Tuomas Virtanen, and Sharon Gannot