Recent Advances in Robust Speech Recognition Technology

View Sample

Chapter 13 Reviewing Feature Non-Linear Transformations for Robust Speech Recognition

Luz Garcia, Jose Carlos Segura and Angel de la Torre

Abstract

The aim of Robust Speech Recognition is to reduce as much as possible the environmental mismatch between the training and test conditions in order to optimally use the acoustic models in the recognition process. There are several factors producing such mismatch: inter-speaker variability, intra-speaker variability, and changes in the speaker environment or in the channel characteristics. The changes in the environment represent a challenging area of work and constitute one of the main driving forces of research in voice processing, that nowadays faces application scenarios like mobile phones, moving cars, spontaneous speech, speech masked by other speech, speech masked by music or non-stationary noises. The different strategies that fight the effects of additive noise in the voice signal and the recognition process will be summarized in this review, focusing in the normalization techniques and particularly in the non linear transformations of the MFCC features. Histogram Equalization and Parametric Histogram Equalization with their variants and evolutions will be analyzed as main representatives of this family of non-linear feature transformations.

Total Pages: 190-196 (7)

Purchase Chapter Book Details

Bookshelf

Book Categories

What's new

Future Attractions

For Reviewers

For Buyers and Librarians

For Authors and Editors

Marketing Opportunities

Advertising

General Queries

Bookshelf

Book Categories

What's new

Future Attractions

For Reviewers

For Buyers and Librarians

For Authors and Editors

Marketing Opportunities

Advertising

General Queries

Chapter 13

Reviewing Feature Non-Linear Transformations for Robust Speech Recognition

Luz Garcia, Jose Carlos Segura and Angel de la Torre

Abstract

RELATED BOOKS

Site Breadcrumb

Chapter 13

Reviewing Feature Non-Linear Transformations for Robust Speech Recognition

Luz Garcia, Jose Carlos Segura and Angel de la Torre

Abstract

RELATED BOOKS