Fractional Fourier Transform Techniques for Speech Enhancement

2020-04-16 Prajna Kunche

Author : Prajna Kunche
Release : 2020-04-16
Genre : Technology & Engineering
Kind :
Book Rating : 463/5 ( reviews)

Fractional Fourier Transform Techniques for Speech Enhancement - read free eBook in online reader or directly download on the web page. Select files or add your book in reader. Download and read online ebook Fractional Fourier Transform Techniques for Speech Enhancement write by Prajna Kunche. This book was released on 2020-04-16. Fractional Fourier Transform Techniques for Speech Enhancement available in PDF, EPUB and Kindle. This book explains speech enhancement in the Fractional Fourier Transform (FRFT) domain and investigates the use of different FRFT algorithms in both single channel and multi-channel enhancement systems, which has proven to be an ideal time frequency analysis tool in many speech signal processing applications. The authors discuss the complexities involved in the highly non- stationary signal processing and the concepts of FRFT for speech enhancement applications. The book explains the fundamentals of FRFT as well as its implementation in speech enhancement. Theories of different FRFT methods are also discussed. The book lets readers understand the new fractional domains to prepare them to develop new algorithms. A comprehensive literature survey regarding the topic is also made available to the reader.

New Approaches for Speech Enhancement in the Short-Time Fourier Transform Domain

2016 Mahdi Parchami

Author : Mahdi Parchami
Release : 2016
Genre :
Kind :
Book Rating : /5 ( reviews)

New Approaches for Speech Enhancement in the Short-Time Fourier Transform Domain - read free eBook in online reader or directly download on the web page. Select files or add your book in reader. Download and read online ebook New Approaches for Speech Enhancement in the Short-Time Fourier Transform Domain write by Mahdi Parchami. This book was released on 2016. New Approaches for Speech Enhancement in the Short-Time Fourier Transform Domain available in PDF, EPUB and Kindle. Speech enhancement aims at the improvement of speech quality by using various algorithms. A speech enhancement technique can be implemented as either a time domain or a transform domain method. In the transform domain speech enhancement, the spectrum of clean speech signal is estimated through the modification of noisy speech spectrum and then it is used to obtain the enhanced speech signal in the time domain. Among the existing transform domain methods in the literature, the short-time Fourier transform (STFT) processing has particularly served as the basis to implement most of the frequency domain methods. In general, speech enhancement methods in the STFT domain can be categorized into the estimators of complex discrete Fourier transform (DFT) coefficients and the estimators of real-valued short-time spectral amplitude (STSA). Due to the computational efficiency of the STSA estimation method and also its superior performance in most cases, as compared to the estimators of complex DFT coefficients, we focus mostly on the estimation of speech STSA throughout this work and aim at developing algorithms for noise reduction and reverberation suppression. First, we tackle the problem of additive noise reduction using the single-channel Bayesian STSA estimation method. In this respect, we present new schemes for the selection of Bayesian cost function parameters for a parametric STSA estimator, namely the W?-SA estimator, based on an initial estimate of the speech and also the properties of human auditory system. We further use the latter information to design an efficient flooring scheme for the gain function of the STSA estimator. Next, we apply the generalized Gaussian distribution (GGD) to theW?-SA estimator as the speech STSA prior and propose to choose its parameters according to noise spectral variance and a priori signal to noise ratio (SNR). The suggested STSA estimation schemes are able to provide further noise reduction as well as less speech distortion, as compared to the previous methods. Quality and noise reduction performance evaluations indicated the superiority of the proposed speech STSA estimation with respect to the previous estimators. Regarding the multi-channel counterpart of the STSA estimation method, first we generalize the proposed single-channel W?-SA estimator to the multi-channel case for spatially uncorrelated noise. It is shown that under the Bayesian framework, a straightforward extension from the single-channel to the multi-channel case can be performed by generalizing the STSA estimator parameters, i.e. ? and ?. Next, we develop Bayesian STSA estimators by taking advantage of speech spectral phase rather than only relying on the spectral amplitude of observations, in contrast to conventional methods. This contribution is presented for the multi-channel scenario with single-channel as a special case. Next, we aim at developing multi-channel STSA estimation under spatially correlated noise and derive a generic structure for the extension of a single-channel estimator to its multi-channel counterpart. It is shown that the derived multi-channel extension requires a proper estimate of the spatial correlation matrix of noise. Subsequently, we focus on the estimation of noise correlation matrix, that is not only important in the multi-channel STSA estimation scheme but also highly useful in different beamforming methods. Next, we aim at speech reverberation suppression in the STFT domain using the weighted prediction error (WPE) method. The original WPE method requires an estimate of the desired speech spectral variance along with reverberation prediction weights, leading to a sub-optimal strategy that alternatively estimates each of these two quantities. Also, similar to most other STFT based speech enhancement methods, the desired speech coefficients are assumed to be temporally independent, while this assumption is inaccurate. Taking these into account, first, we employ a suitable estimator for the speech spectral variance and integrate it into the estimation of the reverberation prediction weights. In addition to the performance advantage with respect to the previous versions of the WPE method, the presented approach provides a good reduction in implementation complexity. Next, we take into account the temporal correlation present in the STFT of the desired speech, namely the inter-frame correlation (IFC), and consider an approximate model where only the frames within each segment of speech are considered as correlated. Furthermore, an efficient method for the estimation of the underlying IFC matrix is developed based on the extension of the speech variance estimator proposed previously. The performance results reveal lower residual reverberation and higher overall quality provided by the proposed method. Finally, we focus on the problem of late reverberation suppression using the classic speech spectral enhancement method originally developed for additive noise reduction. As our main contribution, we propose a novel late reverberant spectral variance (LRSV) estimator which replaces the noise spectral variance in order to modify the gain function for reverberation suppression. The suggested approach employs a modified version of the WPE method in a model based smoothing scheme used for the estimation of the LRSV. According to the experiments, the proposed LRSV estimator outperforms the previous major methods considerably and scores the closest results to the theoretically true LRSV estimator. Particularly, in case of changing room impulse responses (RIRs) where other methods cannot follow the true LRSV estimator accurately, the suggested estimator is able to track true LRSV values and results in a smaller tracking error. We also target a few other aspects of the spectral enhancement method for reverberation suppression, which were explored before only for the purpose of noise reduction. These contributions include the estimation of signal to reverberant ratio (SRR) and the development of new schemes for the speech presence probability (SPP) and spectral gain flooring in the context of late reverberation suppression.

DFT-Domain Based Single-Microphone Noise Reduction for Speech Enhancement

2013-01-01 Richard C. Hendriks

Author : Richard C. Hendriks
Release : 2013-01-01
Genre : Technology & Engineering
Kind :
Book Rating : 449/5 ( reviews)

DFT-Domain Based Single-Microphone Noise Reduction for Speech Enhancement - read free eBook in online reader or directly download on the web page. Select files or add your book in reader. Download and read online ebook DFT-Domain Based Single-Microphone Noise Reduction for Speech Enhancement write by Richard C. Hendriks. This book was released on 2013-01-01. DFT-Domain Based Single-Microphone Noise Reduction for Speech Enhancement available in PDF, EPUB and Kindle. As speech processing devices like mobile phones, voice controlled devices, and hearing aids have increased in popularity, people expect them to work anywhere and at any time without user intervention. However, the presence of acoustical disturbances limits the use of these applications, degrades their performance, or causes the user difficulties in understanding the conversation or appreciating the device. A common way to reduce the effects of such disturbances is through the use of single-microphone noise reduction algorithms for speech enhancement. The field of single-microphone noise reduction for speech enhancement comprises a history of more than 30 years of research. In this survey, we wish to demonstrate the significant advances that have been made during the last decade in the field of discrete Fourier transform domain-based single-channel noise reduction for speech enhancement.Furthermore, our goal is to provide a concise description of a state-of-the-art speech enhancement system, and demonstrate the relative importance of the various building blocks of such a system. This allows the non-expert DSP practitioner to judge the relevance of each building block and to implement a close-to-optimal enhancement system for the particular application at hand. Table of Contents: Introduction / Single Channel Speech Enhancement: General Principles / DFT-Based Speech Enhancement Methods: Signal Model and Notation / Speech DFT Estimators / Speech Presence Probability Estimation / Noise PSD Estimation / Speech PSD Estimation / Performance Evaluation Methods / Simulation Experiments with Single-Channel Enhancement Systems / Future Directions

A Perspective on Single-Channel Frequency-Domain Speech Enhancement

2011-03-01 Jacob Benesty

Author : Jacob Benesty
Release : 2011-03-01
Genre : Technology & Engineering
Kind :
Book Rating : 994/5 ( reviews)

A Perspective on Single-Channel Frequency-Domain Speech Enhancement - read free eBook in online reader or directly download on the web page. Select files or add your book in reader. Download and read online ebook A Perspective on Single-Channel Frequency-Domain Speech Enhancement write by Jacob Benesty. This book was released on 2011-03-01. A Perspective on Single-Channel Frequency-Domain Speech Enhancement available in PDF, EPUB and Kindle. This book focuses on a class of single-channel noise reduction methods that are performed in the frequency domain via the short-time Fourier transform (STFT). The simplicity and relative effectiveness of this class of approaches make them the dominant choice in practical systems. Even though many popular algorithms have been proposed through more than four decades of continuous research, there are a number of critical areas where our understanding and capabilities still remain quite rudimentary, especially with respect to the relationship between noise reduction and speech distortion. All existing frequency-domain algorithms, no matter how they are developed, have one feature in common: the solution is eventually expressed as a gain function applied to the STFT of the noisy signal only in the current frame. As a result, the narrowband signal-to-noise ratio (SNR) cannot be improved, and any gains achieved in noise reduction on the fullband basis come with a price to pay, which is speech distortion. In this book, we present a new perspective on the problem by exploiting the difference between speech and typical noise in circularity and interframe self-correlation, which were ignored in the past. By gathering the STFT of the microphone signal of the current frame, its complex conjugate, and the STFTs in the previous frames, we construct several new, multiple-observation signal models similar to a microphone array system: there are multiple noisy speech observations, and their speech components are correlated but not completely coherent while their noise components are presumably uncorrelated. Therefore, the multichannel Wiener filter and the minimum variance distortionless response (MVDR) filter that were usually associated with microphone arrays will be developed for single-channel noise reduction in this book. This might instigate a paradigm shift geared toward speech distortionless noise reduction techniques.

Advances in Intelligent Computing and Communication

2021-05-22 Swagatam Das

Author : Swagatam Das
Release : 2021-05-22
Genre : Technology & Engineering
Kind :
Book Rating : 951/5 ( reviews)

Advances in Intelligent Computing and Communication - read free eBook in online reader or directly download on the web page. Select files or add your book in reader. Download and read online ebook Advances in Intelligent Computing and Communication write by Swagatam Das. This book was released on 2021-05-22. Advances in Intelligent Computing and Communication available in PDF, EPUB and Kindle. This book presents high-quality research papers presented at the 3rd International Conference on Intelligent Computing and Advances in Communication (ICAC 2020) organized by Siksha ‘O’ Anusandhan Deemed to be University, Bhubaneswar, Odisha, India, in November 2020. This book brings out the new advances and research results in the fields of theoretical, experimental, and applied signal and image processing, soft computing, networking, and antenna research. Moreover, it provides a comprehensive and systematic reference on the range of alternative conversion processes and technologies.

You may also like...