Dissertation > Excellent graduate degree dissertation topics show

Research on Subspace-Based Speech Enhancement

Author: NiuTong
Tutor: ZhangLianHai
School: PLA Information Engineering University
Course: Signal and Information Processing
Keywords: Speech enhancement Subspace Spectral domain constrained Perceptual weighting Noise spectral estimation Weighted minimum statistics Noise autocorrelation estimation Speaker identification
CLC: TN912.35
Type: Master's thesis
Year: 2009
Downloads: 69
Quote: 3
Read: Download Dissertation

Abstract


In most applications, the aim of speech enhancement is to improve the quality and intelligibility of degraded speech, and it’s important to promote the performance of speech processing system in noise environments. This dissertation mainly focuses on the subspace-based speech enhancement and the noise estimation algorithms, containing the following aspects:To minimize speech distortion and overcome the poor performance of subspace algorithms in colored noise environments, an improved optimal subspace algorithm is derived. The proposed algorithm is based on the theory of spectral domain constrained estimation and the characteristic of generalized eigenvalue decomposition. Furthermore, a unified notation of the proposed algorithm is provided for both white and colored noise environments. Results, based on informal listening tests and objective measures, indicated significant improvements in speech quality with the proposed algorithm.To make the residual noise perceptually inaudible, a well known perceptual weighting technique from speech coding is used to shape the residual noise spectrum. Under the minimum perceptually weighting errors criterion, a perceptually motivated subspace algorithm to speech enhancement is proposed in the spectral domain. The results demonstrated significant improvements in speech intelligibility with the perceptually motivated subspace algorithm.As the noise spectral estimation based on the minimum statistics introduces significant tracking latency when the noise spectral rises, an improved algorithm based on the weighted minimum statistics is proposed. Analyzing the influence of the weight on the noise spectral estimation, three kinds of typical simple curves are used to compute the weight, and the experiment shows that the weight computed by the cosine curve is the best. The simulation results showed that the improved algorithm traced the change of noise spectral quickly in most cases as well as improved the accuracy of the noise spectral estimation and the quality of speech in the non-stationary noise environment. Furthermore, to improve the performance of subspace methods in the non-stationary noise environment, the improved algorithm is extended into subspace by using the relationship between signal spectrum and autocorrelation.To estimate the noise autocorrelation from the noisy speech autocorrelation directly, an unbiased noise autocorrelation estimator is proposed under the minimum mean square error criterion, which is based on the optimal first-order smoothing recursion and energy minimum algorithm. The simulation results showed that the proposed estimator outperformed the traditional estimators, especially under the non-stationary noise environments.Finally, we apply the subspace-based speech enhancement to the speaker recognizer as the pre-processor module .The results showed that the introduced module improved the precision of the feature extraction as well as the speaker identification rates.

Related Dissertations

  1. Study of Speech Codeing System Based on Conjugate Structure Algebraic Code Excited Linear Prediction,TN912.3
  2. Linear Operator Broadcast Channels,TN911.22
  3. Design and Realization of Channel Estimation for MIMO-OFDM Systems Based on Subspace,TN919.3
  4. Non-negative matrix factorization based on sparse image retrieval,TP391.41
  5. Subspace-based small unmanned rotorcraft System Identification,V275.1
  6. Research on Subspace Analysis Based Face Recognition Algorithm,TP391.41
  7. Research on Mixed-pixel Unmixing of Hyperspectral Imagery,TP751
  8. Visual Saliency Detection Via Tensor Decomposition,TP391.41
  9. Blind Channel Identification Based on Subspace Method of Second-order Statistics,TN911.23
  10. Collaborative Filtering in Personalized Recommendation Based on High-dimensional Subspace Clustering,TP393.09
  11. Detection of Application-Level Failures in Large-Scale Internet Service,TP393.09
  12. Domain Decomposition Algorithm for Compact Difference Schemes of Heat Equation,O241.82
  13. Face recognition technology under complex conditions,TP391.41
  14. Model Predictive Control of Air System of Thermal Boiler Based-on Subspace Identification Method,TM621.2
  15. Study of Parameter Estimation Algorithm of Multidimensional Sinusoidal Signal,TN911.7
  16. The Fast Method for DOA Estimation of Wideband Signals and Its Implementation on DSP,TN911.7
  17. The Study of Speech Codec Based on Perceptual Quality,TN912.3
  18. Adaptive multi-rate wideband speech coding algorithm simulation and research,TN912.3
  19. Multi- pose face recognition based on subspace learning complex scenes,TP391.41
  20. Research on Feature Transformation and Robust Technology with Speaker Identification,TN912.34

CLC: > Industrial Technology > Radio electronics, telecommunications technology > Communicate > Electro-acoustic technology and speech signal processing > Speech Signal Processing > Speech Enhancement
© 2012 www.DissertationTopic.Net  Mobile