Dissertation > Excellent graduate degree dissertation topics show

Research on Feature Transformation and Robust Technology with Speaker Identification

Author: XuLiMin
Tutor: TangZhenMin
School: Nanjing University of Technology and Engineering
Course: Pattern Recognition and Intelligent Systems
Keywords: Speaker identification Feature transformation Multi-step clustering weighted features compensation transformation Adaptive histogram equalization Noise robustness
CLC: TN912.34
Type: PhD thesis
Year: 2008
Downloads: 188
Quote: 0
Read: Download Dissertation

Abstract


This dissertation focuses on the research on Transformation-based Gaussian mixture model, weighted features compensation transformation and adaptive histogram equalization to improve the performance of speaker identification and the robustness in practical application environment. Including:1. A multi-step clustering algorithm with transformation-based and diagonal-covariance Gaussian mixture model (GMM) is advanced. In order to simplify the computation, Gaussian mixture density functions always use diagonal covariance matrices. However this also reduces the likelihood of the data, which could consequently affect the classification decision. In order to compensate the losing likelihood, the multi-step clustering algorithm is proposed. In this algorithm, the embedded linear transformation is used to integrate both transformation and diagonal-covariance Gaussian mixture into a unified framework. Also a multi-step cluster algorithm is integrated into the estimating process of GMM to search the appropriate mixture number. Compared with, the estimation frequency is obviously reduced. Compared with the traditional cluster expectation-maximization (EM) algorithm, the newly proposed method can save 50% of time and the error rates decrease by 1.4% on average on the same database. Compared with the transformation embedded GMM, the experiment with two databases indicate that the method reformed in the paper can directly reach the best point of saturation with the right mixture number.2. A weighted features compensation transformation method based on GMM for robust speaker verification is presented. In the method, the scores of features are weighted through frame SNR, while the frame likelihood probabilities are transformed based on the acoustic characteristic of speaker recognition system. In stationary and non-stationary noise environment with different SNR, compared with the features weighted algorithm, this proposed method can achieve the average recognition rate increase by 2.74% and 2.82%, while the method have the average recognition rate increase of 3.56% and 1.34% compared with the normalization of compensation transform method on the same database. On the another open database, the increments are 3.02% and 2.56% compared with the features weighted algorithm, while compared with the normalization of compensation transform method, the increments are 3.9% and 1.14%.3. Based on the statistical characteristics of speaker feature and the particularity of histogram equalization applied to speaker recognition, the adaptive histogram equalization (AHEQ) method for speaker recognition is presented. In this method, the cumulative histogram function is first created with the wide range and then According to the frequency range eigenvalue increment from the size of the interval to determine the need for further delineation and demarcation level. This approach not only reduce the amount of computation, but also the transformation of the eigenvalues more in line with the actual distribution of feature space, making it possible to further improve the recognition rate and robust of Speaker Identification System in noise environment. In the same database, the study used two classic noise (that is, White and Babble), compared with ordinary histogram equalization method, the average recognition rate of AHEQ is increased by 3% and 2.9%. In another comparison testing focused, the performance of the adaptive histogram equalization method is similar improvement.

Related Dissertations

  1. Compensation Methods of Different Speech Coding for Speaker Recognition,TN912.34
  2. Machine learning method based automatic classification of EEG,TP181
  3. Research on Subspace-Based Speech Enhancement,TN912.35
  4. Research on Simulation of SAR Image and Application Based on Image Characteristics,TN957.52
  5. Study on the Structural Damage Identification Based on the Acceleration Response and BP Neural Network,TU312.3
  6. 3D mesh model watermarking algorithm,TP309.7
  7. The Technology of Moving Target Detection and Tracking in Dynamic Scene,TP391.41
  8. Research on the Method for Face Recognition Based on Extremely Randomized Trees,TP391.41
  9. The Research on Speaker Recognition of Wireless Access Control System,TP273.5
  10. Discrimination of Art Features about Blue and White Porcelain Painting Flowers and Birds Between the Civilian Kiln and the Traditional Painting,J211.27
  11. Discriminative training and discriminative adaptive acoustic models in automatic speech recognition Optimization,TN912.34
  12. A Research on Speaker Recognition Algorithm and Speaker Identification System Implementation,TN912.34
  13. Technology of Model Training in Speaker Recognition Based on Adaptation and MCE,TP391.42
  14. Any text speaker recognition system,TN912.33
  15. Research on Speaker Identification Method and System Application Development,TP391.42
  16. Speaker Identification Based on Independent Component Analysis and Genetic Algorithm,TP391.42
  17. The Research on Face Authentication System and Related Techniques Based on Trace Transform,TP391.41
  18. Research on Contrast Enhancement of Fog-Degraded Image,TP391.41
  19. Robust Speech Recognition Based on Local Time-frequency Analysis,TN912.3
  20. Study on Robust Speech Recognition Method of Isolated Word in Small Vocabulary,TN912.34
  21. Speaker Recognition Based on Continuous Hidden Markov Model,TN912.34

CLC: > Industrial Technology > Radio electronics, telecommunications technology > Communicate > Electro-acoustic technology and speech signal processing > Speech Signal Processing > Speech Recognition and equipment
© 2012 www.DissertationTopic.Net  Mobile