Dissertation > Excellent graduate degree dissertation topics show

Compensation Methods of Different Speech Coding for Speaker Recognition

Author: LiXueLin
Tutor: HanJiQing
School: Harbin Institute of Technology
Course: Computer Science and Technology
Keywords: Speaker identification Text-independent Speech coding Maximum A Posterior estimation Maximum Likelihood estimation Score compensation
CLC: TN912.34
Type: Master's thesis
Year: 2008
Downloads: 59
Quote: 0
Read: Download Dissertation

Abstract


There are so many advantages for speaker recognition technique, including flexibility, economy, accuracy, extensibility, and so on, thus it has a broad application future in biometrics recognition field. Although the system performs well in the lab, the performance descents rapidly because of the influence of various factors in the real world. One of the main factors affecting the performance is the code mismatch between training data and testing data. Especially in speaker recognition under network environment, the available training data is from some speech coder, however, in actual use the testing data is from another speech coder. In this situation, the performance of speaker recogonition is seriously affected. In order to improve the speaker recognition performance under network environment, enhance system practical level, first of all, we need to resolve speech coding mismatch problems, that is eliminating the influence resulted from the code mismatch in training and testing conditions.This paper mainly studies compensation approaches, which effectively overcome the impact of different speech coding, so as to improve the speaker recognition performance under network environment. These approaches compensate mainly in the feature domain and scoring domain. In encoding feature compensation, the MAP (Maximum A Posterior) method and the ML (Maximum Likelihood) method are applied to the speaker recognition systems. In scoring compensation, the likelihood ratio score normalization method that has been used in the channel compensation is adopted, so as to further improve system performance. We recognize firstly by GMM(Gaussian Mixture Model), and then make secondary judgement based on using coding score normalization, and finally get the recognition results. The baseline system we used is text-independent speaker identification system. Experimental results show that by firstly using MAP method to coding compensation, then using likelihood scores method to scoring compensation, the best recogonition rate is 83.4% in open set tests.

Related Dissertations

  1. The Junior Middle School Language Teaching Text Analysis on the Cultivation of Autonomous Learning Ability,G633.3
  2. Mixed Exponential Distribution under Censored Accelerated test of quadratic estimates,O211.3
  3. Based on RFID Prison Intelligent Management System Research and Implementation,TP315
  4. Research on Time Synchronization Algorithm in Wireless Sensor Networks,TN929.5
  5. Implementation and Optimization of the AMR-WB Algorithm Based on DM642,TN912.3
  6. A Random Weighted Linear Estimator of the ARCH Parameters,F830
  7. Research on Target Localization Technology in Wireless Sensor Networks,TP212.9
  8. The Life Data Analysis of the Transmitter and Receive of ZPW-2000A System,U284
  9. Research on Efficient Parameter Quantization Algorithms for Low-bit-rate Speech Coding,TN912.3
  10. Research and Implementation of Adaptive Low-Rate Speech Coding,TN912.3
  11. Research on Subspace-Based Speech Enhancement,TN912.35
  12. Theory and Application for the Logistic Regression Models Based on Case-Control Data,O212.1
  13. The Studt of ILBC Speech Coding’s Key Technology and Its DSP Design and Optimization,TN912.3
  14. The Theory and Application of Term Structure Model with Jumps,F822.0;F832.51
  15. General Exponential Distribution: Bayes Estimation under Entropy Loss Function,O211.67
  16. Improve the quality of voice -based AMR algorithm,TN912.3
  17. Compensation method based on channel speaker verification research,TN912.34
  18. Research on Automatic Evaluation of English Recitation and Retelling Test,TP391.6
  19. Parameter Maximum Likelihood Estimations from Incomplete Data in Generalized Linear Models,O212.1
  20. Maximum Likelihood Estimation of Poisson Mixed Model,F832.2

CLC: > Industrial Technology > Radio electronics, telecommunications technology > Communicate > Electro-acoustic technology and speech signal processing > Speech Signal Processing > Speech Recognition and equipment
© 2012 www.DissertationTopic.Net  Mobile