Dissertation > Excellent graduate degree dissertation topics show

Speaker Recognition Research Based on Chinese Vowel Mapping Methods

Author: QianBo
Tutor: TangZhenMin
School: Nanjing University of Technology and Engineering
Course: Pattern Recognition and Intelligent Systems
Keywords: Speaker Recognition Vowel classification Chinese vowel map Vector quantization Biomimetic recognition BP neural network Neural network ensemble Vowel frame detection Pitch frequency Noise processing technology Gaussian mixture model Feature Compensation
CLC: TN912.34
Type: PhD thesis
Year: 2007
Downloads: 277
Quote: 2
Read: Download Dissertation

Abstract


Speech is the most convenient, fast and natural tool to communicate with other people. In recent thirty years, along with the development of science and technology, the research of speaker recognition technique has achieved many productions, which will bring us more convenience in our daily life. However, in different application, the standards and requirements become much more higher and the system is susceptible to different influence. On one hand, speech signal is non-stationary, which requires high adaptability in real system; On the other hand, speaker recognition system will be influenced by many factors, such as noises, training time and the distortion of communication channels. The foremost thing in speaker recognition is how to extract appropriate features, which only reflect speaker’s identity information and avoid the semantic disturbance, and how to establish an effective model, which can effectively make use of the available data and be robust to actual different environments. In this paper, we researched on speaker recognition system from two aspects, separating exactly identity information and improving the robustness of system based on Chinese mandarin, and proposed novel algorithms and models.Firstly, in this paper, we presented a novel framework of speaker recognition based on Chinese vowel mapping technique. The base of this framework is the decomposition of Chinese multi-vowel with single-vowel phonemes. According to contrast the spectrum, features, single-vowel phoneme glide statistical distribution and the performance of vowel classification, we confirmed that Chinese vowel could be separated into several single-vowel phonemes based on the short time characteristic. Then we built up a new mapping table from multi-vowel to single-vowel phoneme as the assistant of the latter research through a great deal of experiment and theory. The new framework added a special model to implement the separating and organized several single-vowel classifiers to replace the traditional classification module, which can not only avoid the disturbance of semantic information and achieve higher performance, but also intensify the pertinence of classifiers compared with the traditional classifiers. In the new framework, it adopts short time frame as the basic identify unit, which makes it more compatible to real time system.Under the new framework, we improved the method of vector quantization based on the classifier of Chinese vowel. Because each VQ classifier only deals with one certain kind of phoneme, it can avoid the influence of semantic information, and achieve higher accuracy and performance with smaller codebook than traditional VQ method; However, in order to assure the quality of codebook, it needs a great deal of data during training and testing phase, so we proposed a new Chinese speaker identification system based on biomimetic pattern recognition combining foregoing new framework. We improved the nearest neighbor algorithm to find the cover of each phoneme in the eigenspace for every speaker. During the identification phase, the final decision will be made according to the relationship between the cover and the feature characteristic. Experimental results demonstrate that the system can efficiently reduce the requirement of data. During the research, we find that the new system will introduce in classifying error more or less and decelerate the recognition speed because the new framework increased a special vowel classification module. Owing to this, we proposed a novel neural network ensemble system based on Chinese vowel mapping technique using the ensemble learning theory. During recognizing phase, the system needn’t special vowel classification, so it can avoid error in some sense and speed up the whole system.Furthermore, we still research on pre-processing module and decrease the disturbance of noise for our new framework. A self-adaptive vowel-frame detection algorithm based on energy distribution analysis in frequency domain was presented to extract vowel frame more accurately. We also proposed a new method by modeling the background noise to statistically estimate Gaussian Mixture Model for the pure speaker information. At the end, a robust speaker verification method based on weighted feature compensation transformation is presented during the feature processing and model compensation.The sufficient theory analysis and experimental results demonstrated that the presented model and algorithms based on novel framework have achieved higher accuracy, speed and enhanced the robustness in different conditions compared with many traditional methods. Specially, we succeed in separating personal identification information from semantic information based on classifying the Chinese vowel, which will be a new way to transform the text-independent system into text-dependent speaker recognition system.

Related Dissertations

  1. Research on Lapped Transform and Vector Quantization Based Image Coding Algorithm and Applications,TN919.81
  2. Research on Feature Extraction and Classification of Tongue Shape and Tooth-Marked Tongue in TCM Tongue Diagnosis,TP391.41
  3. Research on Visual Servo System of Mechanical ARM,TP242.6
  4. Municipal tourism land use planning environmental impact assessment,X820.3
  5. Study on Taste Characteristic of Taste Peptide Enzymatic Production from Oyster Base on A Neural Network Method,TS254.4
  6. The Research on Evaluation of Living Status Systems of Expressway Relocated People,D523
  7. Mine Risk Information Integration and Intelligent Early Warning,X936
  8. Research of Orange Quality Classification Technology Based on Computer Vision,TP391.41
  9. Detection and Tracking of Moving Object in Complex Background,TP391.41
  10. Optimization Study on Gating System and Molding Process Parameters of Injection Mold Based on Simulation,TQ320.662
  11. Study on Luohe Technical Supervision Bureau of Food Safety Early Warrning System Based on Neural Network,F203
  12. Research of Adaptive Active Noise Control Based on Neural Network,TP183
  13. Research on Automatic Reading System for Digital Meters,TP391.41
  14. Research on Feature Extraction, Selection and Classification Algorithms for Pulmonary CAD,TP391.41
  15. The Research of Evaluation Method in Connect6 Based on BP-TD Learning,TP18
  16. Research on State Diagnosis on Fan Based on Factor Analysis and BP Neural Network,F426.61
  17. The Research and Design of Converter Steelmaking Endpoint Guiding System,TF345
  18. Analysis on Water Ecological Carrying Capacity of Jiangxi Province,TV213.4
  19. Research on Chinese Speech Processing and Speech Enhancement in Hearing Aids,TN912.3
  20. Key Algorithm in High Quality Voice Conversion System,TN912.3
  21. Research Offingerprint of Communication Behavior,TP311.13

CLC: > Industrial Technology > Radio electronics, telecommunications technology > Communicate > Electro-acoustic technology and speech signal processing > Speech Signal Processing > Speech Recognition and equipment
© 2012 www.DissertationTopic.Net  Mobile