Dissertation > Excellent graduate degree dissertation topics show

Research on Chinese Spoken Term Detection Technology for News Corpus

Author: WangKeWei
Tutor: HanJiQing
School: Harbin Institute of Technology
Course: Computer Science and Technology
Keywords: spoken term detection news corpus lattice n-gram model long distancebigram model automatic corpus splicing
CLC: TN912.34
Type: Master's thesis
Year: 2012
Downloads: 17
Quote: 0
Read: Download Dissertation


Spoken term detection (STD) returns relevant segments from a given corpus of speechdata according to users’ queries which are in text form. STD is an important area ofspeech recognition and has broad application prospects. The design of STD system isusually implemented in two stages: off-line indexing and online searching. Obviously,the accuracy of the STD system is highly related to the quality of the index.Indexing is usually based on the output of the ASR system. The indices of mostSTD system are based on lattice, which is the output of the speech recognition. Thelattice has reasonable structure and contains plentiful of information. The probabilityof the local path through the lattice can be obtained according to the acousticlikelihood and language model and such information is kept in the lattice. It’s a simpleand effective way to take this probability as confidence measure when indexing. Asthe traditional N-gram model (i.e. the bigram model) does not consider the syntacticand semantic constraint of further words, it misses some information. The longdistance bigram model in this paper captures different aspects of the syntactic andsemantic constraint between words, the STD system based on the lattice and the longdistance bigram other than the traditional N-gram model will improve the quality ofthe indices and the performance of the system. Our experiments consider theperformance of the STD systems based on different distance of bigram anddemonstrate that, when integrating results from systems based on different distances,we can get higher detection recall over system based on traditional N-gram models.News corpus is an ideal choice of constructing speech recognition system in STDsystem for news databases. In the front of the STD system, the input speech needs tobe converted into text by a speech recognition system. But commercial news corpus atpresent does not have a detailed transcript. The transcript is of paragraph level notphrase level. It cannot be used when doing recognition task. This paper presents anautomatic method of segmenting the speech of paragraph level based on speechrecognition. The method constructs a linear recognition network first of all, theninserts silence models between short speech utterances, finally does decodingprocessing over the speech. The experiments demonstrate that this method shows fineperformance when splicing segments of paragraph level less than11minutes. Weconclude that it is an effective method of splicing paragraph level speech.

Related Dissertations

  1. The Study on Structural Calculation and Analysis Method of Lattice-type Crane with Variable Cross-section Boom,TH21
  2. The implication structural study interval set,O159
  3. Design and Application of LED Screen Based on WSN Bottom Module,TN312.8
  4. First-Principle Studies of Cathode Materials LiFePO4 and Its Doped Systems for Lithium Ion Batteries,TM912
  5. Design of Lattice Network Based Digital Filters with High Robustness,TN713.7
  6. Nonlinear Partial Differential Integral Method,O175.29
  7. Study of Adaptive Channel Equalizers Based on State-space Realizations,TN715
  8. Study on Surface Mine Transportation System Optimization Based on Fuzzy Multiple Objective Lattice-Order Decision Making,F426.1
  9. The Granules of Formal Concept,TP18
  10. The Research on Characteristics of Two-wavelength Terahertz Modulator Based on Compound Lattice Photonic Crystals,TN761
  11. Experimental and Numerical Investigation on Unsteady Flow Behavior in Tight Lattice,TK124
  12. Study of Lattice Boltzmann Method and Its Application on Microflows,O35
  13. Flutter Analysis of Unsteady Transonic Wing,V215.34
  14. Simulation of Underground Water Inrush Based on Lattice Boltzmann Method,TD745
  15. Numerical Simulation of Mine Water-inrush with Gravity and Parallel Algorithm Design,TD745
  16. Foil stock recrystallization and precipitation behavior,TG146.21
  17. Research on Fire Response and Performance-based Fire Resistant Design of Pre-stressed Suspended Lattice Shells,TU352.5
  18. The Technology Research of Endpoint Detection and Keywords Detection of Speech,TN912.3
  19. Aerodynamic Performance of Heavy-lift Helicopter Rotor and Effects of Blade Parameters,V211.52
  20. Rough concept lattice based multi-attribute decision analysis,O159
  21. The Algorithms of Generating Concept Lattice,O153.1

CLC: > Industrial Technology > Radio electronics, telecommunications technology > Communicate > Electro-acoustic technology and speech signal processing > Speech Signal Processing > Speech Recognition and equipment
© 2012 www.DissertationTopic.Net  Mobile