Dissertation > Excellent graduate degree dissertation topics show

Tourism Comments on the Internet’s Semantic Analysis and Usefulness Research

Author: CaoBin
Tutor: YeQiang
School: Harbin Institute of Technology
Course: Management Science and Engineering
Keywords: Text classification Semantic analysis E-business on tourism the Usefulness of comments
CLC: TP391.1
Type: Master's thesis
Year: 2008
Downloads: 501
Quote: 4
Read: Download Dissertation

Abstract


In the face of abundant comment information resources day by day on the web, how one can quickly and efficiently obtain and utilize the information effectively has become a question which people focus on recently. The objective of the research in this paper is travel comments on the Internet, by using data mining algorithms to obtain the characteristics words about the products or service from the comments, and then extract the sentences including the characteristics words from all the comments. Use LingPipe and PMI methods to analyze these characteristics sentence in order to get these sentences’emotional tendencies. And use LingPipe and Statistic methods to analyze the factors which can influence the usefulness of the tourism comments on the Internet.Firstly, the paper gives a fully introduction on the E-business on tourism. Through introduction to the concept and classification of E-business on tourism, the paper summarizes development of it and proposes the develop trends of it which lays the solid foundation for the following research.Subsequently the paper briefly summarizes the source and principles of text classification and text semantic analysis, and then introduces several main methods of text semantic analysis. In this experimental stage, first, the paper analyzes the semantic tendency about the hotel on the Ctrip website. The purpose of this part of research is to obtain the commentators’opinion on a product’s or service’s characteristics. One step I use data mining algorithms to analyze the comments on the hotels and get the characteristics words which are being paid much attention on. And then program to extract the sentences which contain these characteristics words .The next step is to use LingPipe and PMI methods to analyze emotional tendency about one characteristics word. Finally in accordance with the experiment and the final results, compare the two methods.In the second part of the research, this paper analyzes the factors which can influence the usefulness of the tourism comments on the Internet. I collected and collate the comments about the tourism destination from yahoo.com’s travel Channel, using LingPipe to analyze the comments and obtain the comments’eigenvalue in subjective and objective characteristics. By establishing logical linear regression equeation with fixed effect and its improving model, I have found that the comments which contain both subjective and objective contents and have more words are more useful.This research can quickly obtain the commentators’opinion on a product’s or service’s characteristics, the opinion includes positive and negative evaluations, and can support the reader’s decision-making. Moreover, the research also figure out what kinds of comments can be more useful. I believe that this research will have good practical application in the future.

Related Dissertations

  1. Research on Text Classification Based on Biomimetic Pattern Recongnition,TP391.1
  2. Based on Data Distribution Characteristics of Text Classification,TP391.1
  3. Research on Topic Clustering Model of Socail Tagging Based on Bayesian Theory,C93
  4. Research on Improved K Neighbor Support Vector Machine Algorithm Faced Text Classification,TP391.1
  5. Research for Event Extraction Method in Specific Domain Based on Tree Conditional Random Field,TP391.1
  6. Research on Approaches of the Subjective Automated Assessment,TP391.1
  7. Finding Web Services Based on Clustering Probabilistic Semantic Approach,TP393.09
  8. Emotional background music for movies Classification Method,TP391.1
  9. Online Education News Text Categorization System Design and Implementation,TP391.1
  10. One kind of empirical data on the workload of a software bug fixes Prediction Model,TP311.53
  11. Image / video automatic replacement of the face,TP391.41
  12. Research on cross-language text categorization,TP391.1
  13. Classification model based monitoring of e-commerce Prohibited Research and Implementation,TP393.09
  14. Based on the model and features of the package ML_pLSA Target Recognition Algorithm,TP391.41
  15. Based on semantic analysis of text mining research,TP391.1
  16. Detection and Analysis of Malware Network Behavior Based on Isolation Environment,TP393.08
  17. Based on the associated technology Chinese Text Classification,TP391.1
  18. Chinese named entity recognition and disambiguation of,TP391.1
  19. Automatic distribution of software defects,TP311.52
  20. Research of Web Text Classification Based on Decision Tree Classification Algorithm,TP391.1

CLC: > Industrial Technology > Automation technology,computer technology > Computing technology,computer technology > Computer applications > Information processing (information processing) > Text Processing
© 2012 www.DissertationTopic.Net  Mobile