Dissertation > Excellent graduate degree dissertation topics show

Research on the Automatic Generation of Customer Reviews Report Facing Network

Author: ShiYingChao
Tutor: HuMingHan
School: Northeastern University
Course: Computer Software and Theory
Keywords: sentiment analysis product attribute extraction template report generation natural language processing
CLC: TP391.1
Type: Master's thesis
Year: 2011
Downloads: 4
Quote: 0
Read: Download Dissertation

Abstract


As e-commerce is becoming more and more popular, the number of customer reviews that a product receives grows rapidly. For a popular product, the number of reviews can be in thousands or more. This makes it difficult for a potential customer to read them and make an informed decision on whether to purchase the product or not. It also makes it difficult for the manufacturer of the product to keep track and to manage customer opinions. In this research, we aim to mine all the customer reviews of a product and generate a report about it. This report is different from traditional text summarization. Because we are only concerned with the attributes of the product on which the customers have expressed their opinions.For extracting product attributes and customer reviews mining, this paper builds some resources. These resources include intensifier dictionary, negative words dictionary, first person pronoun dictionary, modal particles dictionary, structure words dictionary, feeling words dictionary and opinion words dictionary. These resources are common resources, rather than resources built for specific areas. When building the opinion words resources, this paper presents a method to build attributes and swing opinion words pairs based on template scoring, which is proved effective to analyze the opinion of the sentences with uncertain opinion words.In recent years, researchers have proposed a number of product-attributes extracted approaches. There are template-based approach and association rules approach, this paper analyzes the shortcomings of these two methods, and proposes a product-attributes extracted method based on templates which are automatically generated. First we extract nouns and noun phrases. At the same time we extract the verb and gerund structure by rules. They all form the candidate product attributes. Frequency information and stop-word dictionary are then combined to filter the candidate product attributes. And then we filter the candidate product attributes based on the template scoring. Finally, we classify the product attributes to master-slave attributes and single attributes. At this stage, we try to discovery new words and identify whether the new words are product attributes. According to the characteristics of different types of opinion sentences, we use appropriate opinion mining technology to deal with problems of semantic polarity. For sentences with different number of opinion words and product attribute words, we use different methods. There are two main methods to analysis sentiment:based on the opinion words matching and based on machine learning. Because based on machine learning methods require a large number of labeled corpuses and have a relatively poor portability. So based on the first method, this paper presents a new sentiment analysis method that based on attribute-opinion pairs to deal with the sentences which have attribute words and opinion words, and proposes a method for sentiment analysis based on opinion-sentence templates to deal with the sentences that have no option words. During dealing with comment sentences that have negative words, this paper proposes a negative shift algorithm.In the stage of report generation, this paper presents a method that based on hierarchical product attributes. We finally generate the report by the product attributes’ sentiment polarity and their slave attributes’ sentiment polarity.

Related Dissertations

  1. Preparation of ITO Quai-1D Nanostructures by Sol-Gel Method Combined with Porous Anodic Aluminum Oxide Template,TB383.1
  2. Study on the Synthesis Bi3.25La0.75Ti3O12(BLT) Nanotubes and Nano Wires,TB383.1
  3. Hydrothermal Synthesis of Oxide Hollow Spheres,TB383.4
  4. Research on Algorithms of 2D Face Template Protection,TP391.41
  5. Word Sense Disambiguation Corpus Automatic Acquisition,TP391.1
  6. Research on Secure Fingerprint Authentication Based on Distance Projection Coding,TP391.4
  7. Incomplete information on the completeness of the system and its knowledge acquisition,TP311.13
  8. On the television program template of intellectual property protection,G222
  9. Copper oxide porous hollow microspheres Preparation and Characterization,O614.121
  10. Research of Website Development Technology Base on Model Driven Development Methodology,TP393.092
  11. User-Steered Development of Personalized Information Sevice,TP393.09
  12. Application of Two Poles Amino-acid Derivative to Synthesis Helical and Chiral Nanomaterial,TB383.1
  13. Preparation of Silica Hollow Spheres with Mesopores in the Walls,TB383.1
  14. Synthesis and Characterization of Polyaniline Micro/nano Structure and Its Composites,TB383.1
  15. Preparation and Photocatalytic Properties of Rare Earth Doped TiO2 and ZnWO4 with Different Morphologies,O643.36
  16. Tracking Events for Food Complaint Documents Based on Ontology,TP391.1
  17. Research on Opinion Target Extraction,TP391.1
  18. Porous MIn 2 S 4 (M = Zn, Cd) photocatalyst Preparation, Characterization and Photocatalytic Performance,O643.36
  19. Research on High Performance Architectural Concrete and Technology for Construction,U444
  20. Synthesis and Modification of SAPO-34 Molecular Sieve Catalyst for Methanol-to-Olefin Reaction,TQ221.2
  21. Team send a single management system design and implementation,TP311.52

CLC: > Industrial Technology > Automation technology,computer technology > Computing technology,computer technology > Computer applications > Information processing (information processing) > Text Processing
© 2012 www.DissertationTopic.Net  Mobile