Analysis and Implementation of Data Management Based on ETL

Author: WangBing
Advisor: JiangNingKang
School: East China Normal University
Major: Software Engineering
Keywords: Data cleansing; Data replication; ODI
CLC: TP311.13
Type: Master's thesis
Year: 2008
Downloads: 123
Citations: 1

Abstract


With the rapid development of computer networks and database technology, and with the growing variety of ways in which people obtain data, data resources have become richer and the volume of data has increased dramatically. Universities, as important members of society, have seen their level of informatization and networking change greatly: many departments now rely, to varying degrees, on computer software to carry out their work, and use it to improve business processes and office efficiency. However, the growing number and variety of information and data sources create many problems for database management, chiefly in two areas: data cleaning and data replication. How can data errors be corrected so as to avoid wrong decisions and reduce decision-making risk? How can departments exchange and share information flexibly while the data is still managed and used in a unified way? The main approach at present is to clean the data and then replicate it synchronously. Cleaning yields data that is credible, secure, and consistent; the cleaned data is then loaded into a public database by a data replication tool so that all departments of the university can share the data resources.
This thesis introduces the principles of data cleaning and data replication based on ETL (Extract, Transform, Load) and applies them in practice. The main work is as follows: (1) it reviews the current state of data cleaning and data replication technology, and its applications, in China and abroad; (2) it identifies the data quality and data consistency problems among the data sources of the university's departments; (3) it analyzes these data quality problems and designs cleaning and replication strategies for them; (4) it describes how the data cleaning and replication tool Oracle Data Integrator (ODI) is used to extract data from the various data sources, clean it according to predetermined rules, and then load it into the target database (the public database) so that data resources can be shared; (5) it notes that strategies for preventing and cleaning suspicious data, and the balance between efficiency and performance in data replication, need further discussion.
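
The extract, clean, and load flow described in item (4) can be illustrated with a small sketch. This is not the thesis's ODI configuration: it uses Python's standard `sqlite3` module as a stand-in, and the table names (`registrar`, `library`, `public_students`), the columns, and the four cleaning rules are all assumptions made for illustration only.

```python
import sqlite3

def extract(conn, table):
    """Extract: read every (student_id, name, email) row from one source table."""
    # The table name comes from a fixed list in this sketch, never from user input.
    return conn.execute(f"SELECT student_id, name, email FROM {table}").fetchall()

def clean(rows):
    """Transform: apply the (assumed) predetermined cleaning rules."""
    seen = set()
    cleaned = []
    for student_id, name, email in rows:
        # Rule 1: reject rows that have no usable key.
        if student_id is None or not str(student_id).strip():
            continue
        key = str(student_id).strip()
        # Rule 2: keep only the first record seen for each key.
        if key in seen:
            continue
        seen.add(key)
        # Rule 3: collapse repeated whitespace in names.
        name = " ".join((name or "").split())
        # Rule 4: trim and lowercase e-mail addresses; empty becomes NULL.
        email = (email or "").strip().lower() or None
        cleaned.append((key, name, email))
    return cleaned

def load(conn, rows):
    """Load: write cleaned rows into the shared (public) table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS public_students "
        "(student_id TEXT PRIMARY KEY, name TEXT, email TEXT)"
    )
    conn.executemany("INSERT OR REPLACE INTO public_students VALUES (?, ?, ?)", rows)
    conn.commit()

def run_demo():
    """Build two hypothetical department tables, run the pipeline, return the result."""
    source = sqlite3.connect(":memory:")
    for table in ("registrar", "library"):
        source.execute(f"CREATE TABLE {table} (student_id TEXT, name TEXT, email TEXT)")
    source.executemany("INSERT INTO registrar VALUES (?, ?, ?)", [
        ("1001", "  Li   Wei ", "LI.WEI@example.edu"),
        (None, "No Key", "x@example.edu"),
    ])
    source.executemany("INSERT INTO library VALUES (?, ?, ?)", [
        ("1001 ", "Li Wei", "li.wei@example.edu"),
        ("1002", "Zhang Min", None),
    ])
    target = sqlite3.connect(":memory:")
    rows = extract(source, "registrar") + extract(source, "library")
    load(target, clean(rows))
    return target.execute(
        "SELECT student_id, name, email FROM public_students ORDER BY student_id"
    ).fetchall()

if __name__ == "__main__":
    for row in run_demo():
        print(row)
```

Keeping the three stages as separate functions mirrors the ETL split: the cleaning rules can be changed or extended without touching how the sources are read or how the public database is written.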

Related Dissertations

  1. Clinical Study of the Effect of Balancing Technique Acupuncture on Lumbar Intervertebral Disc Herniation,R246
  2. Clinical Observation on Surgical Treatment of Lumbar Disc Herniation with Microendoscopic Discectomy,R687.3
  3. Research on Maintenance Engineering Capability Evaluation of Civil Aviation and Development of Decision-making Support System,F426.5
  4. Research on Dynamic Data Replication in Dameng Database System,TP311.13
  5. The application of data mining technology in electricity sales in the auto insurance,TP311.13
  6. Analysis of market sales data mart design,TP311.13
  7. Converged storage system research and design data replication,TP333
  8. Weak consistency of distributed data maintenance strategy study,TP311.13
  9. The Research and Implementation of Distributed Data Replication Over Slow Network Connections,TP311.13
  10. The Research and Application of Data Cleaning Technique,TP311.13
  11. Research and Design of generic data release of the financial advisory business platform,TP311.10
  12. Research and implementation of distributed futures trading platform,TP311.52
  13. Research and Application on Data Preprocessing Algorithms,TP311.13
  14. Research on Data Consistency in Mobile Transaction Processing,TP311.13
  15. Research on Grid Scheduling,TP393.01
  16. Design and Implementation of Data Replication Module in Air Traffic Control System,TP311.52
  17. Design and Implementation of a Mobile DBMS with Multiversioned-DBS,TP311.13
  18. Application of Artificial Intelligence on Data Cleaning,TP18
  19. Data Consistency Check and Data Quick Recovery Methods in Disaster Recovery System,TP393.08
  20. A Study on Outward Direct Investment in Tianjin,F127
  21. The Research and Realization of Basic Information Platform of Railway,TP311.52

CLC: Industrial Technology > Automation technology, computer technology > Computing technology, computer technology > Computer software > Program design, software engineering > Programming > Database theory and systems