Data Warehouse and Applications: Data Warehouse (Wuxi) Study Lesson Plan.pptx
What is Data Warehouse?

- A data warehouse is a very large database that stores integrated data of one or more business subject areas.
- A data warehouse is built to support data analysis for decision making.
- An integrated customer data warehouse is a necessary step toward the success of a business intelligence strategy.
- Data warehouses are also used for many other purposes, such as product manufacturing data warehouses.
- Data warehouses become the focal point of the enterprise-wide IT infrastructure.

A data warehouse is simply a single, complete, and consistent store of data obtained from a variety of sources and made available to end users in a way they can understand and use in a business context.

Data Warehouse Definitions

The key elements in the definitions:

- Subject-Oriented: Presentation as business subjects, not as computer files.
- Integrated: A single source of information for and about the business.
- Non-Volatile: Stable information that doesn't change each time an operational process is executed.
- Time-Variant: Containing a history of the business, as well as current business information.
- Accessible: The primary purpose of a data warehouse is to provide readily accessible information to business people.

Subject-Oriented

Integrated

Telecom example (diagram labels):

- Telcom DW System
- Network management system
- Financial management system
- Market analysis and decision system
- Analytical key-account CRM system
- Credit management system
- Customer service system (Call Center 1000/180/112/170)
- Other specialized telecom network business systems
- Business management under the "Project 97" program
- Production scheduling system (wiring / number assignment / activation)
- 112 fault repair system
- Resource management system
- 170 / payment reminder system
- Business (service counter) system
- Local company / IC card / magnetic card management system
- Online business system
- Provincial billing and settlement center
- Billing and accounting system
- Layers: business management layer, service management layer, network and network element management layer, telecom network elements

Non-Volatile

Time Variant

Accessible

Data Warehouse Characteristics

A data warehouse separates its functions from those of operational systems.

Property            Operational              Data Warehouse
Response Time       Sub-second to seconds    Seconds to hours
Data Operation      DML                      Primarily read only
Nature of Data      30-60 days               Snapshots over time
Data Organization   Application              Subject, Time
Size                Small to large           Large to very large
Data Sources        Operational, Internal    Operational, Internal, External
Activities          Processes                Analysis

A data warehouse serves as a central repository that records everything about the business for information retrieval. Data is loaded from internal business operational systems and from external systems.

A data warehouse has a fundamental effect on how users see the data available about the organization, what to do with it, and how to use it for decision making.

A data warehouse is not a single software or hardware product you purchase to provide strategic information. It is a computing environment where users can find strategic information to make better decisions. It is a user-centric environment.

A data warehouse is a blend of many different technologies needed to support the various functions of a data warehouse environment. These technologies all work together in that environment.

Diagram labels: Application, Administration, Storage Management, Analysis, Data Management, Data Modeling, Data Acquisition, Data Warehouse.

Enterprise Data Warehouse

Enterprise data warehouses are funded on a corporate basis. An enterprise data warehouse covers the entire business (corporation), incorporating data from all operational systems. Information is extracted from the operational environment, cleansed, and transformed into a central, integrated, enterprise-wide data warehouse environment, so that all the departments and other internal organizations of the corporation can benefit from a consistent, integrated source of decision support information.

Data Mart

Data marts are often funded on a departmental basis. A data mart is a collection of data tailored to the DSS processing needs of a particular department. It is a subset of an enterprise data warehouse that has been customized to fit the needs of that department. Data marts serve users at a specific level or in a specific department.

Data Warehouse versus Data Mart

Property          Data Warehouse     Data Mart
Scope             Enterprise         Department
Subjects          Multiple           Single-subject
Data Source       Many               Few
Size (Typical)    TB                 TB
Implement Time    Months to years    Months

Data Mart

- Control: A department can completely control the data and the processing that occur inside a data mart.
- Cost: The cost of storage and processing is lower, because the data mart's machine is smaller than the data warehouse's.
- Customization: The data mart's data is customized to suit the particular needs of the department.

Dependent Data Mart:

- The source is the data warehouse.
- The extraction, transformation, and loading process is easy.
- The data mart is part of the enterprise plan.

Independent Data Mart:

- The sources are operational systems and external sources.
- The extraction, transformation, and loading process is difficult.
- The data mart is built to satisfy analytical needs.

Operational Data Store (ODS)

- Integrate information from the production systems.
- Relieve the production systems of reporting and analysis demands.
- Provide access to current data.

An ODS looks very much like a data warehouse in that it is subject-oriented and integrated. However, the remaining characteristics of an ODS are quite different from those of a data warehouse:

- Volatile: An ODS can be updated as a normal part of processing.
- Current-Values: An ODS typically contains daily, weekly, or even monthly data, but the data ages very quickly.
- Detailed Data: An ODS contains detailed data only.

Different Classes of the ODS

- Class I: A synchronous interface in which a very, very small amount of time lapses between an application's transaction and the reflection of that transaction in the ODS.
- Class II: An hour or two passes from the time a transaction is created in the application environment until that transaction is reflected in the ODS.
- Class III: There may be a time lag of between 12 hours and a day while transaction data is collected in the integration and transformation (I&T) interface.
- Class IV: The data is fed into the ODS directly from the data warehouse.

Determining the Class

- Speed of movement of data into the ODS
- Volume of data that must be moved
- Volume of data that must be stored in an intermediate location during I&T processing
- Update of data and integrity of transaction processing
- The time of day the movement needs to occur

Data Architecture

Diagram labels: Data Warehouse; Operational Data Store (ODS); Legacy System (four instances); Call Center; Web; Email; ATM; SFA; Support Operational CRM; Support Analytical CRM.

Example: The Content of a Customer ODS

- Identification: Name, Address, Phone, E-mail
- Preferences: Opt in/out, Medium, Data sharing
- Transactions: Purchases, Cancellations, Returns
- HH/Company Affiliation: Corporate Hierarchy, Household link
- Events: Complaints, Pre-approvals, Inquiries, Sales calls

Architectures of Data Warehouse Systems

- Generic Two-Level Architecture
- Independent Data Mart
- Dependent Data Mart and Operational Data Store

Two-Level Data Warehouse Architecture

Data Warehouse Architecture Based on Independent Data Marts

Data Warehouse Architecture Based on Dependent Data Marts and an Operational Data Store (ODS)
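
As a rough illustration of the ODS classes described earlier in the deck, the sketch below maps a refresh lag to a class. It is not taken from the slides: the function name `ods_class` and the exact cut-offs (5 seconds, 2 hours, 1 day) are assumptions chosen for the example, since the slides only give approximate ranges. In particular, lags between 2 and 12 hours are not covered by the slides and are assigned to Class III here by assumption.

```python
from datetime import timedelta

# Illustrative cut-offs only; the slides give rough ranges, not exact limits.
CLASS_I_MAX = timedelta(seconds=5)   # assumed reading of "very, very small"
CLASS_II_MAX = timedelta(hours=2)    # "an hour or two"
CLASS_III_MAX = timedelta(days=1)    # upper end of "12 hours and a day"


def ods_class(lag: timedelta, fed_from_warehouse: bool = False) -> str:
    """Return the ODS class ("I" to "IV") suggested by the refresh lag.

    Class IV is decided by where the data comes from (the data warehouse),
    not by how quickly it arrives, so it is checked first.
    """
    if fed_from_warehouse:
        return "IV"
    if lag < timedelta(0):
        raise ValueError("lag must be non-negative")
    if lag <= CLASS_I_MAX:
        return "I"
    if lag <= CLASS_II_MAX:
        return "II"
    if lag <= CLASS_III_MAX:
        # Assumption: lags between 2 and 12 hours are also treated as Class III.
        return "III"
    raise ValueError("lag exceeds one day, outside the ranges on the slides")


if __name__ == "__main__":
    print(ods_class(timedelta(seconds=1)))
    print(ods_class(timedelta(minutes=90)))
    print(ods_class(timedelta(hours=18)))
    print(ods_class(timedelta(hours=18), fed_from_warehouse=True))
```

In practice the class would also depend on the other factors listed under "Determining the Class" (data volume, intermediate storage, transaction integrity, time of day), which this sketch deliberately leaves out.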