IBM利用Infosphere Datastage、Infosphere CDC、Infosphere Ware
IBM 提供了全面的、业界领先的数据仓库、业务分析解决方案,包括信息整合工具 Information Server;实时、增量数据复制工具 InfoSphere CDC;数据仓库解决方案 Infosphere Warehouse;业务分析工具 Cognos BI 以及一系业务分析应用等。
本文,主要为大家介绍 IBM 数据仓库、业务分析解决方案,特别是如何利用 Infosphere Datastage、Infosphere CDC、Infosphere Warehouse 及 Cognos 快速建立数据分析应用,以帮助大家快速掌握利用 Infosphere Datastage、Infosphere CDC、Infosphere Warehouse 及 Cognos 开发分析应用的基本方法。
针对数据仓库、业务分析应用,IBM 提供了全面的、业界领先的解决方案。软件方面,提供了集成的、端到端的解决方案,包括信息整合工具 Information Server;实时、增量数据复制工具 InfoSphere CDC;数据仓库解决方案 Infosphere Warehouse;业务分析工具 Cognos BI 以及一系列业务分析应用;针对 Big Data 数据分析,提供了 Infosphere BigInsights 及 Infosphere Streams;针对信息监管,提供了 Quality Stage 数据质量管理工具、Infosphere Optim 数据生命周期管理解决方案、Infosphere Guardium 数据安全解决方案,同时,IBM 还提出了 IBM Smart Analytics System 解决方案,它根据用户预计的数据仓库规模,为用户提供预先配置的、经过优化的、可以扩展的软、硬件整体套装配置方案,包括的型号、配置,存储的配置,网络的配置,可以为用户提供合理的硬件选型,并提供一站式解决方案;IBM 还提供了 Netezza 数据仓库一体机,提供性能优异、配置简单的一站式解决方案;在数据仓库模型方面,IBM 提供了针对银行、电信、保险及零售业的数据仓库模型,可以为用户提供针对行业应用的模板,加速行业应用建模。
IBM 典型的数据仓库解决方案如下图所示,我们采用 Infosphere Warehouse 作为企业数据仓库 EDW 系统;采用 Infosphere Warehouse 作为关系型数据集市系统,Cognos 作为多维数据集市系统;通过 Infosphere CDC 将业务系统的数据实时复制到 ODS 系统中;使用 Infosphere Datastage 批量装载数据到数据仓库或数据集市中;使用 Infosphere CDC 实现增量、实时数据装载功能;使用 Cognos BI 及 Cognos 应用实现业务分析功能。
图 1. IBM 数据仓库解决方案架构
下边,我们通过一个简单的“Sales Performance Analysis”的例子来介绍一下如何利用 Infosphere Datastage、Infosphere CDC、Infosphere Warehouse 及 Cognos 快速建立数据分析应用。
本次试验,我们在 DB2 9.7 中创建了 db2olap 作为 OLTP 数据源,使用 Infosphere Warehouse 提供的 DB2 9.7 创建了 olapdb 作为 ODS/ 数据仓库系统,使用 Inforsphere Datastage 8.7 作为 ETL 工具负责将需要的数据从数据源 db2olap 中抽取、并做适当的转换后装入到 olapdb ODS/ 数据仓库中,同时,我们使用 Infosphere CDC 6.5.1 及 Infosphere Datastage 8.7 提供的 CDC Transaction Stage 实现实时、增量数据装载工作,最后使用 Cognos BI 10.1.1 实现最终的报表展现、OLAP 分析及仪表盘应用。
环境准备
本次实验环境,我们采用 Redhat Linux 操作系统,内核 2.6.1,并且在上面安装了如下的软件:
- IBM Information Server 8.7
- Inforsphere CDC 6.5.1
- DB2 9.7.4
- IBM Cognos BI Server 10.1.1
- IBM HTTP Server 7.0
在 windows 7 客户机上安装了如下的软件:
- Cognos BI Model 10.1.1
OLTP 数据源
本次试验,我们在 DB2 9.7 中创建了 db2olap 数据库作为 OLTP 数据源,包括如下表及表结构定义信息:
清单 1. 数据源定义
create table locations_s--location dimension (city_id char(8) not null primary key, prov_id varchar(10), area_id varchar(10), country_id varchar(10) ); create table city_s (city_id char(8) not null primary key, city varchar(10), city_population int); create table prov_s (prov_id varchar(10), prov varchar(10)); create table area_s (area_id varchar(10), area varchar(10)); create table products_s --products dimension (product_id varchar(10) not null primary key, sub_class_id varchar(10), class_id varchar(10)); create table product_s (product_id varchar(10) not null primary key, product varchar(50)); create table subclass_s (sub_class_id varchar(10), sub_class varchar(50)); create table times_s ---time dimension (day_id int not null primary key, day varchar(10), month_id int, month varchar(10), year_id int, year varchar(10)); create table salesperf_s (city_id char(8) not null, product_id varchar(10) not null, day_id int not null, sales decimal(10,2), costs decimal(10,2), constraint fk_day foreign key(day_id) references times_s, constraint fk_location foreign key(city_id) references locations_s, constraint fk_product foreign key(product_id) references products_s );

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

IBM suddenly announced a new round of layoffs! IBM's chief communications officer announced the news at the latest seven-minute staff meeting. This time it will mainly focus on the marketing and communications departments. Although IBM did not issue a formal statement on the scale of layoffs, as recently as January this year, they announced that they would lay off 3,400 people. The company's CEO Arvind Krishna said earlier that the company will replace 8,000 jobs with artificial intelligence in the next five years. This whole sudden operation directly confused the employees...Have these 8,000 people started laying off employees long ago△Source: One Acre and Three Minutes According to statistics from relevant websites, about 204 technology companies have laid off employees so far this year Nearly 50,000 people. Including Google parent company Alphabet, Amazon

IBM recently announced a new set of products that build on the recently launched generative AI coding capabilities for mainframe developers to make it easier to run artificial intelligence on their older data center hardware. IBM said in an update to IBMZ Designed to help customers improve business outcomes by implementing business AI across multiple industries and use cases, it includes a new set of developer toolkits supporting the most widely used AI frameworks and Python programming language, enhancements to Machine Learning forz/OS , and performance improvements in the IBMz/OS mainframe operating system. IBM said these updates will allow clients to improve the accuracy of policy advice, improve the reliability and timeliness of anti-money laundering controls, and reduce

The full English name of ibm is "International Business Machines Corporation", which refers to International Business Machines Corporation or International Business Machines Corporation, with its head office in Armonk, New York.

IBM recently launched a new enterprise investment vehicle, the Enterprise Artificial Intelligence Venture Fund. The fund aims to back startups working to develop enterprise-focused AI products. IBM's move follows other tech giants also ramping up investments in the AI startup ecosystem. A few weeks ago, AnthropicPBC, a large language model developer, received a total of up to $6 billion in financing from Amazon and Google. Recently, Hewlett Packard Enterprise and SAPSE joined a more than $500 million investment in German artificial intelligence provider AlephAlpha GmbH. IBM intends to support a number of startups "from early stage to hyper-growth" through its Enterprise Artificial Intelligence Venture Fund. According to media reports, I

IT House reported on October 31 that IBM recently issued a press release stating that in order to alleviate the current shortage of language programmers proficient in COBOL (a programming language designed and developed in 1959), it launched the generative AI programming assistant "watsonx". IBM launched a service called "IBMZ" in August this year, aiming to convert COBOL into Java language. Recently, the service was officially launched under the name "watsonx". Watsonx's core functions simplify the conversion of COBOL code to Java and are designed to modernize large computers. IBM said the artificial intelligence assistant is extremely accurate in code translation and can reduce

According to statistical analysis based on public data by intellectual property management company Anaqua, the region with the largest number of semiconductor patents in the world in 2023 will be the United States. This is the second consecutive year that the United States has topped the list. The company uses advanced AcclaimIP patent analysis software to analyze semiconductor-related patents published by the U.S. Trademark and Patent Office. The results show that the number of semiconductor patents in 2023 reached 348,774, a slight increase compared with 347,408 in 2022. According to countries and regions, the number of patents obtained by American companies is 162,557, ranking first among all countries, an increase of 18% from 2022. The data shows that Japan ranks second (40,960 cases), followed by China (2

ChatGPT is popular on the Internet, and the AI model training behind it has also attracted widespread attention. IBM Research recently announced that the cloud-native supercomputer Vela it developed can be quickly deployed and used to train basic AI models. Since May 2022, dozens of the company’s researchers have been using this supercomputer to train AI models with tens of billions of parameters. Base models are AI models trained on large amounts of unlabeled data, and their versatility means they can be used for a range of different tasks with just fine-tuning. Their scale is enormous and requires massive and costly computing power. Therefore, as experts say, computing power will become the biggest bottleneck in developing the next generation of large-scale basic models, and training them requires a lot of computing power and time. Training available

IBM put its AI and hybrid cloud strategy at the center of its annual IBMThink conference. While other vendors have been focusing on the consumer side of new AI applications over the past few years, IBM has been developing a new generation of models to better serve enterprise customers. IBM recently announced the launch of watsonx.ai, an AI development platform for hybrid cloud applications. IBM WatsonxAI development services are currently in the technology preview stage and will be generally available in the third quarter of 2023. AI will become a key business tool, ushering in a new era of productivity, creativity and value creation. For enterprises, it’s not just new AI constructs that access large language models (LLMs) through the cloud. Large language models make up Chat
