| |
| |
Foreword | |
| |
| |
Preface | |
| |
| |
About the Author | |
| |
| |
| |
Introduction to Data Virtualization | |
| |
| |
| |
Introduction | |
| |
| |
| |
The World of Business Intelligence Is Changing | |
| |
| |
| |
Introduction to Virtualization | |
| |
| |
| |
What Is Data Virtualization? | |
| |
| |
| |
Data Virtualization and Related Concepts | |
| |
| |
| |
Data Virtualization versus Encapsulation and Information Hiding | |
| |
| |
| |
Data Virtualization versus Abstraction | |
| |
| |
| |
Data Virtualization versus Data Federation | |
| |
| |
| |
Data Virtualization versus Data Integration | |
| |
| |
| |
Data Virtualization versus Enterprise Information Integration | |
| |
| |
| |
Definition of Data Virtualization | |
| |
| |
| |
Technical Advantages of Data Virtualization | |
| |
| |
| |
Different Implementations of Data Virtualization | |
| |
| |
| |
Overview of Data Virtualization Servers | |
| |
| |
| |
Open versus Closed Data Virtualization Servers | |
| |
| |
| |
Other Forms of Data Integration | |
| |
| |
| |
The Modules of a Data Virtualization Server | |
| |
| |
| |
The History of Data Virtualization | |
| |
| |
| |
The Sample Database: World Class Movies | |
| |
| |
| |
Structure of This Book | |
| |
| |
| |
Business Intelligence and Data Warehousing | |
| |
| |
| |
Introduction | |
| |
| |
| |
What Is Business Intelligence? | |
| |
| |
| |
Management Levels and Decision Making | |
| |
| |
| |
Business Intelligence Systems | |
| |
| |
| |
The Data Stores of a Business Intelligence System | |
| |
| |
| |
The Data Warehouse | |
| |
| |
| |
The Data Marts | |
| |
| |
| |
The Data Staging Area | |
| |
| |
| |
The Operational Data Store | |
| |
| |
| |
The Personal Data Stores | |
| |
| |
| |
A Comparison of the Different Types of Data Stores | |
| |
| |
| |
Normalized Schemas, Star Schemas, and Snowflake Schemas | |
| |
| |
| |
Normalized Schemas | |
| |
| |
| |
Denormalized Schemas | |
| |
| |
| |
Star Schemas | |
| |
| |
| |
Snowflake Schemas | |
| |
| |
| |
Data Transformation with Extract Transform Load, Extract Load Transform, and Replication | |
| |
| |
| |
Extract Transform Load | |
| |
| |
| |
Extract Load Transform | |
| |
| |
| |
Replication | |
| |
| |
| |
Overview of Business Intelligence Architectures | |
| |
| |
| |
New Forms of Reporting and Analytics | |
| |
| |
| |
Operational Reporting and Analytics | |
| |
| |
| |
Deep and Big Data Analytics | |
| |
| |
| |
Self-Service Reporting and Analytics | |
| |
| |
| |
Unrestricted Ad-Hoc Analysis | |
| |
| |
| |
360-Degree Reporting | |
| |
| |
| |
Exploratory Analysis | |
| |
| |
| |
Text-Based Analysis | |
| |
| |
| |
Disadvantages of Classic Business Intelligence Systems | |
| |
| |
| |
Summary | |
| |
| |
| |
Data Virtualization Server: The Building Blocks | |
| |
| |
| |
Introduction | |
| |
| |
| |
The High-Level Architecture of a Data Virtualization Server | |
| |
| |
| |
Importing Source Tables and Defining Wrappers | |
| |
| |
| |
Defining Virtual Tables and Mappings | |
| |
| |
| |
Examples of Virtual Tables and Mappings | |
| |
| |
| |
Virtual Tables and Data Modeling | |
| |
| |
| |
Nesting Virtual Tables and Shared Specifications | |
| |
| |
| |
Importing Nonrelational Data | |
| |
| |
| |
XML and JSON Documents | |
| |
| |
| |
Web Services | |
| |
| |
| |
Spreadsheets | |
| |
| |
| |
NoSQL Databases | |
| |
| |
| |
Multidimensional Cubes and MDX | |
| |
| |
| |
Semistructured Data | |
| |
| |
| |
Unstructured Data | |
| |
| |
| |
Publishing Virtual Tables | |
| |
| |
| |
The Internal Data Model | |
| |
| |
| |
Updatable Virtual Tables and Transaction Management | |
| |
| |
| |
Data Virtualization Server: Management and Security | |
| |
| |
| |
Introduction | |
| |
| |
| |
Impact and Lineage Analysis | |
| |
| |
| |
Synchronization of Source Tables, Wrapper Tables, and Virtual Tables | |
| |
| |
| |
Security of Data: Authentication and Authorization | |
| |
| |
| |
Monitoring, Management, and Administration | |
| |
| |
| |
Data Virtualization Server: Caching of Virtual Tables | |
| |
| |
| |
Introduction | |
| |
| |
| |
The Cache of a Virtual Table | |
| |
| |
| |
When to Use Caching | |
| |
| |
| |
Caches versus Data Marts | |
| |
| |
| |
Where Is the Cache Kept? | |
| |
| |
| |
Refreshing Caches | |
| |
| |
| |
Full Refreshing, Incremental Refreshing, and Live Refreshing | |
| |
| |
| |
Online Refreshing and Offline Refreshing | |
| |
| |
| |
Cache Replication | |
| |
| |
| |
Data Virtualization Server: Query Optimization Techniques | |
| |
| |
| |
Introduction | |
| |
| |
| |
A Refresher Course on Query Optimization | |
| |
| |
| |
The Ten Stages of Query Processing by a Data Virtualization Server | |
| |
| |
| |
The Intelligence Level of the Data Stores | |
| |
| |
| |
Optimization through Query Substitution | |
| |
| |
| |
Optimization through Pushdown | |
| |
| |
| |
Optimization through Query Expansion (Query Injection) | |
| |
| |
| |
Optimization through Ship Joins | |
| |
| |
| |
Optimization through Sort-Merge Joins | |
| |
| |
| |
Optimization by Caching | |
| |
| |
| |
Optimization and Statistical Data | |
| |
| |
| |
Optimization through Hints | |
| |
| |
| |
Optimization through SQL Override | |
| |
| |
| |
Explaining the Processing Strategy | |
| |
| |
| |
Deploying Data Virtualization in Business Intelligence Systems | |
| |
| |
| |
Introduction | |
| |
| |
| |
A Business Intelligence System Based on Data Virtualization | |
| |
| |
| |
Advantages of Deploying Data Virtualization | |
| |
| |
| |
Disadvantages of Deploying Data Virtualization | |
| |
| |
| |
Strategies for Adopting Data Virtualization | |
| |
| |
| |
Strategy 1: Introducing Data Virtualization in an Existing Business Intelligence System | |
| |
| |
| |
Strategy 2: Developing a New Business Intelligence System with Data Virtualization | |
| |
| |
| |
Strategy 3: Developing a New Business Intelligence System Combining Source and Transformed Data | |
| |
| |
| |
Application Areas of Data Virtualization | |
| |
| |
| |
Unified Data Access | |
| |
| |
| |
Virtual Data Mart | |
| |
| |
| |
Virtual Data Warehouse-Based on Data Marts | |
| |
| |
| |
Virtual Data Warehouse-Based on Production Databases | |
| |
| |
| |
Extended Data Warehouse | |
| |
| |
| |
Operational Reporting and Analytics | |
| |
| |
| |
Operational Data Warehouse | |
| |
| |
| |
Virtual Corporate Data Warehouse | |
| |
| |
| |
Self-Service Reporting and Analytics | |
| |
| |
| |
Virtual Sandbox | |
| |
| |
| |
Prototyping | |
| |
| |
| |
Analyzing Semistructured and Unstructured Data | |
| |
| |
| |
Disposable Reports | |
| |
| |
| |
Extending Business Intelligence Systems with External Users | |
| |
| |
| |
Myths on Data Virtualization | |
| |
| |
| |
Design Guidelines for Data Virtualization | |
| |
| |
| |
Introduction | |
| |
| |
| |
Incorrect Data and Data Quality | |
| |
| |
| |
Different Forms of Incorrect Data | |
| |
| |
| |
Integrity Rules and Incorrect Data | |
| |
| |
| |
Filtering, Flagging, and Restoring Incorrect Data | |
| |
| |
| |
Examples of Filtering Incorrect Data | |
| |
| |
| |
Examples of Flagging Incorrect Data | |
| |
| |
| |
Examples of Restoring Misspelled Data | |
| |
| |
| |
Complex and Irregular Data Structures | |
| |
| |
| |
Codes without Names | |
| |
| |
| |
Inconsistent Key Values | |
| |
| |
| |
Repeating Groups | |
| |
| |
| |
Recursive Data Structures | |
| |
| |
| |
Implementing Transformations in Wrappers or Mappings | |
| |
| |
| |
Analyzing Incorrect Data | |
| |
| |
| |
Different Users and Different Definitions | |
| |
| |
| |
Time Inconsistency of Data | |
| |
| |
| |
Data Stores and Data Transmission | |
| |
| |
| |
Retrieving Data from Production Systems | |
| |
| |
| |
Joining Historical and Operational Data | |
| |
| |
| |
Dealing with Organizational Changes | |
| |
| |
| |
Archiving Data | |
| |
| |
| |
Data Virtualization and Service-Oriented Architecture | |
| |
| |
| |
Introduction | |
| |
| |
| |
Service-Oriented Architectures in a Nutshell | |
| |
| |
| |
Basic Services, Composite Services, Business Process Services, and Data Services | |
| |
| |
| |
Developing Data Services with a Data Virtualization Server | |
| |
| |
| |
Developing Composite Services with a Data Virtualization Server | |
| |
| |
| |
Services and the Internal Data Model | |
| |
| |
| |
Data Virtualization and Master Data Management | |
| |
| |
| |
Introduction | |
| |
| |
| |
Data Is a Critical Asset for Every Organization | |
| |
| |
| |
The Need for a 360-Degree View of Business Objects | |
| |
| |
| |
What Is Master Data? | |
| |
| |
| |
What Is Master Data Management? | |
| |
| |
| |
A Master Data Management System | |
| |
| |
| |
Master Data Management for Integrating Data | |
| |
| |
| |
Integrating Master Data Management and Data Virtualization | |
| |
| |
| |
Data Virtualization, Information Management, and Data Governance | |
| |
| |
| |
Introduction | |
| |
| |
| |
Impact of Data Virtualization on Information Modeling and Database Design | |
| |
| |
| |
Impact of Data Virtualization on Data Profiling | |
| |
| |
| |
Impact of Data Virtualization on Data Cleansing | |
| |
| |
| |
Impact of Data Virtualization on Data Governance | |
| |
| |
| |
The Data Delivery Platform-A New Architecture for Business Intelligence Systems | |
| |
| |
| |
Introduction | |
| |
| |
| |
The Data Delivery Platform in a Nutshell | |
| |
| |
| |
The Definition of the Data Delivery Platform | |
| |
| |
| |
The Data Delivery Platform and Other Business Intelligence Architectures | |
| |
| |
| |
The Requirements of the Data Delivery Platform | |
| |
| |
| |
The Data Delivery Platform versus Data Virtualization | |
| |
| |
| |
Explanation of the Name | |
| |
| |
| |
A Personal Note | |
| |
| |
| |
The Future of Data Virtualization | |
| |
| |
| |
Introduction | |
| |
| |
| |
The Future of Data Virtualization According to Rick F. van der Lans | |
| |
| |
| |
New and Enhanced Query Optimization Techniques | |
| |
| |
| |
Exploiting New Hardware Technology | |
| |
| |
| |
Extending the Design Module | |
| |
| |
| |
Data Quality Features | |
| |
| |
| |
Support for the Push-Model for Data Access | |
| |
| |
| |
Blending of Data Virtualization, Extract Transform Load, Extract Load Transform, and Replication | |
| |
| |
| |
The Future of Data Virtualization According to David Besemer, CTO of Composite Software | |
| |
| |
| |
The Empowered Consumer Gains Ubiquitous Data Access | |
| |
| |
| |
IT's Back Office Becomes the Cloud | |
| |
| |
| |
Data Virtualization of the Future Is a Global Data Fabric | |
| |
| |
| |
Conclusion | |
| |
| |
| |
The Future of Data Virtualization According to Alberto Pan, CTO of Denodo Technologies | |
| |
| |
| |
The Future of Data Virtualization According to James Markarian, CTO of Informatica Corporation | |
| |
| |
| |
How to Maximize Return on Data with Data Virtualization | |
| |
| |
| |
Beyond Looking Under the Hood | |
| |
| |
Bibliography | |
| |
| |
Index | |