| |
| |
Foreword | |
| |
| |
Preface | |
| |
| |
Acknowledgments | |
| |
| |
About This Book | |
| |
| |
| |
The Big Deal About Big Data | |
| |
| |
| |
What Is Big Data? | |
| |
| |
Why Is Big Data Important? | |
| |
| |
Now, the "What Is Big Data?" Part | |
| |
| |
Brought to You by the Letter V: How We Define Big Data | |
| |
| |
What About My Data Warehouse in a Big Data World? | |
| |
| |
Wrapping It Up | |
| |
| |
| |
Applying Big Data to Business Problems: A Sampling of Use Cases | |
| |
| |
When to Consider a Big Data Solution | |
| |
| |
Before We Start: Big Data, Jigsaw Puzzles, and Insight | |
| |
| |
Big Data Use Cases: Patterns for Big Data Deployment | |
| |
| |
You Spent the Money to Instrument It-Now Exploit It! | |
| |
| |
IT for IT: Data Center, Machine Data, and Log Analytics | |
| |
| |
What, Why, and Who? Social Media: Analytics | |
| |
| |
Understanding Customer Sentiment | |
| |
| |
Social Media Techniques Make the World Your Oyster | |
| |
| |
Customer State: Or, Don't Try to Upsell Me When I Am Mad | |
| |
| |
Fraud Detection: "Who Buys an Engagement Ring at 4 a.m.?" | |
| |
| |
Liquidity and Risk: Moving from Aggregate to Individual | |
| |
| |
Wrapping It Up | |
| |
| |
| |
Boost Your Big Data IQ: The IBM Big Data Platform | |
| |
| |
The New Era of Analytics | |
| |
| |
Key Considerations for the Analytic Enterprise | |
| |
| |
The Big Data Platform Manifesto | |
| |
| |
IBM's Strategy for Big Data and Analytics | |
| |
| |
| |
Sustained Investments in Research and Acquisitions | |
| |
| |
| |
Strong Commitment to Open Source Efforts and a Fostering of Ecosystem Development | |
| |
| |
| |
Support Multiple Entry Points to Big Data | |
| |
| |
A Flexible, Platform-Based Approach to Big Data | |
| |
| |
Wrapping It Up | |
| |
| |
| |
Analytics for Big Data at Rest | |
| |
| |
| |
A Big Data Platform for High-Performance Deep Analytics: IBM PureData Systems | |
| |
| |
Netezza's Design Principles | |
| |
| |
Appliance Simplicity: Minimize the Human Effort | |
| |
| |
Hardware Acceleration: Process Analytics Close to the Data Store | |
| |
| |
Balanced, Massively Parallel Architecture: Deliver Linear Scalability | |
| |
| |
Modular Design: Support Flexible Configurations and Extreme Scalability | |
| |
| |
What's in the Box? The Netezza Appliance Architecture Overview | |
| |
| |
A Look Inside the Netezza Appliance | |
| |
| |
The Secret Sauce: FPGA-Assisted Analytics | |
| |
| |
Query Orchestration in Netezza | |
| |
| |
Platform for Advanced Analytics | |
| |
| |
Extending the Netezza Analytics Platform with Hadoop | |
| |
| |
Customers' Success Stories: The Netezza Experience | |
| |
| |
T-Mobile: Delivering Extreme Performance with Simplicity at the Petabyte Scale | |
| |
| |
State University of New York: Using Analytics to Help Find a Cure for Multiple Sclerosis | |
| |
| |
NYSE Euronext: Reducing Data Latency and Enabling Rapid Ad-Hoc Searches | |
| |
| |
| |
IBM's Enterprise Hadoop: InfoSphere Biglnsights | |
| |
| |
What the Hadoop! | |
| |
| |
Where Elephants Come From: The History of Hadoop | |
| |
| |
Components of Hadoop and Related Projects | |
| |
| |
Hadoop 2.0 | |
| |
| |
What's in the Box: The Components of InfoSphere Biglnsights | |
| |
| |
Hadoop Components Included in InfoSphere Biglnsights 2.0 | |
| |
| |
The Biglnsights Web Console | |
| |
| |
The Biglnsights Development Tools | |
| |
| |
Biglnsights Editions: Basic and Advanced | |
| |
| |
Deploying Biglnsights | |
| |
| |
Ease of Use: A Simple Installation Process | |
| |
| |
A Low-Cost Way to Get Started: Running Biglnsights on the Cloud | |
| |
| |
Higher-Class Hardware: IBM PowerLinux Solution for Big Data | |
| |
| |
Cloudera Support | |
| |
| |
Analytics: Exploration, Development, and Deployment | |
| |
| |
Advanced Text Analytics Toolkit | |
| |
| |
Machine Learning for the Masses: Deep Statistical Analysis on Biglnsights | |
| |
| |
Analytic Accelerators: Finding Needles in Haystacks of Needles? | |
| |
| |
Apps for the Masses: Easy Deployment and Execution of Custom Applications | |
| |
| |
Data Discovery and Visualization: BigSheets | |
| |
| |
The Biglnsights Development Environment | |
| |
| |
The Biglnsights Application Lifecycle | |
| |
| |
Data Integration | |
| |
| |
The Anlaytics-Based IBM PureData Systems and DB2 | |
| |
| |
JDBC Module | |
| |
| |
InfoSphere Streams for Data in Motion | |
| |
| |
InfoSphere DataStage | |
| |
| |
Operational Excellence | |
| |
| |
Securing the Cluster | |
| |
| |
Monitoring All Aspects of Your Cluster | |
| |
| |
Compression | |
| |
| |
Improved Workload Scheduling: Intelligent Scheduler | |
| |
| |
Adaptive MapReduce | |
| |
| |
A Flexible File System for Hadoop: GPFS-FPO | |
| |
| |
Wrapping It Up | |
| |
| |
| |
Analytics for Big Data in Motion | |
| |
| |
| |
Real-Time Analytical Processing with InfoSphere Streams | |
| |
| |
The Basics: InfoSphere Streams | |
| |
| |
How InfoSphere Streams Works | |
| |
| |
What's a Lowercase "stream"? | |
| |
| |
Programming Streams Made Easy | |
| |
| |
The Streams Processing Language | |
| |
| |
Source and Sink Adapters | |
| |
| |
Operators | |
| |
| |
Streams Toolkits | |
| |
| |
Enterprise Class | |
| |
| |
High Availability | |
| |
| |
Integration Is the Apex of Enterprise Class Analysis | |
| |
| |
Industry Use Cases for InfoSphere Streams | |
| |
| |
Telecommunications | |
| |
| |
Enforcement, Defense, Surveillance, and Cyber Security | |
| |
| |
Financial Services Sector | |
| |
| |
Health and Life Sciences | |
| |
| |
And the Rest We Can't Fit in This Book | |
| |
| |
Wrapping It Up | |
| |
| |
| |
Unlocking Big Data | |
| |
| |
| |
If Data Is the New Oil-You Need Data Exploration and Discovery | |
| |
| |
Indexing Data from Multiple Sources with InfoSphere Data Explorer | |
| |
| |
Connector Framework | |
| |
| |
The Data Explorer Processing Layer | |
| |
| |
User Management Layer | |
| |
| |
Beefing Up InfoSphere Biglnsights | |
| |
| |
An App with a View: Creating Information Dashboards with InfoSphere Data Explorer Application Builder | |
| |
| |
Wrapping It Up: Data Explorer Unlocks Big Data | |
| |
| |
| |
Big Data Analytic Accelerators | |
| |
| |
| |
Differentiate Yourself with Text Analytics | |
| |
| |
What Is Text Analysis? | |
| |
| |
The Annotated Query Language to the Rescue! | |
| |
| |
Productivity Tools That Make All the Difference | |
| |
| |
Wrapping It Up | |
| |
| |
| |
The IBM Big Data Analytic Accelerators | |
| |
| |
The IBM Accelerator for Machine Data Analytics | |
| |
| |
Ingesting Machine Data | |
| |
| |
Extract | |
| |
| |
Index | |
| |
| |
Transform | |
| |
| |
Statistical Modeling | |
| |
| |
Visualization | |
| |
| |
Faceted Search | |
| |
| |
The IBM Accelerator for Social Data Analytics | |
| |
| |
Feedback Extractors: What Are People Saying? | |
| |
| |
Profile Extractors: Who Are These People? | |
| |
| |
Workflow: Pulling It All Together | |
| |
| |
The IBM Accelerator for Telecommunications Event Data Analytics | |
| |
| |
Call Detail Record Enrichment | |
| |
| |
Network Quality Monitoring | |
| |
| |
Customer Experience Indicators | |
| |
| |
Wrapping It Up: Accelerating Your Productivity | |
| |
| |
| |
Integration and Governance in a Big Data World | |
| |
| |
| |
To Govern or Not to Govern: Governance in a Big Data World | |
| |
| |
Why Should Big Data be Governed? | |
| |
| |
Competing on Information and Analytics | |
| |
| |
The Definition of Information Integration and Governance | |
| |
| |
An Information Governance Process | |
| |
| |
The IBM Information Integration and Governance Technology Platform | |
| |
| |
IBM InfoSphere Business Information Exchange | |
| |
| |
IBM InfoSphere Information Server | |
| |
| |
Data Quality | |
| |
| |
Master Data Management | |
| |
| |
Data Lifecycle Management | |
| |
| |
Privacy and Security | |
| |
| |
Wrapping It Up: Trust Is About Turning Big Data into Trusted Information | |
| |
| |
| |
Integrating Big Data in the Enterprise | |
| |
| |
Analytic Application Integration | |
| |
| |
IBM Cognos Software | |
| |
| |
IBM Content Analytics with Enterprise Search | |
| |
| |
SPSS | |
| |
| |
SAS | |
| |
| |
Unica | |
| |
| |
Q1 Labs: Security Solutions | |
| |
| |
IBM i2 Intelligence Analysis Platform | |
| |
| |
Platform Symphony MapReduce | |
| |
| |
Component Integration Within the IBM Big Data Platform | |
| |
| |
InfoSphere Biglnsights | |
| |
| |
InfoSphere Streams | |
| |
| |
Data Warehouse Solutions | |
| |
| |
The Advanced Text Analytics Toolkit | |
| |
| |
InfoSphere Data Explorer | |
| |
| |
InfoSphere Information Server | |
| |
| |
InfoSphere Master Data Management | |
| |
| |
InfoSphere Guardium | |
| |
| |
InfoSphere Optim | |
| |
| |
WebSphere Front Office | |
| |
| |
WebSphere Decision Server: iLog Rule | |
| |
| |
Rational | |
| |
| |
Data Repository-Level Integration | |
| |
| |
Enterprise Platform Plug-ins | |
| |
| |
Development Tooling | |
| |
| |
Analytics | |
| |
| |
Visualization | |
| |
| |
Wrapping It Up | |