Skip to content

Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data

Best in textbook rentals since 2012!

ISBN-10: 0071790535

ISBN-13: 9780071790536

Edition: 2012

Authors: Paul C. Zikopoulos, Chris Eaton, Paul Zikopoulos

List price: $20.00
Blue ribbon 30 day, 100% satisfaction guarantee!
Out of stock
We're sorry. This item is currently unavailable.
what's this?
Rush Rewards U
Members Receive:
Carrot Coin icon
XP icon
You have reached 400 XP and carrot coins. That is the daily max!

Description:

Big Data represents a new era in data exploration and utilization, and IBM is uniquely positioned to help clients navigate this transformation. This book reveals how IBM is leveraging open source Big Data technology, infused with IBM technologies, to deliver a robust, secure, highly available, enterprise-class Big Data platform.The three defining characteristics of Big Data--volume, variety, and velocity--are discussed. You'll get a primer on Hadoop and how IBM is hardening it for the enterprise, and learn when to leverage IBM InfoSphere BigInsights (Big Data at rest) and IBM InfoSphere Streams (Big Data in motion) technologies. Industry use cases are also included in this practical…    
Customers also bought

Book details

List price: $20.00
Copyright year: 2012
Publisher: McGraw-Hill Osborne
Publication date: 10/19/2011
Binding: Paperback
Pages: 176
Size: 6.00" wide x 9.00" long x 0.40" tall
Weight: 0.528
Language: English

Paul C. Zikopoulos, B.A., M.B.A., is the Director of Technical Professionals for IBM Software Group's Information Management division. In this role, he has executive responsibilities for the vitality of its client-facing technical professional community and ensuring they're among the most technically skilled personnel in the marketplace. Paul has written more than 300 magazine articles and 14 books on DB2, including Break Free with DB2: A Tour of Cost Saving Features; Information on Demand: Introduction to DB2 9.5 New Features; DB2 Fundamentals Certification for Dummies; DB2 for Dummies and more. Paul is a DB2 Certified Solutions Expert (BI and DBA).Matt Huras, B.ASc., M.Eng., is an IBM…    

Chris Eaton is a novelist and songwriter/musician from Sackville, New Brunswick, currently living in Toronto, Ontario. He is the author of three published novels: CHRIS EATON, A BIOGRAPHY, The Inactivist and The Grammar Architect . He is also the author of a retrospective book of short fiction called Letters to Thomas Pynchon . He has also recorded a half dozen CDs under the name Rock Plaza Central, including the critically acclaimed Are We Not Horses? .

Foreword
Acknowledgments
About this Book
Big Data: From the Business Perspective
What Is Big Data? Hint: You're a Part of It Every Day
Characteristics of Big Data
Can There Be Enough? The Volume of Data
Variety Is the Spice of Life
How Fast Is Fast? The Velocity of Data
Data in the Warehouse and Data in Hadoop (It's Not a Versus Thing)
Wrapping It Up
Why Is Big Data Important?
When to Consider a Big Data Solution
Big Data Use Cases: Patterns for Big Data Deployment
IT for IT Log Analytics
The Fraud Detection Pattern
They Said What? The Social Media Pattern
The Call Center Mantra: "This Call May Be Recorded for Quality Assurance Purposes"
Risk: Patterns for Modeling and Management
Big Data and the Energy Sector
Why IBM for Big Data?
Big Data Has No Big Brother: It's Ready, but Still Young
What Can Your Big Data Partner Do for You?
The IBM $100 Million Big Data Investment
A History of Big Data Innovation
Domain Expertise Matters
Big Data: From the Technology Perspective
All About Hadoop: The Big Data Lingo Chapter
Just the Facts: The History of Hadoop
Components of Hadoop
The Hadoop Distributed File System
The Basics of MapReduce
Hadoop Common Components
Application Development in Hadoop
Pig and PigLatin
Hive
Jaql
Getting Your Data into Hadoop
Basic Copy Data
Flume
Other Hadoop Components
ZooKeeper
HBase
Oozie
Lucene
Avro
Wrapping It Up
InfoSphere BigInsights: Analytics for Big Data at Rest
Ease of Use: A Simple Installation Process
Hadoop Components Included in BigInsights 1.2
A Hadoop-Ready Enterprise-Quality File System: GPFS-SNC
Extending GPFS for Hadoop: GPFS Shared Nothing Cluster
What Does a GPFS-SNC Cluster Look Like?
GPFS-SNC Failover Scenarios
GPFS-SNC POSIX-Compliance
GPFS-SNC Performance
GPFS-SNC Hadoop Gives Enterprise Qualities
Compression
Splittable Compression
Compression and Decompression
Administrative Tooling
Security
Enterprise Integration
Netezza
DB2 for Linux, UNIX, and Windows
JDBC Module
InfoSphere Streams
InfoSphere DataStage
R Statistical Analysis Applications
Improved Workload Scheduling: Intelligent Scheduler
Adaptive MapReduce
Data Discovery and Visualization: BigSheets
Advanced Text Analytics Toolkit
Machine Learning Analytics
Large-Scale Indexing
BigInsights Summed Up
IBM InfoSphere Streams: Analytics for Big Data in Motion
InfoSphere Streams Basics
Industry Use Cases for InfoSphere Streams
How InfoSphere Streams Works
What's a Stream?
The Streams Processing Language
Source and Sink Adapters
Operators
Streams Toolkits
Enterprise Class
High Availability
Consumability: Making the Platform Easy to Use
Integration is the Apex of Enterprise Class Analysis