Beautiful Data The Stories Behind Elegant Data Solutions

ISBN-10: 0596157118

ISBN-13: 9780596157111

Edition: 2009

Authors: Toby Segaran, Jeff Hammerbacher

List price: $56.99

Book details

Copyright year: 2009
Publisher: O'Reilly Media, Incorporated
Publication date: 7/31/2009
Binding: Paperback
Pages: 386
Size: 9.00" wide x 6.75" long x 1.00" tall
Weight: 1.738
Language: English

Jeff Hammerbacher is the Vice President of Products and Chief Scientist at Cloudera. Jeff was an Entrepreneur in Residence at Accel Partners immediately prior to joining Cloudera. Before Accel, he conceived, built, and led the Data team at Facebook. The Data team was responsible for driving many of the statistics and machine learning applications at Facebook, as well as building out the infrastructure to support these tasks for massive data sets. The team produced several academic papers and two open source projects: Hive, a system for offline analysis built on top of Hadoop, and Cassandra, a structured storage system on a P2P network. Before joining Facebook, Jeff was a quantitative analyst on…

Preface
Seeing Your Life in Data
Personal Environmental Impact Report (PEIR)
your.flowingdata (YFD)
Personal Data Collection
Data Storage
Data Processing
Data Visualization
The Point
How to Participate
The Beautiful People: Keeping Users in Mind When Designing Data Collection Methods
Introduction: User Empathy Is the New Black
The Project: Surveying Customers About a New Luxury Product
Specific Challenges to Data Collection
Designing Our Solution
Results and Reflection
Embedded Image Data Processing on Mars
Abstract
Introduction
Some Background
To Pack or Not to Pack
The Three Tasks
Slotting the Images
Passing the Image: Communication Among the Three Tasks
Getting the Picture: Image Download and Processing
Image Compression
Downlink, or, It's All Downhill from Here
Conclusion
Cloud Storage Design in a PNUTShell
Introduction
Updating Data
Complex Queries
Comparison with Other Systems
Conclusion
Information Platforms and the Rise of the Data Scientist
Libraries and Brains
Facebook Becomes Self-Aware
A Business Intelligence System
The Death and Rebirth of a Data Warehouse
Beyond the Data Warehouse
The Cheetah and the Elephant
The Unreasonable Effectiveness of Data
New Tools and Applied Research
MAD Skills and Cosmos
Information Platforms As Dataspaces
The Data Scientist
Conclusion
The Geographic Beauty of a Photographic Archive
Beauty in Data: Geograph
Visualization, Beauty, and Treemaps
A Geographic Perspective on Geograph Term Use
Beauty in Discovery
Reflection and Conclusion
Data Finds Data
Introduction
The Benefits of Just-in-Time Discovery
Corruption at the Roulette Wheel
Enterprise Discoverability
Federated Search Ain't All That
Directories: Priceless
Relevance: What Matters and to Whom?
Components and Special Considerations
Privacy Considerations
Conclusion
Portable Data in Real Time
Introduction
The State of the Art
Social Data Normalization
Conclusion: Mediation via Gnip
Surfacing the Deep Web
What Is the Deep Web?
Alternatives to Offering Deep-Web Access
Conclusion and Future Work
Building Radiohead's House of Cards
How It All Started
The Data Capture Equipment
The Advantages of Two Data Capture Systems
The Data
Capturing the Data, aka "The Shoot"
Processing the Data
Post-Processing the Data
Launching the Video
Conclusion
Visualizing Urban Data
Introduction
Background
Cracking the Nut
Making It Public
Revisiting
Conclusion
The Design of Sense.us
Visualization and Social Data Analysis
Data
Visualization
Collaboration
Voyagers and Voyeurs
Conclusion
What Data Doesn't Do
When Doesn't Data Drive?
Conclusion
Natural Language Corpus Data
Word Segmentation
Secret Codes
Spelling Correction
Other Tasks
Discussion and Conclusion
Life in Data: The Story of DNA
DNA As a Data Store
DNA As a Data Source
Fighting the Data Deluge
The Future of DNA
Beautifying Data in the Real World
The Problem with Real Data
Providing the Raw Data Back to the Notebook
Validating Crowdsourced Data
Representing the Data Online
Closing the Loop: Visualizations to Suggest New Experiments
New Experiments
Building a Data Web from Open Data and Free Services
Superficial Data Analysis: Exploring Millions of Social Stereotypes
Introduction
Preprocessing the Data
Exploring the Data
Age, Attractiveness, and Gender
Looking at Tags
Which Words Are Gendered?
Clustering
Conclusion
Bay Area Blues: The Effect of the Housing Crisis
Introduction
How Did We Get the Data?
Geocoding
Data Checking
Analysis
The Influence of Inflation
The Rich Get Richer and the Poor Get Poorer
Geographic Differences
Census Information
Exploring San Francisco
Conclusion
Beautiful Political Data
Example 1: Redistricting and Partisan Bias
Example 2: Time Series of Estimates
Example 3: Age and Voting
Example 4: Public Opinion and Senate Voting on Supreme Court Nominees
Example 5: Localized Partisanship in Pennsylvania
Conclusion
Connecting Data
What Public Data Is There, Really?
The Possibilities of Connected Data
Within Companies
Impediments to Connecting Data
Possible Solutions
Conclusion
Contributors
Index