Bad Data Handbook: Cleaning Up the Data So You Can Get Back to Work

ISBN-10: 1449321887

ISBN-13: 9781449321888

Edition: 2012

Authors: Q. Ethan McCallum

List price: $31.99

Description:

Welcome to data science’s dirty secret: real-world data is messy. Data scientists must spend a good deal of time playing software developer, writing code to clean up data before they can actually do anything constructive with it. It’s a necessary evil, but you can still make the most of it. This practical book walks you through several real-world examples to demonstrate the theory and practice behind working with and cleaning up dirty data. No one tool solves all of the problems well. Wise data scientists learn many tools and learn where each one shines. To that end, this book takes a polyglot approach: most examples will involve R and Python, but expect the occasional smattering of Groovy…
Book details

Copyright year: 2012
Publisher: O'Reilly Media, Incorporated
Publication date: 11/20/2012
Binding: Paperback
Pages: 264
Size: 6.97" wide x 9.09" long x 0.55" tall
Weight: 0.946
Language: English

Q. Ethan McCallum is a consultant, writer, and technology enthusiast, though perhaps not in that order. His work has appeared online on The O’Reilly Network and Java.net, and also in print publications such as C/C++ Users Journal, Doctor Dobb’s Journal, and Linux Magazine. In his professional roles, he helps companies make smart decisions about data and technology.