Best Practices in Data Cleaning: A Complete Guide to Everything You Need to Do Before and After Collecting Your Data

ISBN-10: 1412988012
ISBN-13: 9781412988018
Edition: 2013
Authors: Jason W. Osborne

Book details

List price: $43.00
Copyright year: 2013
Publisher: SAGE Publications, Incorporated
Publication date: 1/10/2012
Binding: Paperback
Pages: 296
Size: 6.00" wide x 9.00" long x 0.60" tall
Weight: 1.100
Language: English

Jason W. Osborne is Professor and Department Chair (effective July 2013) of Educational and Counseling Psychology at the University of Louisville. He teaches and publishes on "best practices" in quantitative and applied research methods. He has served as evaluator or consultant on projects in public education (K-12), instructional technology, higher education, nursing and health care, medicine and medical training, epidemiology, business and marketing, and jury selection in death penalty cases. He is chief editor of Frontiers in Quantitative Psychology and Measurement and is involved with several other journals. Jason also publishes on identification with academics (how a student's self-concept impacts motivation to succeed in academics) and on issues related to social justice and diversity (such as stereotype threat). He is the very proud father of three and, along with his two sons, is currently a second-degree black belt in American Tae Kwon Do.

Table of contents

About the Author
Why Data Cleaning Is Important: Debunking the Myth of Robustness
Origins of Data Cleaning
Are Things Really That Bad?
Why Care About Testing Assumptions and Cleaning Data?
How Can This State of Affairs Be True?
The Best Practices Orientation of This Book
Data Cleaning Is a Simple Process; However…
One Path to Solving the Problem
For Further Enrichment
Best Practices as You Prepare for Data Collection
Power and Planning for Data Collection: Debunking the Myth of Adequate Power
Power and Best Practices in Statistical Analysis of Data
How Null-Hypothesis Statistical Testing Relates to Power
What Do Statistical Tests Tell Us?
How Does Power Relate to Error Rates?
Low Power and Type I Error Rates in a Literature
How to Calculate Power
The Effect of Power on the Replicability of Study Results
Can Data Cleaning Fix These Sampling Problems?
For Further Enrichment
Being True to the Target Population: Debunking the Myth of Representativeness
Sampling Theory and Generalizability
Aggregation or Omission Errors
Including Irrelevant Groups
Nonresponse and Generalizability
Consent Procedures and Sampling Bias
Generalizability of Internet Surveys
Restriction of Range
Extreme Groups Analysis
For Further Enrichment
Using Large Data Sets With Probability Sampling Frameworks: Debunking the Myth of Equality
What Types of Studies Use Complex Sampling?
Why Does Complex Sampling Matter?
Best Practices in Accounting for Complex Sampling
Does It Really Make a Difference in the Results?
So What Does All This Mean?
For Further Enrichment
Best Practices in Data Cleaning and Screening
Screening Your Data for Potential Problems: Debunking the Myth of Perfect Data
The Language of Describing Distributions
Testing Whether Your Data Are Normally Distributed
For Further Enrichment
Dealing With Missing or Incomplete Data: Debunking the Myth of Emptiness
What Is Missing or Incomplete Data?
Categories of Missingness
What Do We Do With Missing Data?
The Effects of Listwise Deletion
The Detrimental Effects of Mean Substitution
The Effects of Strong and Weak Imputation of Values
Multiple Imputation: A Modern Method of Missing Data Estimation
Missingness Can Be an Interesting Variable in and of Itself
Summing Up: What Are Best Practices?
For Further Enrichment
Extreme and Influential Data Points: Debunking the Myth of Equality
What Are Extreme Scores?
How Extreme Values Affect Statistical Analyses
What Causes Extreme Scores?
Extreme Scores as a Potential Focus of Inquiry
Identification of Extreme Scores
Why Remove Extreme Scores?
Effect of Extreme Scores on Inferential Statistics
Effect of Extreme Scores on Correlations and Regression
Effect of Extreme Scores on t-Tests and ANOVAs
To Remove or Not to Remove?
For Further Enrichment
Improving the Normality of Variables Through Box-Cox Transformation: Debunking the Myth of Distributional Irrelevance
Why Do We Need Data Transformations?
When a Variable Violates the Assumption of Normality
Traditional Data Transformations for Improving Normality
Application and Efficacy of Box-Cox Transformations
Reversing Transformations
For Further Enrichment
Does Reliability Matter? Debunking the Myth of Perfect Measurement
What Is a Reasonable Level of Reliability?
Reliability and Simple Correlation or Regression
Reliability and Partial Correlations
Reliability and Multiple Regression
Reliability and Interactions in Multiple Regression
Protecting Against Overcorrecting During Disattenuation
Other Solutions to the Issue of Measurement Error
What If We Had Error-Free Measurement?
An Example From My Research
Does Reliability Influence Other Analyses?
The Argument That Poor Reliability Is Not That Important
Conclusions and Best Practices
For Further Enrichment
Advanced Topics in Data Cleaning
Random Responding, Motivated Misresponding, and Response Sets: Debunking the Myth of the Motivated Participant
What Is a Response Set?
Common Types of Response Sets
Is Random Responding Truly Random?
Detecting Random Responding in Your Research
Does Random Responding Cause Serious Problems With Research?
Example of the Effects of Random Responding
Are Random Responders Truly Random Responders?
Best Practices Regarding Random Responding
Magnitude of the Problem
For Further Enrichment
Why Dichotomizing Continuous Variables Is Rarely a Good Practice: Debunking the Myth of Categorization
What Is Dichotomization and Why Does It Exist?
How Widespread Is This Practice?
Why Do Researchers Use Dichotomization?
Are Analyses With Dichotomous Variables Easier to Interpret?
Are Analyses With Dichotomous Variables Easier to Compute?
Are Dichotomous Variables More Reliable?
Other Drawbacks of Dichotomization
For Further Enrichment
The Special Challenge of Cleaning Repeated Measures Data: Lots of Pits in Which to Fall
Treat All Time Points Equally
What to Do With Extreme Scores?
Missing Data
Now That the Myths Are Debunked …: Visions of Rational Quantitative Methodology for the 21st Century
Name Index
Subject Index