Skip to content

Best Practices in Data Cleaning A Complete Guide to Everything You Need to Do Before and after Collecting Your Data

Best in textbook rentals since 2012!

ISBN-10: 1412988012

ISBN-13: 9781412988018

Edition: 2013

Authors: Jason W. Osborne

List price: $68.00
Shipping box This item qualifies for FREE shipping.
Blue ribbon 30 day, 100% satisfaction guarantee!
what's this?
Rush Rewards U
Members Receive:
Carrot Coin icon
XP icon
You have reached 400 XP and carrot coins. That is the daily max!

Customers also bought

Book details

List price: $68.00
Copyright year: 2013
Publisher: SAGE Publications, Incorporated
Publication date: 1/10/2012
Binding: Paperback
Pages: 296
Size: 6.00" wide x 9.00" long x 0.60" tall
Weight: 0.836

Jason W. Osborne is Professor and Department Chair (effective July 2013) of Educational and Counseling Psychology at the University of Louisville. He teaches and publishes on "best practices" in quantitative and applied research methods. He has served as evaluator or consultant on projects in public education (K-12), instructional technology, higher education, nursing and health care, medicine and medical training, epidemiology, business and marketing, and jury selection in death penalty cases. He is chief editor of Frontiers in Quantitative Psychology and Measurement as well as being involved in several other journals. Jason also publishes on identification with academics (how a student's…    

Preface
About the Author
Why Data Cleaning Is Important: Debunking the Myth of Robustness
Origins of Data Cleaning
Are Things Really That Bad?
Why Care About Testing Assumptions and Cleaning Data?
How Can This State of Affairs Be True?
The Best Practices Orientation of This Book
Data Cleaning Is a Simple Process; However…
One Path to Solving the Problem
For Further Enrichment
Best Practices as You Prepare for Data Collection
Power and Planning for Data Collection: Debunking the Myth of Adequate Power
Power and Best Practices in Statistical Analysis of Data
How Null-Hypothesis Statistical Testing Relates to Power
What Do Statistical Tests Tell Us?
How Does Power Relate to Error Rates?
Low Power and Type I Error Rates in a Literature
How to Calculate Power
The Effect of Power on the Replicability of Study Results
Can Data Cleaning Fix These Sampling Problems?
Conclusions
For Further Enrichment
Appendix
Being True to the Target Population: Debunking the Myth of Representativeness
Sampling Theory and Generalizability
Aggregation or Omission Errors
Including Irrelevant Groups
Nonresponse and Generalizability
Consent Procedures and Sampling Bias
Generalizability of Internet Surveys
Restriction of Range
Extreme Groups Analysis
Conclusion
For Further Enrichment
Using Large Data Sets With Probability Sampling Frameworks: Debunking the Myth of Equality
What Types of Studies Use Complex Sampling?
Why Does Complex Sampling Matter?
Best Practices in Accounting for Complex Sampling
Does It Really Make a Difference in the Results?
So What Does All This Mean?
For Further Enrichment
Best Practices in Data Cleaning and Screening
Screening Your Data for Potential Problems: Debunking the Myth of Perfect Data
The Language of Describing Distributions
Testing Whether Your Data Are Normally Distributed
Conclusions
For Further Enrichment
Appendix
Dealing With Missing or Incomplete Data: Debunking the Myth of Emptiness
What Is Missing or Incomplete Data?
Categories of Missingness
What Do We Do With Missing Data?
The Effects of Listwise Deletion
The Detrimental Effects of Mean Substitution
The Effects of Strong and Weak Imputation of Values
Multiple Imputation: A Modern Method of Missing Data Estimation
Missingness Can Be an Interesting Variable in and of Itself
Summing Up: What Are Best Practices?
For Further Enrichment
Appendixes
Extreme and Influential Data Points: Debunking the Myth of Equality
What Are Extreme Scores?
How Extreme Values Affect Statistical Analyses
What Causes Extreme Scores?
Extreme Scores as a Potential Focus of Inquiry
Identification of Extreme Scores
Why Remove Extreme Scores?
Effect of Extreme Scores on Inferential Statistics
Effect of Extreme Scores on Correlations and Regression
Effect of Extreme Scores on t-Tests and ANOVAs
To Remove or Not to Remove?
For Further Enrichment
Improving the Normality of Variables Through Box-Cox Transformation: Debunking the Myth of Distributional Irrelevance
Why Do We Need Data Transformations?
When a Variable Violates the Assumption of Normality
Traditional Data Transformations for Improving Normality
Application and Efficacy of Box-Cox Transformations
Reversing Transformations
Conclusion
For Further Enrichment
Appendix
Does Reliability Matter? Debunking the Myth of Perfect Measurement
What Is a Reasonable Level of Reliability?
Reliability and Simple Correlation or Regression
Reliability and Partial Correlations
Reliability and Multiple Regression
Reliability and Interactions in Multiple Regression
Protecting Against Overcorrecting During Disattenuation
Other Solutions to the Issue of Measurement Error
What If We Had Error-Free Measurement?
An Example From My Research
Does Reliability Influence Other Analyses?
The Argument That Poor Reliability Is Not That Important
Conclusions and Best Practices
For Further Enrichment
Advanced Topics in Data Cleaning
Random Responding, Motivated Misresponding, and Response Sets: Debunking the Myth of the Motivated Participant
What Is a Response Set?
Common Types of Response Sets
Is Random Responding Truly Random?
Detecting Random Responding in Your Research
Does Random Responding Cause Serious Problems With Research?
Example of the Effects of Random Responding
Are Random Responders Truly Random Responders?
Summary
Best Practices Regarding Random Responding
Magnitude of the Problem
For Further Enrichment
Why Dichotomizing Continuous Variables Is Rarely a Good Practice: Debunking the Myth of Categorization
What Is Dichotomization and Why Does It Exist?
How Widespread Is This Practice?
Why Do Researchers Use Dichotomization?
Are Analyses With Dichotomous Variables Easier to Interpret?
Are Analyses With Dichotomous Variables Easier to Compute?
Are Dichotomous Variables More Reliable?
Other Drawbacks of Dichotomization
For Further Enrichment
The Special Challenge of Cleaning Repeated Measures Data: Lots of Pits in Which to Fall
Treat All Time Points Equally
What to Do With Extreme Scores?
Missing Data
Summary
Now That the Myths Are Debunked …: Visions of Rational Quantitative Methodology for the 21st Century
Name Index
Subject Index