R and Data Mining Examples and Case Studies

ISBN-10: 0123969638

ISBN-13: 9780123969637

Edition: 2013

Authors: Yanchang Zhao

List price: $79.95
eBook available
30 day, 100% satisfaction guarantee

If an item you ordered from TextbookRush does not meet your expectations due to an error on our part, simply fill out a return request and then return it by mail within 30 days of ordering it for a full refund of item cost.

Learn more about our returns policy


This book introduces into using R for data mining. Data mining techniques are widely used in government agencies, banks, insurance, retail, telecom, medicine and research. Recently, there is an increasing tendency to do data mining with R, a free software environment for statistical computing and graphics . According to a poll by KDnuggets.com in early 2011, R is the 2nd popular tool for data mining work. By introducing into using R for data mining, this book will have a broad audience from both academia and industry. It targets researchers in the field of data mining, postgraduate students who are interested in data mining, as well as data miners and analysts from industry. For example, many universities have courses on data mining, and the proposed book will be a useful reference for students learning data mining in those courses. There are also many training courses on data mining in industry, such as training by SAS and IBM on data mining. The book will be interested to the course learners as well.The book will present an introduction into using R for data mining applications, coving most popular data mining techniques.Code examples and data will be provided, so that readers can easily learn the techniques.Case studies in real-world applications will be covered, which will help readers to apply the techniques in their work.
eBooks Starting from $79.95
Buy eBooks
what's this?
Rush Rewards U
Members Receive:
You have reached 400 XP and carrot coins. That is the daily max!
Study Briefs

Limited time offer: Get the first one free! (?)

All the information you need in one place! Each Study Brief is a summary of one specific subject; facts, figures, and explanations to help you learn faster.

Add to cart
Study Briefs
Periodic Table Online content $4.95 $1.99
Add to cart
Study Briefs
Calculus 1 Online content $4.95 $1.99
Add to cart
Study Briefs
Business Ethics Online content $4.95 $1.99
Add to cart
Study Briefs
Business Law Online content $4.95 $1.99
Customers also bought

Book details

List price: $79.95
Copyright year: 2013
Publisher: Elsevier Science & Technology Books
Publication date: 12/11/2012
Binding: Hardcover
Pages: 256
Size: 6.25" wide x 9.25" long x 1.00" tall
Weight: 1.452
Language: English

A Senior Data Mining Analyst in Australia Government since 2009. Before joining public sector, he was an Australian Postdoctoral Fellow (Industry) in the Faculty of Engineering & Information Technology at University of Technology, Sydney, Australia. His research interests include clustering, association rules, time series, outlier detection and data mining applications and he has over forty papers published in journals and conference proceedings. He is a member of the IEEE and a member of the Institute of Analytics Professionals of Australia, and served as program committee member for more than thirty international conferences.

List of Figures
List of Abbreviations
Data Mining
The Iris Dataset
The Bodyfat Dataset
Data Import and Export
Save and Load R Data
Import from and Export to .CSV Files
Import Data from SAS
Import/Export via ODBC
Read from Databases
Output to and Input from EXCEL Files
Data Exploration
Have a Look at Data
Explore Individual Variables
Explore Multiple Variables
More Explorations
Save Charts into Files
Decision Trees and Random Forest
Decision Trees with Package party
Decision Trees with Package rpart
Random Forest
Linear Regression
Logistic Regression
Generalized Linear Regression
Non-Linear Regression
The k-Means Clustering
The k-Medoids Clustering
Hierarchical Clustering
Density-Based Clustering
Outlier Detection
Univariate Outlier Detection
Outlier Detection with LOF
Outlier Detection by Clustering
Outlier Detection from Time Series
Time Series Analysis and Mining
Time Series Data in R
Time Series Decomposition
Time Series Forecasting
Time Series Clustering
Dynamic Time Warping
Synthetic Control Chart Time Series Data
Hierarchical Clustering with Euclidean Distance
Hierarchical Clustering with DTW Distance
Time Series Classification
Classification with Original Data
Classification with Extracted Features
k-NN Classification
Further Readings
Association Rules
Basics of Association Rules
The Titanic Dataset
Association Rule Mining
Removing Redundancy
Interpreting Rules
Visualizing Association Rules
Discussions and Further Readings
Text Mining
Retrieving Text from Twitter
Transforming Text
Stemming Words
Building a Term-Document Matrix
Frequent Terms and Associations
Word Cloud
Clustering Words
Clustering Tweets
Clustering Tweets with the k-Means Algorithm
Clustering Tweets with the k-Medoids Algorithm
Packages, Further Readings, and Discussions
Social Network Analysis
Network of Terms
Network of Tweets
Two-Mode Network
Discussions and Further Readings
Case Study I: Analysis and Forecasting of House Price Indices
Importing HPI Data
Exploration of HPI Data
Trend and Seasonal Components of HPI
HPI Forecasting
The Estimated Price of a Property
Case Study II: Customer Response Prediction and Profit Optimization
The Data of KDD Cup 1998
Data Exploration
Training Decision Trees
Model Evaluation
Selecting the Best Tree
Discussions and Conclusions
Case Study III: Predictive Modeling of Big Data with Limited Memory
Data and Variables
Random Forest
Memory Issue
Train Models on Sample Data
Build Models with Selected Variables
Print Rules
Print Rules in Text
Print Rules for Scoring with SAS
Conclusions and Discussion
Online Resources
R Reference Cards
Data Mining
Data Mining with R
Classification/Prediction with R
Time Series Analysis with R
Association Rule Mining with R
Spatial Data Analysis with R
Text Mining with R
Social Network Analysis with R
Data Cleansing and Transformation with R
Big Data and Parallel Computing with R
R Reference Card for Data Mining
General Index
Package Index
Function Index
Free shipping on orders over $35*

*A minimum purchase of $35 is required. Shipping is provided via FedEx SmartPost® and FedEx Express Saver®. Average delivery time is 1 – 5 business days, but is not guaranteed in that timeframe. Also allow 1 - 2 days for processing. Free shipping is eligible only in the continental United States and excludes Hawaii, Alaska and Puerto Rico. FedEx service marks used by permission."Marketplace" orders are not eligible for free or discounted shipping.

Learn more about the TextbookRush Marketplace.