free shipping on orders over $35*
BUYBACK CART Buyback Cart Total Buyback Cart Total
free shipping on buybacks!

    Understanding High-Dimensional Spaces

    ISBN-10: 3642333974
    ISBN-13: 9783642333972
    Author(s): David B. Skillicorn
    Description: High-dimensional spaces arise as a way of modelling datasets with many attributes. Such a dataset can be directly represented in a space spanned by its attributes, with each record represented as a point in the space with its position depending on  More...
    Buy it from: $43.20
    This item will ship on Tuesday, May 26.

    The first one is FREE! All the information you need in one place—a topical tool kit in digital form. Through June 15, 2015, add a Study Brief to your cart with a book purchase or rental and the discount will be applied at checkout.
    Study Briefs
    Digital only List price: $1.99
    Study Briefs
    MS Excel 2010
    Digital only List price: $1.99
    Study Briefs
    MS Word 2010
    Digital only List price: $1.99
    Study Briefs
    MS PowerPoint 2010
    Digital only List price: $1.99
    Customers Also Bought

    Publisher: Springer
    Binding: Paperback
    Pages: 108
    Size: 6.25" wide x 9.25" long x 0.25" tall
    Weight: 0.396
    Language: English

    High-dimensional spaces arise as a way of modelling datasets with many attributes. Such a dataset can be directly represented in a space spanned by its attributes, with each record represented as a point in the space with its position depending on its attribute values. Such spaces are not easy to work with because of their high dimensionality: our intuition about space is not reliable, and measures such as distance do not provide as clear information as we might expect. There are three main areas where complex high dimensionality and large datasets arise naturally: data collected by online retailers, preference sites, and social media sites, and customer relationship databases, where there are large but sparse records available for each individual; data derived from text and speech, where the attributes are words and so the corresponding datasets are wide, and sparse; and data collected for security, defense, law enforcement, and intelligence purposes, where the datasets are large and wide. Such datasets are usually understood either by finding the set of clusters they contain or by looking for the outliers, but these strategies conceal subtleties that are often ignored. In this book the author suggests new ways of thinking about high-dimensional spaces using two models: a skeleton that relates the clusters to one another; and boundaries in the empty space between clusters that provide new perspectives on outliers and on outlying regions. The book will be of value to practitioners, graduate students and researchers.

    A Natural Representation of Data Similarity
    Basic Structure of High-Dimensional Spaces
    Comparing Attributes
    Comparing Records
    High-Dimensional Spaces
    Improving the Natural Geometry
    Singular Value Decompositions
    Random Projections
    Algorithms that Find Standalone Clusters
    Clusters Based on Density
    Parallel Coordinates
    Independent Component Analysis
    Latent Dirichlet Allocation
    Algorithms that Find Clusters and Their Relationships
    Clusters Based on Distance
    Clusters Based on Distribution
    Semidiscrete Decomposition
    Hierarchical Clustering
    Minimum Spanning Tree with Collapsing
    Overall Process for Constructing a Skeleton
    Algorithms that Wrap Clusters
    1-Class Support Vector Machines
    Autoassociative Neural Networks
    Algorithms to Place Boundaries Between Clusters
    Support Vector Machines
    Random Forests
    Overall Process for Constructing Empty Space
    Spaces with a Single Center
    Using Distance
    Using Density
    Understanding the Skeleton
    Understanding Empty Space
    Spaces with Multiple Centers
    What is a Cluster?
    Identifying Clusters
    Clusters Known Already
    Finding Clusters
    Finding the Skeleton
    Empty Space
    An Outer Boundary and Novel Data
    Interesting Data
    One-Cluster Boundaries
    One-Cluster-Against-the-Rest Boundaries
    Representation by Graphs
    Building a Graph from Records
    Local Similarities
    Embedding Choices
    Using the Embedding for Clustering
    Using Models of High-Dimensional Spaces
    Understanding Clusters
    Structure in the Set of Clusters
    Semantic Stratified Sampling
    Ranking Using the Skeleton
    Ranking Using Empty Space
    Applications to Streaming Data
    Including Contextual Information
    What is Context?
    Changing Data
    Changing Analyst and Organizational Properties
    Changing Algorithmic Properties
    Letting Context Change the Models
    Recomputing the View
    Recomputing Derived Structures
    Recomputing the Clustering

    Buy it from $43.20

    Please choose a buying option

    Your Price:
    You save:
    Buy It Now
    PLUS weekly prizes!
    Get a extra entry for each item purchased or sold.
    what's this?
    Rush Rewards U
    Members Receive:
    You have reached 400 XP and carrot coins. That is the daily max!
    Free shipping on orders over $35*

    *A minimum purchase of $35 is required. Shipping is provided via FedEx SmartPost® and FedEx Express Saver®. Average delivery time is 1 – 5 business days, but is not guaranteed in that timeframe. Also allow 1 - 2 days for processing. Free shipping is eligible only in the continental United States and excludes Hawaii, Alaska and Puerto Rico. FedEx service marks used by permission."Marketplace" orders are not eligible for free or discounted shipping.

    Learn more about the TextbookRush Marketplace.