Skip to content

Information Retrieval Implementing and Evaluating Search Engines

Spend $50 to get a free DVD!

ISBN-10: 0262026511

ISBN-13: 9780262026512

Edition: 2010

Authors: Stefan B�ttcher, Charles L. A. Clarke, Gordon V. Cormack

List price: $62.00
Blue ribbon 30 day, 100% satisfaction guarantee!
what's this?
Rush Rewards U
Members Receive:
Carrot Coin icon
XP icon
You have reached 400 XP and carrot coins. That is the daily max!

Description:

Information retrieval is the foundation for modern search engines. This text offers an introduction to the core topics underlying modern search technologies, including algorithms, data structures, indexing, retrieval, and evaluation. The emphasis is on implementation and experimentation; each chapter includes exercises and suggestions for student projects. Wumpus-a multiuser open-source information-retrieval system developed by one of the authors and available online-provides model implementations and a basis for student work. The modular structure of the book allows instructors to use it in a variety of graduate-level courses, including courses taught from a database systems perspective,…    
Customers also bought

Book details

List price: $62.00
Copyright year: 2010
Publisher: MIT Press
Publication date: 7/23/2010
Binding: Hardcover
Pages: 632
Size: 8.25" wide x 9.25" long x 1.25" tall
Weight: 2.464
Language: English

Complete Table of Contents
Foreword
Preface
Notation
Foundations
Introduction
What Is Information Retrieval?
Information Retrieval Systems
Working with Electronic Text
Test Collections
Open-Source IR Systems
Further Reading
Exercises
Bibliography
Basic Techniques
Inverted Indices
Retrieval and Ranking
Evaluation
Summary
Further Reading
Exercises
Bibliography
Tokens and Terms
English
Characters
Character N-Grams
European Languages
CJK Languages
Further Reading
Exercises
Bibliography
Indexing
Static Inverted Indices
Index Components and Index Life Cycle
The Dictionary
Postings Lists
Interleaving Dictionary and Postings Lists
Index Construction
Other Types of Indices
Summary
Further Reading
Exercises
Bibliography
Query Processing
Query Processing for Ranked Retrieval
Lightweight Structure
Further Reading
Exercises
Bibliography
Index Compression
General-Purpose Data Compression
Symbolwise Data Compression
Compressing Postings Lists
Compressing the Dictionary
Summary
Further Reading
Exercises
Bibliography
Dynamic Inverted Indices
Batch Updates
Incremental Index Updates
Document Deletions
Document Modifications
Discussion and Further Reading
Exercises
Bibliography
Retrieval And Ranking
Probilistic Retrieval
Modeling Relevance
The Binary Independence Model
The Robertson/Sparck Jones Weighting Formula
Term Frequency
Document Length: BM25
Relevance Feedback
Field Weights: BM25F
Experimental Comparison
Further Reading
Exercises
Bibliography
Language Modeling and Related Methods
Generating Queries from Documents
Language Models and Smoothing
Ranking with Language Models
Kullback-Leibler Divergence
Divergence from Randomness
Passage Retrieval and Ranking
Experimental Comparison
Further Reading
Exercises
Bibliography
Categorization and Filtering
Detailed Examples
Classification
Probabilistic Classifiers
Linear Classifiers
Similarity-Based Classifiers
Generalized Linear Models
Information-Theoretic Models
Experimental Comparison
Further Reading
Exercises
Bibliography
Fusion and Metalearning
11.1 Search-Result Fusion
Stacking Adaptive Filters
Stacking Batch Classifiers
Bagging
Boosting
Learning to Rank
Further Reading
Exercises
Bibliography
Evaluation
Measuring Effectiveness
Traditional Effectiveness Measures
The Text REtrieval Conference (TREC)
Using Statistics in Evaluation
Minimizing Adjudication Effort
Nontraditional Effectiveness Measures
Further Reading
Exercises
Bibliography
Measuring Efficiency
Efficiency Criteria
Queueing Theory
Query Scheduling
Caching
Further Reading
Exercises
Bibliography
Applications And Extensions
Parallel Information Retrieval
Parallel Query Processing
MapReduce
Further Reading
Exercises
Bibliography
Web Search
The Structure of the Web
Queries and Users
Static Ranking
Dynamic Ranking
Evaluating Web Search
Web Crawlers
Summary
Further Reading
Exercises
Bibliography
XML Retrieval
The Essence of XML
Paths, Trees, and FLWORs
Indexing and Query Processing
Ranked Retrieval
Evaluation
Further Reading
Exercises
Bibliography
Appendix
Computer Performance
Sequential Versus Random Access on Disk
Sequential Versus Random Access in RAM
Pipelined Execution and Branch Prediction
Index