Skip to content

Mining of Massive Datasets

ISBN-10: 1107077230

ISBN-13: 9781107077232

Edition: 2nd 2014 (Revised)

Authors: Jure Leskovec, Anand Rajaraman, Jeffrey David Ullman

List price: $52.00
Blue ribbon 30 day, 100% satisfaction guarantee!
Buy eBooks
what's this?
Rush Rewards U
Members Receive:
Carrot Coin icon
XP icon
You have reached 400 XP and carrot coins. That is the daily max!

Description:

Written by leading authorities in database and Web technologies, this book is essential reading for students and practitioners alike. The popularity of the Web and Internet commerce provides many extremely large datasets from which information can be gleaned by data mining. This book focuses on practical algorithms that have been used to solve key problems in data mining and can be applied successfully to even the largest datasets. It begins with a discussion of the map-reduce framework, an important tool for parallelizing algorithms automatically. The authors explain the tricks of locality-sensitive hashing and stream processing algorithms for mining data that arrives too fast for exhaustive processing. Other chapters cover the PageRank idea and related tricks for organizing the Web, the problems of finding frequent itemsets and clustering. This second edition includes new and extended coverage on social networks, machine learning and dimensionality reduction.
Customers also bought

Book details

List price: $52.00
Edition: 2nd
Copyright year: 2014
Publisher: Cambridge University Press
Publication date: 11/13/2014
Binding: Hardcover
Pages: 476
Size: 7.25" wide x 10.00" long x 1.25" tall
Weight: 1.562
Language: English

Jure Leskovec is Assistant Professor of Computer Science at Stanford University. His research focuses on mining large social and information networks. Problems he investigates are motivated by large scale data, the Web and on-line media. This research has won several awards including a Microsoft Research Faculty Fellowship, the Alfred P. Sloan Fellowship, Okawa Foundation Fellowship, and numerous best paper awards. His research has also been featured in popular press outlets such as the New York Times, the Wall Street Journal, the Washington Post, MIT Technology Review, NBC, BBC, CBC and Wired. Leskovec has also authored the Stanford Network Analysis Platform (SNAP, http://snap.stanford.edu), a general purpose network analysis and graph mining library that easily scales to massive networks with hundreds of millions of nodes and billions of edges. You can follow him on Twitter at @jure.

Anand Rajaraman is CEO of Kosmix Inc., a website which organizes the Internet by topic. He is also a consulting assistant professor in the Computer Science Department at Stanford University. In 1996, together with four other engineers, Rajaraman founded Junglee Corp., which pioneered Internet comparison shopping. It was acquired by Amazon.com Inc. in August 1998 for 1.6 million shares of stock valued at $250 million. Rajaraman went on to become Director of Technology at Amazon.com, where he was responsible for technology strategy. He helped launch the transformation of Amazon.com from a retailer into a retail platform, enabling third-party retailers to sell on Amazon.com's website. Third-party transactions now account for almost 25% of all US transactions, and represent Amazon's fastest-growing and most profitable business segment. Rajaraman was also an inventor of the concept underlying Amazon.com's Mechanical Turk. Rajaraman and his business partner, Venky Harinarayan, co-founded Cambrian Ventures, an early stage VC fund, in 2000. Cambrian went on to back several companies later acquired by Google and has funded companies like Mobissimo, Aster Data Systems and TheFind.com.

Jeffrey David Ullman is the Stanford W. Ascherman Professor of Computer Science (Emeritus) at Stanford University. He is also the CEO of Gradiance. Ullman's research interests include database theory, data integration, data mining and education using the information infrastructure. He is one of the founders of the field of database theory and was the doctoral advisor of an entire generation of students who later became leading database theorists in their own right. He was also the Ph.D. advisor of Sergey Brin, one of the co-founders of Google, and served on Google's technical advisory board. In 1995 he was inducted as a Fellow of the Association for Computing Machinery and in 2000 he was awarded the Knuth Prize. Ullman is also the co-recipient (with John Hopcroft) of the 2010 IEEE John von Neumann Medal, for 'laying the foundations for the fields of automata and language theory and many seminal contributions to theoretical computer science'.