《Algorithms and Data Structures for Massive Datasets》简介:

Algorithms and Data Structures for Massive Datasets teaches you to take advantage of data processing and analytics techniques specifically designed for large distributed datasets. And you’ll be amazed how easy it is to learn such a challenging topic from this friendly guide! Complex concepts are illustrated with interesting, entertaining graphics and fascinating industry stories that show how these techniques have succeeded in the real world. You’ll study examples including Google BigTable, BitCoin, and a smart bed sensor app, learning to build data sketches for processing, querying and exploring large datasets. By the time you’re done, you’ll be able to identify the perfect algorithm to deliver faster and more reliable results for any data intensive system.

what's inside

Sketching data structures for practical problems

Choosing the right database engine for your application

Evaluating and designing efficient on-disk data structures and algorithms

Understanding the algorithmic tradeoffs involved in massive-scale systems

Deriving basic statistics from streaming data

Correctly sampling streaming data

Computing percentiles with limited space resources