《Algorithms and Data Structures for Massive Datasets》简介:
Algorithms and Data Structures for Massive Datasets teaches you to take advantage of data processing and analytics techniques specifically designed for large distributed datasets. And you’ll be amazed how easy it is to learn such a challenging topic from this friendly guide! Complex concepts are illustrated with interesting, entertaining graphics and fascinating industry stories that show how these techniques have succeeded in the real world. You’ll study examples including Google BigTable, BitCoin, and a smart bed sensor app, learning to build data sketches for processing, querying and exploring large datasets. By the time you’re done, you’ll be able to identify the perfect algorithm to deliver faster and more reliable results for any data intensive system.
what's inside
Sketching data structures for practical problems
Choosing the right database engine for your application
Evaluating and designing efficient on-disk data structures and algorithms
Understanding the algorithmic tradeoffs involved in massive-scale systems
Deriving basic statistics from streaming data
Correctly sampling streaming data
Computing percentiles with limited space resources