Problem-solving in High Performance Computing: A Situational Awareness Approach with Linux
Author | : | |
Rating | : | 4.37 (871 Votes) |
Asin | : | 0128010193 |
Format Type | : | paperback |
Number of Pages | : | 320 Pages |
Publish Date | : | 2017-03-03 |
Language | : | English |
DESCRIPTION:
The book offers unique, real-life examples that emphasize the magnitude and operational complexity of high performance computer systems. The authors offer perspective gained from years of developing Intel-based systems that lead the industry in the number of hosts, software tools, and licenses used in chip design. Problem-Solving in High Performance Computing: A Situational Awareness Approach with Linux focuses on understanding giant computing grids as cohesive systems. Provides insider perspectives on challenges in high performance environments with thousands of servers, millions of cores, distributed data centers, and petabytes of shared dataCovers analysis, troubleshooting, and system optimization, from initial diagnostics to deep dives into kernel crash dumpsPresents macro principles that appeal to a wide range of users and various real-life, complex problemsIncludes examples from 24/7 mission-critical environments with specific HPC operational constraints. Un
kgRun a modern system? You want this book. kg4giy In this day, the idea of the single server, single system is almost an anachronism, but troubleshooting the modern stack is still attacked in the same way. Igor points out that this is not the best way to tackle these problems, because, after all, there are new tools and new techniques for dealing with this new model. Virtual system, virtual memory, host systems, processes and just the shear amount of data that is needed to sort through to a resolution means we all need to come at these problems slightly differently than we used to. But, he also points out that we are also not jettisoning some of . giy said Run a modern system? You want this book.. In this day, the idea of the single server, single system is almost an anachronism, but troubleshooting the modern stack is still attacked in the same way. Igor points out that this is not the best way to tackle these problems, because, after all, there are new tools and new techniques for dealing with this new model. Virtual system, virtual memory, host systems, processes and just the shear amount of data that is needed to sort through to a resolution means we all need to come at these problems slightly differently than we used to. But, he also points out that we are also not jettisoning some of
Igor has twelve years of experience in the hi-tech industry, first as a physicist and lately in various engineering roles, with a strong focus on data-driven methodologies. He has authored several open-source projects and technical books, numerous articles accepted for publication in leading technical journals and magazines, and presented at prestigious international conferences. . To date, Igor has had fifteen p
. From the Back CoverProblem-solving in High Performance Computing is focused on understanding giant computing grids as cohesive systems. It highlights the difference between standalone system troubleshooting and complex problem solving in large, mission critical environments, and addresses the pitfalls of information overload, micro and macro symptoms in problem solving, and offers highly modular, extensible, scalable tools and methods for managing problems in large computing ecosystems.The authors offer perspective gained from years of developing Intel-based systems that lead the industry in the number of hosts, software tools and licenses used in chip design. They cover the entire stack, from identifying the problem, understanding and reproducing it; to solving it, using a methodical, top-down approach, while handling the constraints of a global computing