Performance Analysis and Tuning for General Purpose Graphics Processing Units (GPGPU) (Synthesis Lectures on Computer Architecture)

ISBN-13: 9783031006098
ISBN-10: 3031006097
Edition: 1
Author: Wen-mei W. Hwu, Hyesoon Kim, Richard Vuduc, Sara Baghsorkhi, Jee Choi
Publication date: 2012
Publisher: Springer
Format: Paperback 100 pages


Summary

Performance Analysis and Tuning for General Purpose Graphics Processing Units (GPGPU) (Synthesis Lectures on Computer Architecture) (ISBN-13: 9783031006098, ISBN-10: 3031006097), by Wen-mei W. Hwu, Hyesoon Kim, Richard Vuduc, Sara Baghsorkhi, and Jee Choi, was published by Springer in 2012. It has an overall rating of 4.2 stars. You can purchase or rent the paperback from BooksRun, along with many other new and used books and textbooks. If you're looking to sell your copy, our current buyback offer is $0.30.

Description

General-purpose graphics processing units (GPGPUs) have emerged as an important class of shared-memory parallel processing architectures, with widespread deployment in every computer class from high-end supercomputers to embedded mobile platforms. Relative to today's more traditional multicore systems, GPGPUs have distinctly higher degrees of hardware multithreading (hundreds of hardware thread contexts vs. tens), a return to wide vector units (several tens vs. 1-10), memory architectures that deliver higher peak memory bandwidth (hundreds of gigabytes per second vs. tens), and smaller caches/scratchpad memories (less than 1 megabyte vs. 1-10 megabytes). In this book, we provide a high-level overview of current GPGPU architectures and programming models. We review the principles used in earlier shared-memory parallel platforms, focusing on recent results in both the theory and practice of parallel algorithms, and suggest a connection to GPGPU platforms. We aim to give architects hints for understanding the algorithmic aspects of GPGPUs. We also provide detailed performance analysis and guide optimizations, from high-level algorithms down to low-level, instruction-level optimizations. As a case study, we use the fast multipole method (FMM), an algorithm for n-body particle simulations. We also briefly survey the state of the art in GPU performance analysis tools and techniques.

Table of Contents: GPU Design, Programming, and Trends / Performance Principles / From Principles to Practice: Analysis and Tuning / Using Detailed Performance Analysis to Guide Optimization
