Performance Analysis and Tuning for General Purpose Graphics Processing Units (GPGPU) eBook Download

Download the eBook 'Performance Analysis and Tuning for General Purpose Graphics Processing Units (GPGPU)' by Hyesoon Kim in PDF and EPUB formats. Access the download links for mirror 1 and mirror 2.

Matthew Addison16 views0:29

🔥 Related Trending Topics

LIVE TRENDS

This video may be related to current global trending topics. Click any trend to explore more videos about what's hot right now!

THIS VIDEO IS TRENDING!

This video is currently trending in Saudi Arabia under the topic 'new zealand national cricket team vs west indies cricket team match scorecard'.

About this video

Download Performance Analysis and Tuning for General Purpose Graphics Processing Units GPGPU by Hyesoon Kim - mirror 1 ---> http://po.st/iXy1BB mirror 2 ---> http://tinyurl.com/pfgohg5 mirror 3 --> ---------------
Synopsis: General-purpose graphics processing units (GPGPU) have emerged as an important class of shared memory parallel processing architectures, with widespread deployment in every computer class from high-end supercomputers to embedded mobile platforms. Relative to traditional multicore systems of today, GPGPUs have distinctly higher degrees of hardware multithreading (hundreds of hardware thread contexts vs. tens), a return to wide vector units (several tens vs. 1-10), memory architectures that deliver higher peak memory bandwidth (hundreds of gigabytes per second vs. tens), and smaller caches/scratchpad memories ( than 1 megabyte vs. 1-10 megabytes).
In this book, we provide a high-level overview of current GPGPU architectures and programming models. We review the principles that are used in previous shared memory parallel platforms, focusing on recent results in both the theory and practice of parallel algorithms, and suggest a connection to GPGPU platforms. We aim to provide hints to architects about understanding algorithm aspect to GPGPU. We also provide detailed performance analysis and guide optimizations from high-level algorithms to low-level instruction level optimizations. As a case study, we use n-body particle simulations known as the fast multipole method (FMM) as an example. We also briefly survey the state-of-the-art in GPU performance analysis tools and techniques.
Table of Contents: GPU Design, Programming, and Trends / Performance Principles / From Principles to Practice: Analysis and Tuning / Using Detailed Performance Analysis to Guide Optimization

Video Information

Views
16

Total views since publication

Duration
0:29

Video length

Published
Mar 6, 2015

Release date