Rate this book

CUDA Fortran for Scientists and Engineers: Best Practices for Efficient CUDA Fortran Programming

Gregory Ruetsch, Massimiliano Fatica

Rate this book

CUDA Fortran for Scientists and Engineers shows how high-performance application developers can leverage the power of GPUs using Fortran, the familiar language of scientific computing and supercomputer performance benchmarking. The authors presume no prior parallel computing experience, and cover the basics along with best practices for efficient GPU computing using CUDA Fortran. To help you add CUDA Fortran to existing Fortran codes, the book explains how to understand the target GPU architecture, identify computationally intensive parts of the code, and modify the code to manage the data and parallelism and optimize performance. All of this is done in Fortran, without having to rewrite in another language. Each concept is illustrated with actual examples so you can immediately evaluate the performance of your code in comparison.

338 pages, Paperback

First published January 1, 2013

About the author

Gregory Ruetsch

2 books1 follower

What do you think?

Rate this book

Friends & Following

Create a free account to discover what your friends think of this book!

Community Reviews

5 stars

1 (25%)

4 stars

3 (75%)

3 stars

0 (0%)

2 stars

0 (0%)

1 star

0 (0%)

Displaying 1 of 1 review

Jörn Dinkla

Author 1 book2 followers

January 28, 2015

Sorry, this review is in german. I do not have enough time to translate it into english. In a few years this probably will be automatically translated.
------------------------------------------------------------
Beim Stöbern in diesem Buch habe ich gesehen, dass die Autoren wertvolle Optimierungstipps geben. Da bin ich neugierig geworden. Ich selber bin kein Fortran-Programmierer, kann daher das Buch aus dieser Perspektive nicht beurteilen. Aber für mich C++-Entwickler bietet dieses Buch viele wertvolle Hinweise über die Optimierung von CUDA-Kerneln. Als Beispiel kann hier die mathematisch präzise Analyse der "Fused Multiply Add (FMA)”-Operation in Abschnitt 5.2.1 genannt werden. Das Buch wurde von zwei sehr kompetenten Autoren geschrieben und ist sehr solide. Die Autoren arbeiten beide bei NVIDIA, verfügen daher über Insider-Informationen.

Das Buch ist leider schon ein wenig veraltet, die aktuelle Maxwell-Architektur ist erst 2014 erschienen. Im Buch wird die Kepler-Architektur bis zur Compute Capability 3.5 behandelt.

Einen Punkt Abzug gibt es aber, doch, weil im Anhang auf 61 Seiten Source-Code ausgedruckt wurde. Das ist doch heute nicht mehr zeitgemäß, den kann man sich herunterladen.

Displaying 1 of 1 review