Skip to navigation Skip to content
Careers | Phone Book | A - Z Index
Performance and Algorithms Research

Roofline Performance Model

Roofline is a visually intuitive performance model created by Samuel Williams that is used to bound the performance of various numerical methods and operations running on multicore, manycore, or accelerator processor architectures. Rather than simply using percent-of-peak estimates, the model can be used to assess the quality of attained performance by combining locality, bandwidth, and different parallelization paradigms into a single performance figure. One can examine the resultant Roofline figure in order to determine both the implementation and inherent performance limitations.

  • Introduction The Roofline model is oriented around the interplay between application data locality, data bandwidth, and computational throughput.
  • Software We have created, collaborated, and leveraged, on a number of tools to affect Roofline modeling.
  • People A number of researchers have and continue to contribute to the Roofline effort at LBL Samuel Williams Charlene Yang Khaled Ibrahim Nan Ding Thorsten Kurth Doug Doerfler Yunsong Wang Jonathan Madsen Jack Deslippe Leonid Oliker Terry Ligocki Brian Van Straalen Matthew Cordery Linda Lo   …
  • SC23 Tutorial The Roofline performance model offers an insightful and intuitive method for extracting the key execution characteristics of HPC applications and comparing them against the performance ­bounds of modern CPUs and GPUs. Its ability to abstract the complexity of memory hierarchies and identify the most profitable optimization techniques have made Roofline­-based analysis increasingly popular in the HPC community. The tutorial will introduce the fundamental aspects behind different Roofline…
  • ECP23 Tutorial Video The Roofline performance model offers an insightful and intuitive method for extracting the key execution characteristics of HPC applications and comparing them against the performance ­bounds of modern CPUs and GPUs. Its ability to abstract the complexity of memory hierarchies and identify the most profitable optimization techniques have made Roofline­-based analysis increasingly popular in the HPC community. The tutorial will introduce the fundamental aspects behind different Roofline…
  • Publications The following represents a core list of Roofline-related publications. They can provide a more in-depth discussion of the theory, application, and nuances associated with using the Roofline…