John Shalf

John M. Shalf
Department Head & Group Lead
Phone: +1 510 486 4508 | +1 510 316 9427
Fax: +1 510 486 4300

Journal Articles

M. Wehner, L. Oliker, J. Shalf, D. Donofrio, L. Drummond, et al., "Hardware/Software Co-design of Global Cloud System Resolving Models", Journal of Advances in Modeling Earth Systems (JAMES), 2011, 3, M1000:22, doi: 10.1029/2011MS000073

D. Donofrio, L. Oliker, J. Shalf, M. Wehner, C. Rowen, J. Krueger, S. Kamil, M. Mohiyuddin, "Energy-Efficient Computing for Extreme-Scale Science", IEEE Computer, November 2009, 42, No. :62-71, doi: 10.1109/MC.2009.35

M. Wehner, L. Oliker, and J. Shalf, "Low Power Supercomputers", IEEE Spectrum, October 2009,

High-performance computing for such things as climate modeling is not going to advance at anything like the pace it has during the last two decades unless we apply fundamentally new ideas. Here we describe one possible approach. Rather than constructing supercomputers from the kinds of microprocessors found in fast desktop computers or servers, we propose adopting designs and design principles drawn, oddly enough, from the portable-electronics marketplace.

S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick, J. Demmel, "Optimization of Sparse Matrix-Vector Multiplication on Emerging Multicore Platforms", Parallel Computing Journal, 2009, doi: 10.1016/j.parco.2008.12.006

K. Datta, S. Kamil, S. Williams, L. Oliker, J. Shalf, K. Yelick, "Optimization and Performance Modeling of Stencil Computations on Modern Microprocessors", SIAM Review (SIREV), cover article, 2009, doi: 10.1137/070693199

S. Kamil, L. Oliker, A. Pinar, J. Shalf, "Communication Requirements and Interconnect Optimization for High-End Scientific Applications", IEEE Transactions on Parallel and Distributed Systems (TPDS), 2009,

Samuel Williams, Jonathan Carter, Leonid Oliker, John Shalf, Katherine Yelick, "Optimization of a Lattice Boltzmann Computation on State-of-the-art Multicore Platforms", J. Parallel Distrib. Comput., 2009, 69:762--777, doi: 10.1016/j.jpdc.2009.04.002

J. Borrill, L. Oliker, J. Shalf, H. Shan, A. Uselton, "HPC Global File System Performance Analysis Using A Scientific-Application Derived Benchmark", Parallel Computing Journal, 2009,

S. Kamil, L. Oliker, A. Pinar, J. Shalf, "Communication Requirements and Interconnect Optimization for High-End Scientific Applications", IEEE Transactions on Parallel and Distributed Systems, Volume . IEEE TPDS: 2009, January 1, 2009,

S. Kamil, L. Oliker, A. Pinar, J. Shalf, "Communication Requirements and Interconnect Optimization for High-End Scientific Applications", IEEE Transactions on Parallel and Distributed Systems, Volume . IEEE, 2009,

John Shalf and Jason Hick (Arie Shoshani and Doron Rotem), "Storage Technology Fundamentals", Scientific Data Management: Challenges, Technology, and Deployment, Volume . Chapman & Hall/CRC, 2009,

S. Williams, K. Datta, J. Carter, L. Oliker, J. Shalf, K. Yelick, D. Bailey, "PERI: Auto-tuning Memory Intensive Kernels for Multicore", SciDAC PI Meeting, Journal of Physics: Conference Series, 125 012038, July 2008,

M. Wehner, L. Oliker, J. Shalf, "Performance Characterization of the World's Most Powerful Supercomputers", International Journal of High Performance Computing Applications (IJHPCA), April 2008,

M. Wehner, L. Oliker, J. Shalf, "Towards Ultra-High Resolution Models of Climate and Weather", International Journal of High Performance Computing Applications (IJHPCA), April 2008,

Shantenu Jha, Hartmut Kaiser, Andre Merzky, John Shalf, "SAGA - The Simple API for Grid Applications - Motivation, Design, and Implementation", Encyclopedia of Grid Technologies and Applications, Volume 1. Information Science Reference (www.info-sci-ref.com), 2008,

S. Williams, J. Shalf, L. Oliker, P. Husbands, S. Kamil, K. Yelick, "Scientific Computing Kernels on the Cell Processor", International Journal of Parallel Programming (IJPP), July 2007, doi: 10.1007/s10766-007-0034-5

Jonathan Carter, Leonid Oliker, John Shalf, "Performance Evaluation of Scientific Applications on Modern Parallel Vector Systems", Lecture Notes in Computer Science, 2007, 4395:490--503,

J. Shalf, L. Oliker, M. Lijewski, S. Kamil, J. Carter, A. Canning, S. Ethier, "Performance Characteristics of Potential Petascale Scientific Applications", Chapman & Hall/CRC Computational Science, 2007, 1,

Book Chapter

John Shalf, "The New Landscape of Parallel Computer Architecture", Journal of Physics: Conference Series, Volume . IOP Electronics Journals, 2007,

Tom Goodale, Shantenu Jha, Hartmut Kaiser, Thilo Kielmann, Pascal Kleijer, Gregor von Laszewski, Craig Lee, Andre Merzky, Hrabri Rajic, John Shalf, "SAGA: A Simple API for Grid Applications -- High-Level Application Programming on the Grid", Computational Methods in Science and Technology, Volume 12(1). Poznan, 2006, LBNL 59066,

H. Simon, W. Kramer, W. Saphir, J. Shalf, D. Bailey, L. Oliker, et al., "Science Driven System Architecture: A New Process for Leadership Class Computing", Journal of the Earth Simulator, 2005,

L. Oliker, A. Canning, J. Carter, J. Shalf, H. Simon, S. Ethier, D. Parks, S. Kitawaki, Y. Tsuda, T. Sato, "Performance of Ultra-Scale Applications on Leading Vector and Scalar HPC Platforms", Journal of the Earth Simulator, January 2005, 3,

Leonid Oliker, Andrew Canning, Jonathan Carter, John Shalf, David Skinner, Stéphane Ethier, Rupak Biswas, Jahed Djomehri, Rob Van der Wijngaart, "Performance Evaluation of the SX-6 Vector Architecture for Scientific Computations", Concurrency and Computation: Practice and Experience, January 2005, 17:69--93, doi: 10.1002/cpe.884

John Shalf, John Bell, Andrew Canning, Lin-Wang Wang, Juan Meza, Rob Ryne, Ji Qiang, Kathy Yelick, "Berkeley Petascale Applications", 2005,

Conference Papers

George Michelogiannakis, Alexander Williams, Samuel Williams, John Shalf, "Collective Memory Transfers for Multi-Core Chips", International Conference on Supercomputing, June 2014,

M. Jung, E. H. Wilson III, W. Choi, J. Shalf, H. M. Aktulga, C. Yang, E. Saule, U. V. Catalyurek, M. Kandemir, "Exploring the Future of Out-of-core Computing with Compute-Local Non-Volatile Memory", International Conference for High Performance Computing, Networking, Storage and Analysis 2013 (SC13), NY, USA, ACM New York, November 17, 2013, doi: 10.1145/2503210.2503261

George Michelogiannakis, John Shalf, "Variable-Width Datapath for On-Chip Network Static Power Reduction", Under review (currently LBNL technical report in process), November 2013,

  • Download File: abn.pdf (pdf: 440 KB)

George Michelogiannakis, Xiaoye S. Li, David H. Bailey, John Shalf, "Extending Summation Precision for Network Reduction Operations", 25th International Symposium on Computer Architecture and High Performance Computing, IEEE Computer Society, October 2013,

Double precision summation is at the core of numerous important algorithms such as Newton-Krylov methods and other operations involving inner products, but the effectiveness of summation is limited by the accumulation of rounding errors, which are an increasing problem with the scaling of modern HPC systems and data sets. To reduce the impact of precision loss, researchers have proposed increased- and arbitrary-precision libraries that provide reproducible error or even bounded error accumulation for large sums, but do not guarantee an exact result. Such libraries can also increase computation time significantly. We propose big integer (BigInt) expansions of double precision variables that enable arbitrarily large summations without error and provide exact and reproducible results. This is feasible with performance comparable to that of double-precision floating point summation, by the inclusion of simple and inexpensive logic into modern NICs to accelerate performance on large-scale systems. 
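The central idea in the abstract above, that every double can be expanded into a wide integer so that sums become exact and order-independent, can be illustrated in software. The following is only a minimal sketch under stated assumptions, not the paper's implementation: it uses Python's arbitrary-precision integers in place of the proposed fixed-width BigInt format, it does not model the NIC logic at all, and the names `SCALE_BITS`, `to_bigint`, and `exact_sum` are hypothetical.

```python
from fractions import Fraction

# Assumption for this sketch: use a fixed-point scale of 2**-1074, the
# smallest positive subnormal double, so every finite double is an exact
# integer multiple of the scale.
SCALE_BITS = 1074


def to_bigint(x: float) -> int:
    """Return the exact integer x * 2**SCALE_BITS for a finite double x."""
    # as_integer_ratio() is exact; for a double the denominator is a
    # power of two no larger than 2**1074, so the division below is exact.
    num, den = x.as_integer_ratio()
    return num * ((1 << SCALE_BITS) // den)


def exact_sum(values) -> float:
    """Sum doubles with integer arithmetic, rounding only once at the end."""
    total = 0
    for v in values:
        total += to_bigint(v)  # integer addition: no rounding, any order
    # A single rounding step converts the exact total back to a double.
    return float(Fraction(total, 1 << SCALE_BITS))


if __name__ == "__main__":
    data = [1e100, 1.0, -1e100]
    print(sum(data))        # naive floating-point summation loses the 1.0
    print(exact_sum(data))  # integer-expanded summation keeps it
```

Because integer addition is associative, the result does not depend on the order in which partial sums are combined, which is the property that makes such a scheme attractive for network reduction operations.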

Cy Chan, Didem Unat, Michael Lijewski, Weiqun Zhang, John Bell, John Shalf, "Software Design Space Exploration for Exascale Combustion Co-Design", International Supercomputing Conference (ISC), Leipzig, Germany, June 16, 2013,

S. Williams, D. Kalamkar, A. Singh, A. Deshpande, B. Van Straalen, M. Smelyanskiy, A. Almgren, P. Dubey, J. Shalf, L. Oliker, "Optimization of Geometric Multigrid for Emerging Multi- and Manycore Processors", Supercomputing (SC), November 2012,

Hongzhang Shan, Brian Austin, Nicholas Wright, Erich Strohmaier, John Shalf, Katherine Yelick, "Accelerating Applications at Scale Using One-Sided Communication", Santa Barbara, CA, The 6th Conference on Partitioned Global Address Programming Models, October 10, 2012,

Mehmet Balman, Eric Pouyoul, Yushu Yao, E. Wes Bethel, Burlen Loring, Prabhat, John Shalf, Alex Sim, and Brian L. Tierney, "Experiences with 100G Network Applications", In Proceedings of the Fifth International Workshop on Data-intensive Distributed Computing, in conjunction with ACM High Performance Distributed Computing (HPDC) Conference, 2012, Delft, Netherlands, June 2012, LBNL 5603E, doi: http://dx.doi.org/10.1145/2286996.2287004

100Gbps networking has finally arrived, and many research and educational institutions have begun to deploy 100Gbps routers and services. ESnet and Internet2 worked together to make 100Gbps networks available to researchers at the Supercomputing 2011 conference in Seattle, Washington. In this paper, we describe two of the first applications to take advantage of this network. We demonstrate a visualization application that enables remotely located scientists to gain insights from large datasets. We also demonstrate climate data movement and analysis over the 100Gbps network. We describe a number of application design issues and host tuning strategies necessary for enabling applications to scale to 100Gbps rates.

Hongzhang Shan, Nicholas J. Wright, John Shalf, Katherine Yelick, Marcus Wagner, Nathan Wichmann, "A Preliminary Evaluation of the Hardware Acceleration of the Cray Gemini Interconnect for PGAS Languages and Comparison with MPI", PMBS'11 Proceedings of the Second International Workshop on Performance Modeling, Benchmarking, and, November 10, 2011,

J. Krueger, D. Donofrio, J. Shalf, M. Mohiyuddin, S. Williams, L. Oliker, F. Pfreundt, "Hardware/Software Co-design for Energy-Efficient Seismic Modeling", Supercomputing (SC), November 2011,

Samuel Williams, Leonid Oliker, Jonathan Carter, John Shalf, "Extracting Ultra-scale Lattice Boltzmann Performance via Hierarchical and Distributed Auto-tuning", SC 11, New York, NY, USA, ACM, January 2011, 55:1--55:1, doi: 10.1145/2063384.2063458

Kamesh Madduri, Khaled Z. Ibrahim, Samuel Williams, Stephane Ethier, Eun-Jin Im, John Shalf, Leonid Oliker, "Gyrokinetic Toroidal Simulations on Leading Multi- and Manycore HPC Systems", The International Conference for High Performance Computing, Networking, Storage, and Analysis, January 2011,

Keith Jackson, Lavanya Ramakrishnan, Krishna Muriki, Shane Canon, Shreyas Cholia, John Shalf, Harvey Wasserman, Nicholas Wright, "Performance Analysis of High Performance Computing Applications on the Amazon Web Services Cloud", 2nd IEEE International Conference on Cloud Computing Technology and Science, 2010,

Lavanya Ramakrishnan, Keith Jackson, Shane Canon, Shreyas Cholia, John Shalf, "Defining Future Platform Requirements for e-Science Cloud (Position paper)", ACM Symposium on Cloud Computing 2010 (ACM SOCC 2010), Indianapolis, Indiana, 2010,

Mark Howison, Quincey Koziol, David Knaak, John Mainzer, John Shalf, "Tuning HDF5 for Lustre File Systems", Proceedings of 2010 Workshop on Interfaces and Abstractions for Scientific Data Storage (IASDS10), Heraklion, Crete, Greece, September 2010, LBNL 4803E,

G. Hendry, J. Chan, S. Kamil, L. Oliker, J. Shalf, L. Carloni, K. Bergman, "Silicon Nanophotonic Network-On-Chip using TDM Arbitration", Hot Interconnects, August 2010,

J. A. Colmenares, S. Bird, H. Cook, P. Pearce, D. Zhu, J. Shalf, S. Hofmeyr, K. Asanovic, J. Kubiatowicz, "Resource Management in the Tessellation Manycore OS", 2nd Usenix Workshop on Hot Topics in Parallelism (HotPar), June 15, 2010,

Shoaib Kamil, Cy Chan, Leonid Oliker, John Shalf, Samuel Williams, "An Auto-tuning Framework for Parallel Multicore Stencil Computations", International Parallel and Distributed Processing Symposium (IPDPS), April 20, 2010,

A. Uselton, M. Howison, N. Wright, D. Skinner, N. Keen, J. Shalf, K. Karavanic, L. Oliker, "Parallel I/O Performance: From Events to Ensembles", International Parallel & Distributed Processing Symposium (IPDPS), 2010,

Kamesh Madduri, Samuel Williams, Stephane Ethier, Leonid Oliker, John Shalf, Erich Strohmaier, Katherine Yelick, "Memory-Efficient Optimization of Gyrokinetic Particle-to-Grid Interpolation for Multicore Processors", Supercomputing (SC), November 2009,

J. Shalf, M. Wehner, L. Oliker, "The Challenge of Energy-Efficient HPC", SCIDAC Review, Fall, 2009,

M. Mohiyuddin, M. Murphy, L. Oliker, J. Shalf, J. Wawrzynek, S. Williams, "A Design Methodology for Domain-Optimized Power-Efficient Supercomputing", Supercomputing (SC), 2009,

Shoaib Kamil, Cy Chan, Samuel Williams, Leonid Oliker, John Shalf, Mark Howison, E. Wes Bethel, Prabhat, "A Generalized Framework for Auto-tuning Stencil Computations", Cray User Group Conference (CUG) (Best Paper Award), Atlanta, GA, May 4, 2009, LBNL 2078E,

S. Williams, J. Carter, L. Oliker, J. Shalf, K. Yelick, "Resource-Efficient, Hierarchical Auto-Tuning of a Hybrid Lattice Boltzmann Computation on the Cray XT4", Proceedings of the Cray User Group (CUG), Atlanta, GA, 2009,

J. Gebis, L. Oliker, J. Shalf, S. Williams, K. Yelick, "Improving Memory Subsystem Performance using ViVA: Virtual Vector Architecture", Architecture of Computing Systems (ARCS), Delft, Netherlands, 2009,

K. Datta, S. Williams, V. Volkov, J. Carter, L. Oliker, J. Shalf, K. Yelick, "Auto-Tuning the 27-point Stencil for Multicore", Proceedings of Fourth International Workshop on Automatic Performance Tuning (iWAPT2009), January 2009,

G. Hendry, S. Kamil, A. Biberman, J. Chan, B. Lee, M. Mohiyuddin, A. Jain, K. Bergman, L. Carloni, J. Kubiatowicz, L. Oliker, J. Shalf, "Analysis of Photonic Networks for a Chip Multiprocessor Using Scientific Applications", International Symposium on Networks-on-Chip (NOCS), 2009,

G. Hendry, S.A. Kamil, A. Biberman, J. Chan, B.G. Lee, M. Mohiyuddin, A. Jain, K. Bergman, L.P. Carloni, J. Kubiatowicz, L. Oliker, J. Shalf, "Analysis of Photonic Networks for Chip Multiprocessor Using Scientific Applications", NOCS2009, 2009,

B.V. Straalen, J. Shalf, T. Ligocki, N. Keen, and W. Yang, "Scalability Challenges for Massively Parallel AMR Application", 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009., 2009,

K. Datta, M. Murphy, V. Volkov, S. Williams, J. Carter, L. Oliker, D. Patterson, J. Shalf, K. Yelick, "Stencil Computation Optimization and Autotuning on State-of-the-Art Multicore Architectures", Supercomputing (SC), November 2008,

S. Williams, J. Carter, L. Oliker, J. Shalf, K. Yelick, "Lattice Boltzmann Simulation Optimization on Leading Multicore Platforms", IEEE International Symposium on Parallel and Distributed Processing, 2008. IPDPS 2008., 2008,

Shoaib Kamil, John Shalf, Erich Strohmaier, "Power Efficiency in High Performance Computing", IPDPS, 2008,

H. Shan, K. Antypas, J. Shalf, "Characterizing and Predicting the I/O Performance of HPC Applications Using a Parameterized Synthetic Benchmark", IEEE, 2008,

B.V. Straalen, J. Shalf, T. Ligocki, N. Keen, W. Yang, "Scalability Challenges for Massively Parallel AMR Applications", IPDPS2009, 2008,

William T.C. Kramer, John M. Shalf, E. Wes Bethel, D. Agarwal, Michael Banda, John Hules, Juan C. Meza, Leonid Oliker, Horst Simon, David Skinner, Francesca Verdier, Howard Walter, Michael Wehner, and Katherine Yelick, "HPC in 2016: A View Point from NERSC", Proceedings of the Cray User Group Conference, Helsinki, Finland, 2008,

S. Williams, L. Oliker, R. Vuduc, J. Shalf, K. Yelick, J. Demmel, "Optimization of Sparse Matrix-Vector Multiplication on Emerging Multicore Platforms", Supercomputing (SC), November 2007,

J. Borrill, L. Oliker. J. Shalf, H. Shan, "Investigation Of Leading HPC I/O Performance Using A Scientific-Application Derived Benchmark", Supercomputing (SC), 2007,

S. Kamil, A. Pinar, D. Gunter, M. Lijewski, L. Oliker, J. Shalf, "Reconfigurable Hybrid Interconnection for Static and Dynamic Scientific Applications", ACM, International Conference on Computing Frontiers, 2007, LBNL 60060,

L. Oliker, A. Canning, J. Carter, C. Iancu, M. Lijewski, S. Kamil, J. Shalf, H. Shan, E. Strohmaier, S. Ethier, T. Goodale, "Scientific Application Performance on Candidate PetaScale Platforms", IEEE International Parallel and Distributed Processing Symposium, 2007. IPDPS 2007., 2007, 1--12, doi: 10.1109/IPDPS.2007.370259

L. Oliker, J. Shalf, M. Wehner, "Climate Modeling at the Petaflop Scale using Semi-Custom Computing", SIAM Conference on Computational Science and Engineering, 2007,

J. Carter, L. Oliker, J. Shalf, "Performance Evaluation of Scientific Applications on Modern Parallel Vector Systems", Extended Version: Lecture Notes in Computer Science, 2007,

J. Carter, Y. He, J. Shalf, H. Shan, E. Strohmaier, H. Wasserman, "The Performance Effect of Multi-core on Scientific Applications", Proceedings of Cray User Group, 2007, LBNL 62662,

Hongzhang Shan and John Shalf, "Using IOR to Analyze the I/O performance for HPC Platforms", CUG.org, 2007, LBNL 62647,

S. Kamil, K. Datta, S. Williams, L. Oliker, J. Shalf, K. Yelick, "Implicit and Explicit Optimizations for Stencil Computations", Memory Systems Performance and Correctness (MSPC), 2006,

S. Williams, J. Shalf, L. Oliker, P. Husbands, S. Kamil, K. Yelick, "The Potential of the Cell Processor for Scientific Computing", ACM International Conference on Computing Frontiers, 2006,

J. Carter, L. Oliker, J. Shalf, "Performance Evaluation of Scientific Applications on Modern Parallel Vector Systems", VECPAR, 2006,

J. Carter, L. Oliker, J. Shalf, "Performance Evaluation of Scientific Applications on Modern Parallel Vector Systems", High Performance Computing for Computational Science., 2006,

Highest Ranked Conference Paper

Luke Gosink, John Shalf, Kurt Stockinger, Wu, Wes Bethel, "HDF5-FastQuery: Accelerating Complex Queries on Datasets using Fast Bitmap Indices", SSDBM 2006, Vienna, Austria, July 2006, IEEE Computer Society Press., 2006, 149--158,

J. Shalf, S. Kamil, L. Oliker, D. Skinner, "Analyzing Ultra-Scale Application Communication Requirements for a Reconfigurable Hybrid Interconnect", Supercomputing (SC), 2005,

S. Kamil, J. Shalf, L. Oliker, D. Skinner, "Understanding Ultra-Scale Application Communication Requirements", IEEE International Symposium on Workload Characterization (IISWC), 2005,

S. Kamil, P. Husbands, L. Oliker, J. Shalf, K. Yelick, "Impact of Modern Memory Subsystems on Cache Optimizations for Stencil Computations", 3rd Annual ACM SIGPLAN Workshop on Memory Systems Performance (MSP), 2005,

Horst Simon, William Kramer, William Saphir, John Shalf, David Bailey, Leonid Oliker, Michael Banda, C. William McCurdy, John Hules, Andrew Canning, Marc Day, Philip Colella, David Serafini, Michael Wehner, Peter Nugent, "Science-Driven System Architecture: A New Process for Leadership Class Computing", Journal of the Earth Simulator, Volume 2., 2005, LBNL 56545,

Kurt Stockinger, John Shalf, Wes Bethel, Wu, "Query-Driven Visualization of Large Data Sets", IEEE Visualization 2005, Minneapolis, MN, October 2005, 2005, 22,

Leonid Oliker, Andrew Canning, Jonathan Carter, John Shalf, Stephane Ethier, "Scientific Computations on Modern Parallel Vector Systems", SC 04, Washington, DC, USA, IEEE Computer Society, 2004, 10, doi: 10.1109/SC.2004.54

G. Griem, L. Oliker, J. Shalf, K. Yelick, "Identifying Performance Bottlenecks on Modern Microarchitectures using an Adaptable Probe", 3rd International Workshop on Performance Modeling, Evaluation, and Optimization of Parallel and Distributed Systems (PMEO-PDS), 2004,

Leonid Oliker, Andrew Canning, Jonathan Carter, John Shalf, David Skinner, Stephane Ethier, Rupak Biswas, Jahed Djomehri, Rob Van der Wijngaart, "Evaluation of Cache-based Superscalar and Cacheless Vector Architectures for Scientific Computations", SC 03, New York, NY, USA, ACM, 2003, 38, doi: 10.1145/1048935.1050213

Book Chapters

John Shalf, David Donofrio, Chris Rowen, Leonid Oliker, Michael Wehner, "Green Flash: Climate Machine (LBNL)", The Encyclopedia of Parallel Computing, (Springer: 2011)

Green Flash is a research project focused on an application-driven manycore chip design that leverages commodity-embedded circuit designs and hardware/software codesign processes to create a highly programmable and energy-efficient HPC design. The project demonstrates how a multidisciplinary hardware/software codesign process that facilitates close interactions between applications scientists, computer scientists, and hardware engineers can be used to develop a system tailored for the requirements of scientific computing.

K. Datta, S. Williams, V. Volkov, J. Carter, L. Oliker, J. Shalf, K. Yelick, "Auto-tuning Stencil Computations on Multicore and Accelerators", chapter in Scientific Computing with Multicore and Accelerators, edited by Jack Dongarra, David A. Bader, (2010)

K. Datta, S. Williams, V. Volkov, J. Carter, L. Oliker, J. Shalf, K. Yelick, "Auto-Tuning Memory-Intensive Kernels for Multicore", Chapman & Hall/CRC Computational Science, edited by D. H. Bailey, R. F. Lucas, S. W. Williams, (CRC Press: 2010) Pages: 219

L. Oliker, A. Canning, J. Carter, C. Iancu, M. Lijewski, S. Kamil, J. Shalf, H. Shan, E. Strohmaier, S. Ethier, T. Goodale, "Performance Characteristics of Potential Petascale Scientific Applications", Petascale Computing: Algorithms and Applications. Chapman & Hall/CRC Computational Science Series (Hardcover), edited by David A. Bader, (2007)

Chapter

Presentation/Talks

John Shalf, Erik Schnetter, Gabrielle Allen, Edward Seidel, Cactus and the Role of Frameworks in Complex Multiphysics HPC Applications, 2009,

John Shalf, Auto-Tuning: The Big Questions (Panel), 2009,

John Shalf, David Donofrio, Green Flash: Extreme Scale Computing on a Petascale Budget, 2009,

John Shalf, Challenges of Energy Efficient Scientific Computing, 2009,

John Shalf, Harvey Wasserman, Breakthrough Computing in Petascale Applications and Petascale System Examples at NERSC, 2009,

John Shalf, Satoshi Matsuoka, IESP Power Efficiency Research Priorities, 2009,

S. Williams, et al., The Roofline Model: A Pedagogical Tool for Auto-tuning Kernels on Multicore Architectures, Hot Chips 20, August 10, 2008,

John Shalf, Hongzhang Shan, Katie Antypas, I/O Requirements for HPC Applications, 2008,

John Shalf, NERSC User IO Cases, 2008,

Antypas, K., Shalf, J., and Wasserman, H., Recent Workload Characterization Activities at NERSC, 2008,

John Shalf, Neuroinformatics Congress: Future Hardware Challenges for Scientific Computing, 2008,

M. Wehner, L. Oliker, J. Shalf, Ultra-Efficient Exascale Scientific Computing, 2008,

John Shalf, Landscape of Computing Architecture: Introduction to the "Berkeley View", 2007,

John Shalf, About Memory Bandwidth and Multicore, 2007,

John Shalf, The Landscape of Parallel Computing Architecture., 2007,

John Shalf, Overturning the Conventional Wisdom for the Multicore Era: Everything You Know is Wrong, 2007,

John Shalf, Hongzhang Shan, User Perspective on HPC I/O Requirements., 2007,

John Shalf, NERSC Workload Analysis, 2007,

John Shalf, NERSC Power Efficiency Analysis., 2007,

John Shalf, Memory Subsystem Performance and QuadCore Predictions, 2007,

John Shalf, Shoaib Kamil, David Skinner, Leonid Oliker, Interconnect Requirements for HPC Applications, 2007,

John Shalf, Shoaib Kamil, David Bailey, Erich Strohmaier, Power Efficiency and the Top500, 2007,

John Shalf, Power, Cooling, and Energy Consumption for the Petascale and Beyond., 2007,

John Shalf, Petascale Computing Application Challenges., 2007,

Leonid Oliker, Julian Borrill, Hongzhang Shan, John Shalf, Investigation Of Leading HPC I/O Performance Using A Scientific-Application Derived Benchmark., 2007,

Shoaib Kamil, John Shalf, Power Efficiency Metrics for the Top500, 2007,

John Shalf, David Bailey, Top500 Power Efficiency, 2006,

Reports

M. Christen, N. Keen, T. Ligocki, L. Oliker, J. Shalf, B. van Straalen, S. Williams, "Automatic Thread-Level Parallelization in the Chombo AMR Library", LBNL Technical Report, 2011,

S. Amarasinghe, D. Campbell, W. Carlson, A. Chien, W. Dally, E. Elnozahy, M. Hall, R. Harrison, W. Harrod, K. Hill, J. Hiller, S. Karp, C. Koelbel, D. Koester, P. Kogge, J. Levesque, D. Reed, V. Sarkar, R. Schreiber, M. Richards, A. Scarpelli, J. Shalf, A. Snavely, T. Sterling, "ExaScale Software Study: Software Challenges in Extreme Scale Systems", 2009,

John Shalf, Thomas Sterling, "Operating Systems For Exascale Computing", 2009,

Gabrielle Allen (LSU/CCT), Gene Allen (MSC Inc.), Kenneth Alvin (SNL), Matt Drahzal (IBM), David Fisher (DoD-Mod), Robert Graybill (USC/ISI), Bob Lucas (USC/ISI), Tim Mattson (Intel), Hal Morgan (SNL), Erik Schnetter (LSU/CCT), Brian Schott (USC/ISI), Edward Seidel (LSU/CCT), John Shalf (LBNL/NERSC), Shawn Shamsian (MSC Inc.), David Skinner (LBNL/NERSC), Siu Tong (Engeneous) (2008), "Frameworks for Multiphysics Simulation: HPC Application Software Consortium Summit Concept Paper.", 2008,

Antypas, K., Shalf, J., and Wasserman, H., "NERSC-6 Workload Analysis and Benchmark Selection Process", 2008, LBNL 1014E,

J. Levesque, J. Larkin, M. Foster, J. Glenski, G. Geissler, S. Whalen, B. Waldecker, J. Carter, D. Skinner, Y. He, H. Wasserman, J. Shalf, H. Shan, E. Strohmaier, "Understanding and Mitigating Multicore Performance Issues on the AMD Opteron Architecture", 2007, LBNL 62500,

Shoaib Kamil, John Shalf, "Measuring Power Efficiency of NERSC's Newest Flagship Machine", 2007,

K. Asanovic, R. Bodik, B. Catanzaro, J. Gebis, P. Husbands, K. Keutzer, D. Patterson, W. Plishker, J. Shalf, S. Williams, K. Yelick, "The Landscape of Parallel Computing Research: A View from Berkeley", EECS Technical Report, December 2006,

Hongzhang Shan, John Shalf, "Analysis of Parallel IO on Modern HPC Platforms", 2006,

  • Download File: IOR.doc (doc: 399 KB)

Analysis of the parallel IO requirements from a number of HPC applications, combined with microbenchmarks to aid in understanding their performance.

W. Kramer, J. Carter, D. Skinner, L. Oliker, P. Husbands, P. Hargrove, J. Shalf, O. Marques, E. Ng, A. Drummond, K. Yelick, "Software Roadmap to Plug and Play Petaflop/s", 2006,

Horst Simon, William Kramer, William Saphir, John Shalf, David Bailey, Leonid Oliker, Michael Banda, C. William McCurdy, John Hules, Andrew Canning, Marc Day, Philip Colella, David Serafini, Michael Wehner and Peter Nugent, "Science-Driven System Architecture: A New Process for Leadership Class Computing", J. of the Earth Simulator, Vol.2 (2005), pp. 2-10, October 2005, LBNL 56465,

S. Williams, J. Shalf, L. Oliker, P. Husbands, K. Yelick, "Dense and Sparse Matrix Operations on the Cell Processor", LBNL Technical Report, 2005,

Ryne, R., Abell, D., Adelmann, A., Amundson, J., Bohn, C., Cary, J., Colella, P., Dechow, D., Decyk, V., Dragt, A., Gerber, R., Habib, S., Higdon, D., Katsouleas, T., Ta, K.L., McCorquodale, P., Mihalcea, D., Mitchell, C., Mori, W., Mottershead, C.T., Neri, F., Pogorelov, I., Qiang, J., Samulyak, R., Serafini, D., Shalf, J., Siegerist, C., Spentzouris, P., Stoltz, P., Terzic, B., Venturini, M., Walstrom, P., "SciDAC Advances and Applications in Computational Beam Dynamics", June 2005, LBNL 58243,

Simon, H., Kramer, W., Saphir, W., Shalf, J., Bailey, D., Oliker, L., Banda, M., McCurdy, C.W., Hules, J., Canning, A., Day, M., Colella, P., Serafini, D., Wehner, M., Nugent, P., "National Facility for Advanced Computational Science: A Sustainable Path to Scientific Discovery", April 2004, LBNL 5500,

Posters

S. Williams, J. Carter, J. Demmel, L. Oliker, D. Patterson, J. Shalf, K. Yelick, R. Vuduc, "Autotuning Scientific Kernels on Multicore Systems", ASCR PI Meeting, 2008,

Others

Didem Unat, Cy Chan, Weiqun Zhang, John Bell and John Shalf, Tiling as a Durable Abstraction for Parallelism and Data Locality, Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing, November 18, 2013,