For Deep Learning performance, please go here.


Modern HPC data centers are key to solving some of the world’s most important scientific and engineering challenges. The NVIDIA Data Center GPUs fundamentally change the economics of the data center, delivering breakthrough performance with dramatically fewer servers, less power consumption, and reduced networking overhead, resulting in total cost savings of 5X-10X.

The number of CPU-only servers replaced by a single GPU-accelerated server is called the node replacement factor (NRF). To arrive at NRF, we measure application performance with up to 8 CPU-only servers. Then we use linear scaling to scale beyond 8 servers to calculate the NRF. The NRF will vary by application.


Detailed GB200 application performance data is located below in alphabetical order.

AMBER

Molecular Dynamics

Suite of programs to simulate molecular dynamics on biomolecule

VERSION

24-AT_24

ACCELERATED FEATURES

  • PMEMD Explicit Solvent and GB Implicit Solvent

SCALABILITY

Multi-GPU and Single Node

MORE INFORMATION

http://ambermd.org/GPUSupport.php

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x GB2004x GB200
AMBER [PME-Cellulose_NPT_4fs]ns/dayDC-Cellulose_NPTyes11.718751,763
AMBER [PME-Cellulose_NPT_4fs]NRFDC-Cellulose_NPTyes1x75x151x
AMBER [PME-Cellulose_NVE_4fs]ns/dayDC-Cellulose_NVEyes11.698871,780
AMBER [PME-Cellulose_NVE_4fs]NRFDC-Cellulose_NVEyes1x76x152x
AMBER [PME-FactorIX_NPT_4fs]ns/dayDC-FactorIX_NPTyes93.363,7767,557
AMBER [PME-FactorIX_NPT_4fs]NRFDC-FactorIX_NPTyes1x40x81x
AMBER [PME-FactorIX_NVE_4fs]ns/dayDC-FactorIX_NVEyes99.503,8537,767
AMBER [PME-FactorIX_NVE_4fs]NRFDC-FactorIX_NVEyes1x39x78x
AMBER [PME-JAC_NPT_4fs]ns/dayDC-JAC_NPTyes377.0411,32021,833
AMBER [PME-JAC_NPT_4fs]NRFDC-JAC_NPTyes1x30x58x
AMBER [PME-JAC_NVE_4fs]ns/dayDC-JAC_NVEyes397.0411,92822,814
AMBER [PME-JAC_NVE_4fs]NRFDC-JAC_NVEyes1x30x57x
AMBER [PME-STMV_NPT_4fs]ns/dayDC-STMV_NPTyes3.69236472
AMBER [PME-STMV_NPT_4fs]NRFDC-STMV_NPTyes1x64x128x
AMBER [FEP-GTI_Complex 1fs]ns/dayFEP-GTI_Complexyes25.07386772
AMBER [FEP-GTI_Complex 1fs]NRFFEP-GTI_Complexyes1x15x31x

AMBER is measured by running multiple independent instances using MPS


Chroma

Physics

Lattice Quantum Chromodynamics (LQCD)

VERSION

V2025.09

ACCELERATED FEATURES

  • Wilson-clover fermions, Krylov solvers, Domain-decomposition
ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x GB2004x GB200
ChromaFinal Timestep Time (Sec)HMC Mediumno10,0377249
ChromaNRFHMC Mediumyes1x142x211x

FUN3D

Engineering

Suite of tools actively developed at NASA for Aeronautics and Space Technology by modeling fluid flow

VERSION

14.2

ACCELERATED FEATURES

  • Full range of Mach number regimes for the Reynolds-averaged Navier Stokes (RANS) formulation

SCALABILITY

Multi-GPU and Single-Node

MORE INFORMATION

https://fun3d.larc.nasa.gov

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x GB2004x GB200
Fun3D [waverider-5M]Loop Time (Sec)waverider-5Mno1701410
Fun3D [waverider-5M]NRFwaverider-5Myes1x19x27x
Fun3D [waverider-5M w/chemistry]Loop Time (Sec)waverider-5M w/chemistryno4383923
Fun3D [waverider-5M w/chemistry]NRFwaverider-5M w/chemistryyes1x17x28x
Fun3D [waverider-20M w/chemistry]Loop Time (Sec)waverider-20M w/chemistryno1,92715479
Fun3D [waverider-20M w/chemistry]NRFwaverider-20M w/chemistryyes1x18x36x

GROMACS

Molecular Dynamics

Simulation of biochemical molecules with complicated bond interactions

VERSION

2025.3

ACCELERATED FEATURES

  • Implicit (5x), Explicit (2x) Solvent
ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x GB2004x GB200
GROMACS [Cellulose]ns/dayCelluloseyes79347575
GROMACS [Cellulose]NRFCelluloseyes1x4x7x
GROMACS [STMV]ns/daySTMVyes20100145
GROMACS [STMV]NRFSTMVyes1x6x10x

GTC

Physics

GTC is used for Gyrokinetic Particle Simulation of Turbulent Transport in Burning Plasmas

VERSION

V4.5 updated

ACCELERATED FEATURES

  • Push, shift, and collision

SCALABILITY

Multi-GPU and Multi-Node

MORE INFORMATION

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x GB2004x GB200
GTCMpush/Secmpi#proc.inyes1461,8393,581
GTCNRFmpi#proc.inyes1x13x26x

LAMMPS

Molecular Dynamics

Classical molecular dynamics package

VERSION

stable_22July2025_update1

ACCELERATED FEATURES

  • Lennard-Jones, Gay-Berne, Tersoff, many more potentials
ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x GB2004x GB200
LAMMPS [LJ 2.5]ATOM-Time Steps/sLJ 2.5yes3.83E+083.44E+095.63E+09
LAMMPS [LJ 2.5]NRFLJ 2.5yes1x9x15x
LAMMPS [EAM]ATOM-Time Steps/sEAMyes1.43E+081.32E+092.17E+09
LAMMPS [EAM]NRFEAMyes1x10x16x
LAMMPS [ReaxFF/C]ATOM-Time Steps/sReaxFF/Cyes1.90E+062.37E+073.41E+07
LAMMPS [ReaxFF/C]NRFReaxFF/Cyes1x19x27x
LAMMPS [Rhodopsin]ATOM-Time Steps/sRhodopsinyes1.50E+076.66E+071.00E+08
LAMMPS [Rhodopsin]NRFRhodopsinyes1x5x10x
LAMMPS [SNAP]ATOM-Time Steps/sSNAPyes8.40E+051.54E+073.00E+07
LAMMPS [SNAP]NRFSNAPyes1x21x41x
LAMMPS [Tersoff]ATOM-Time Steps/sTersoffyes7.67E+072.39E+094.09E+09
LAMMPS [Tersoff]NRFTersoffyes1x37x63x

MILC

Physics

Lattice Quantum Chromodynamics (LQCD) codes simulate how elemental particles are formed and bound by the “strong force” to create larger particles like protons and neutrons

VERSION

develop_f803f4bf

ACCELERATED FEATURES

  • Staggered fermions, Krylov solvers, Gauge-link fattening

SCALABILITY

Multi-GPU and Multi-Node

MORE INFORMATION

https://ngc.nvidia.com/catalog/containers/hpc:milc

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x GB2004x GB200
MILCTotal Time (sec)Apex Mediumno13,918338229
MILCNRFApex Mediumyes1x37x54x

NAMD

Molecular Dynamics

Designed for high-performance simulation of large molecular systems

VERSION

3.1.a3

ACCELERATED FEATURES

  • Full electrostatics with PME and most simulation features

SCALABILITY

Up to 100M atom capable, multi-GPU, single node

MORE INFORMATION

http://www.ks.uiuc.edu/Research/namd/

https://ngc.nvidia.com/catalog/containers/hpc:namd

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x GB2004x GB200
NAMD [apoa1_npt_cuda]ns/dayapoa1_npt_cudayes100.721,2332,437
NAMD [apoa1_npt_cuda]NRFapoa1_npt_cudayes1x12x24x
NAMD [LaINDY ColVars]ns/dayLaINDY ColVarsyes42.90212447
NAMD [LaINDY ColVars]NRFLaINDY ColVarsyes1x5x10x
NAMD [apoa1_nve_cuda]ns/dayapoa1_nve_cudayes108.799741,928
NAMD [apoa1_nve_cuda]NRFapoa1_nve_cudayes1x9x18x
NAMD [stmv_npt_cuda]ns/daystmv_npt_cudayes10.5363125
NAMD [stmv_npt_cuda]NRFstmv_npt_cudayes1x6x12x
NAMD [COVID-19 Spike Assembly]ns/dayCOVID-19 Spike Assemblyyes0.79815
NAMD [COVID-19 Spike Assembly]NRFCOVID-19 Spike Assemblyyes1x10x18x
NAMD [stmv_nve_cuda]ns/daystmv_nve_cudayes10.8779158
NAMD [stmv_nve_cuda]NRFstmv_nve_cudayes1x7x15x

NAMD is measured by running multiple independent instances using MPS
Trifan A, Gorgun D, Salim M, et al. Intelligent resolution: Integrating Cryo-EM with AI-driven multi-resolution simulations to observe the severe acute respiratory syndrome coronavirus-2 replication-transcription machinery in action. The International Journal of High Performance Computing Applications. 2022;36(5-6):603-623. doi:10.1177/10943420221113513
D. B. Sauer, N. Trebesch, J. J. Marden, N. Cocco, J. Song, A. Koide, S. Koide, E. Tajkhorshid, and D.-N. Wang. "Structural basis for the reaction cycle of DASS dicarboxylate transporters." eLife. 9, e61350 (2020). https://doi.org/10.7554/eLife.61350


RTM

Geoscience

Reverse time migration (RTM) modeling is a critical component in the seismic processing workflow of oil and gas exploration

VERSION

nvidia_2025_11

ACCELERATED FEATURES

  • Batch algorithm

SCALABILITY

Multi-GPU and Multi-Node

MORE INFORMATION

http://www.tsunamidevelopment.com/assets/rtm.pdf

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x GB2004x GB200
RTM [Isotropic Radius 4]Mcell/sIsotropic Radius 4yes21,047530,0851,056,797
RTM [Isotropic Radius 4]NRFIsotropic Radius 4yes1x25x50x
RTM [TTI Radius 8 1-pass]Mcell/sTTI Radius 8 1-passyes7,21374,262145,532
RTM [TTI Radius 8 1-pass]NRFTTI Radius 8 1-passyes1x10x20x
RTM [TTI RX 2Pass mgpu]Mcell/sTTI RX 2Pass mgpuyes7,21379,650158,197
RTM [TTI RX 2Pass mgpu]NRFTTI RX 2Pass mgpuyes1x11x22x

SPECFEM3D

Geoscience

Simulates Seismic wave propagation

VERSION

4.1.1

ACCELERATED FEATURES

  • OpenCL and CUDA hardware accelerators, based on an automatic source-to-source transformation library

SCALABILITY

Multi-GPU and Single-Node

MORE INFORMATION

https://geodynamics.org/cig/software/specfem3d/

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x GB2004x GB200
SPECFEM3DTotal Time (Sec)four_material_simple_modelno1861710
SPECFEM3DNRFfour_material_simple_modelyes1x12x21x


Detailed B200 application performance data is located below in alphabetical order.

AMBER

Molecular Dynamics

Suite of programs to simulate molecular dynamics on biomolecule

VERSION

24-AT_24

ACCELERATED FEATURES

  • PMEMD Explicit Solvent and GB Implicit Solvent

SCALABILITY

Multi-GPU and Single Node

MORE INFORMATION

http://ambermd.org/GPUSupport.php

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x B2004x B2008x B200
AMBER [PME-Cellulose_NPT_4fs]ns/dayDC-Cellulose_NPTyes11.718281,7553,515
AMBER [PME-Cellulose_NPT_4fs]NRFDC-Cellulose_NPTyes1x71x150x300x
AMBER [PME-Cellulose_NVE_4fs]ns/dayDC-Cellulose_NVEyes11.698371,7534,141
AMBER [PME-Cellulose_NVE_4fs]NRFDC-Cellulose_NVEyes1x72x150x354x
AMBER [PME-FactorIX_NPT_4fs]ns/dayDC-FactorIX_NPTyes93.363,6087,26716,229
AMBER [PME-FactorIX_NPT_4fs]NRFDC-FactorIX_NPTyes1x39x78x174x
AMBER [PME-FactorIX_NVE_4fs]ns/dayDC-FactorIX_NVEyes99.503,6847,37915,697
AMBER [PME-FactorIX_NVE_4fs]NRFDC-FactorIX_NVEyes1x37x74x158x
AMBER [PME-JAC_NPT_4fs]ns/dayDC-JAC_NPTyes377.0411,39423,32948,181
AMBER [PME-JAC_NPT_4fs]NRFDC-JAC_NPTyes1x30x62x128x
AMBER [PME-JAC_NVE_4fs]ns/dayDC-JAC_NVEyes397.0411,83023,98047,800
AMBER [PME-JAC_NVE_4fs]NRFDC-JAC_NVEyes1x30x60x120x
AMBER [PME-STMV_NPT_4fs]ns/dayDC-STMV_NPTyes3.69216432864
AMBER [PME-STMV_NPT_4fs]NRFDC-STMV_NPTyes1x59x117x234x
AMBER [FEP-GTI_Complex 1fs]ns/dayFEP-GTI_Complexyes25.073847681,536
AMBER [FEP-GTI_Complex 1fs]NRFFEP-GTI_Complexyes1x15x31x61x

AMBER is measured by running multiple independent instances using MPS


Chroma

Physics

Lattice Quantum Chromodynamics (LQCD)

VERSION

V2025.09

ACCELERATED FEATURES

  • Wilson-clover fermions, Krylov solvers, Domain-decomposition
ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x B2004x B2008x B200
ChromaFinal Timestep Time (Sec)HMC Mediumno10,037734633
ChromaNRFHMC Mediumyes1x140x222x311x

FUN3D

Engineering

Suite of tools actively developed at NASA for Aeronautics and Space Technology by modeling fluid flow

VERSION

14.2

ACCELERATED FEATURES

  • Full range of Mach number regimes for the Reynolds-averaged Navier Stokes (RANS) formulation

SCALABILITY

Multi-GPU and Single-Node

MORE INFORMATION

https://fun3d.larc.nasa.gov

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x B2004x B2008x B200
Fun3D [waverider-5M]Loop Time (Sec)waverider-5Mno1701398
Fun3D [waverider-5M]NRFwaverider-5Myes1x20x30x32x
Fun3D [waverider-5M w/chemistry]Loop Time (Sec)waverider-5M w/chemistryno438392215
Fun3D [waverider-5M w/chemistry]NRFwaverider-5M w/chemistryyes1x16x29x44x
Fun3D [waverider-20M]Loop Time (Sec)waverider-20Mno628472516
Fun3D [waverider-20M]NRFwaverider-20Myes1x18x34x52x
Fun3D [waverider-20M w/chemistry]Loop Time (Sec)waverider-20M w/chemistryno1,9271608145
Fun3D [waverider-20M w/chemistry]NRFwaverider-20M w/chemistryyes1x18x35x63x

GROMACS

Molecular Dynamics

Simulation of biochemical molecules with complicated bond interactions

VERSION

2025.3

ACCELERATED FEATURES

  • Implicit (5x), Explicit (2x) Solvent
ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x B2004x B2008x B200
GROMACS [ADH Dodec]ns/dayADH Dodecyes3331,9972,9905,933
GROMACS [ADH Dodec]NRFADH Dodecyes1x6x9x18x
GROMACS [Cellulose]ns/dayCelluloseyes79329514598
GROMACS [Cellulose]NRFCelluloseyes1x4x7x8x
GROMACS [STMV]ns/daySTMVyes2089155247
GROMACS [STMV]NRFSTMVyes1x5x10x16x

GTC

Physics

GTC is used for Gyrokinetic Particle Simulation of Turbulent Transport in Burning Plasmas

VERSION

V4.5 updated

ACCELERATED FEATURES

  • Push, shift, and collision

SCALABILITY

Multi-GPU and Multi-Node

MORE INFORMATION

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x B2004x B2008x B200
GTCMpush/Secmpi#proc.inyes1461,8443,6787,284
GTCNRFmpi#proc.inyes1x13x27x53x

LAMMPS

Molecular Dynamics

Classical molecular dynamics package

VERSION

stable_22July2025_update1

ACCELERATED FEATURES

  • Lennard-Jones, Gay-Berne, Tersoff, many more potentials
ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x B2004x B2008x B200
LAMMPS [LJ 2.5]ATOM-Time Steps/sLJ 2.5yes3.83E+083.11E+095.40E+098.52E+09
LAMMPS [LJ 2.5]NRFLJ 2.5yes1x9x15x23x
LAMMPS [EAM]ATOM-Time Steps/sEAMyes1.43E+081.25E+092.17E+093.34E+09
LAMMPS [EAM]NRFEAMyes1x9x16x25x
LAMMPS [ReaxFF/C]ATOM-Time Steps/sReaxFF/Cyes1.90E+062.33E+073.56E+074.96E+07
LAMMPS [ReaxFF/C]NRFReaxFF/Cyes1x18x28x39x
LAMMPS [Rhodopsin]ATOM-Time Steps/sRhodopsinyes1.50E+076.34E+071.03E+081.22E+08
LAMMPS [Rhodopsin]NRFRhodopsinyes1x5x10x12x
LAMMPS [SNAP]ATOM-Time Steps/sSNAPyes8.40E+051.45E+072.89E+075.67E+07
LAMMPS [SNAP]NRFSNAPyes1x20x40x78x
LAMMPS [Tersoff]ATOM-Time Steps/sTersoffyes7.67E+072.17E+093.93E+096.52E+09
LAMMPS [Tersoff]NRFTersoffyes1x33x60x100x

MILC

Physics

Lattice Quantum Chromodynamics (LQCD) codes simulate how elemental particles are formed and bound by the “strong force” to create larger particles like protons and neutrons

VERSION

develop_f803f4bf

ACCELERATED FEATURES

  • Staggered fermions, Krylov solvers, Gauge-link fattening

SCALABILITY

Multi-GPU and Multi-Node

MORE INFORMATION

https://ngc.nvidia.com/catalog/containers/hpc:milc

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x B2004x B2008x B200
MILCTotal Time (sec)Apex Mediumno13,918399251172
MILCNRFApex Mediumyes1x31x49x72x

NAMD

Molecular Dynamics

Designed for high-performance simulation of large molecular systems

VERSION

3.1.a3

ACCELERATED FEATURES

  • Full electrostatics with PME and most simulation features

SCALABILITY

Up to 100M atom capable, multi-GPU, single node

MORE INFORMATION

http://www.ks.uiuc.edu/Research/namd/

https://ngc.nvidia.com/catalog/containers/hpc:namd

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x B2004x B2008x B200
NAMD [apoa1_npt_cuda]ns/dayapoa1_npt_cudayes100.721,1862,3174,566
NAMD [apoa1_npt_cuda]NRFapoa1_npt_cudayes1x12x23x45x
NAMD [LaINDY ColVars]ns/dayLaINDY ColVarsyes42.90204408810
NAMD [LaINDY ColVars]NRFLaINDY ColVarsyes1x5x10x19x
NAMD [apoa1_nve_cuda]ns/dayapoa1_nve_cudayes108.799201,8433,648
NAMD [apoa1_nve_cuda]NRFapoa1_nve_cudayes1x8x17x34x
NAMD [stmv_npt_cuda]ns/daystmv_npt_cudayes10.5360120240
NAMD [stmv_npt_cuda]NRFstmv_npt_cudayes1x6x11x23x
NAMD [COVID-19 Spike Assembly]ns/dayCOVID-19 Spike Assemblyyes0.7981422
NAMD [COVID-19 Spike Assembly]NRFCOVID-19 Spike Assemblyyes1x10x17x28x
NAMD [stmv_nve_cuda]ns/daystmv_nve_cudayes10.8775151302
NAMD [stmv_nve_cuda]NRFstmv_nve_cudayes1x7x14x28x

NAMD is measured by running multiple independent instances using MPS
Trifan A, Gorgun D, Salim M, et al. Intelligent resolution: Integrating Cryo-EM with AI-driven multi-resolution simulations to observe the severe acute respiratory syndrome coronavirus-2 replication-transcription machinery in action. The International Journal of High Performance Computing Applications. 2022;36(5-6):603-623. doi:10.1177/10943420221113513
D. B. Sauer, N. Trebesch, J. J. Marden, N. Cocco, J. Song, A. Koide, S. Koide, E. Tajkhorshid, and D.-N. Wang. "Structural basis for the reaction cycle of DASS dicarboxylate transporters." eLife. 9, e61350 (2020). https://doi.org/10.7554/eLife.61350


Quantum Espresso

Material Science (Quantum Chemistry)

An Open-source suite of computer codes for electronic structure calculations and materials modeling at the nanoscale

VERSION

V7.5

ACCELERATED FEATURES

  • linear algebra (matrix multiply)
  • explicit computational kernels
  • 3D FFTs

SCALABILITY

Multi-GPU and Multi-Node

MORE INFORMATION

http://www.quantum-espresso.org

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x B2004x B2008x B200
Quantum EspresssoTotal CPU Time (Sec)GRIR443no7841128146
Quantum EspresssoNRFGRIR443yes1x13x18x31x

RTM

Geoscience

Reverse time migration (RTM) modeling is a critical component in the seismic processing workflow of oil and gas exploration

VERSION

nvidia_2025_11

ACCELERATED FEATURES

  • Batch algorithm

SCALABILITY

Multi-GPU and Multi-Node

MORE INFORMATION

http://www.tsunamidevelopment.com/assets/rtm.pdf

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x B2004x B2008x B200
RTM [Isotropic Radius 4]Mcell/sIsotropic Radius 4yes21,047518,2571,036,5872,080,893
RTM [Isotropic Radius 4]NRFIsotropic Radius 4yes1x25x49x99x
RTM [TTI Radius 8 1-pass]Mcell/sTTI Radius 8 1-passyes7,21370,437140,655280,191
RTM [TTI Radius 8 1-pass]NRFTTI Radius 8 1-passyes1x10x20x39x
RTM [TTI RX 2Pass mgpu]Mcell/sTTI RX 2Pass mgpuyes7,21375,956151,108300,003
RTM [TTI RX 2Pass mgpu]NRFTTI RX 2Pass mgpuyes1x11x21x42x

SPECFEM3D

Geoscience

Simulates Seismic wave propagation

VERSION

4.1.1

ACCELERATED FEATURES

  • OpenCL and CUDA hardware accelerators, based on an automatic source-to-source transformation library

SCALABILITY

Multi-GPU and Single-Node

MORE INFORMATION

https://geodynamics.org/cig/software/specfem3d/

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x B2004x B2008x B200
SPECFEM3DTotal Time (Sec)four_material_simple_modelno18618108
SPECFEM3DNRFfour_material_simple_modelyes1x12x21x26x


Detailed RTX PRO 6000 Blackwell Server Edition application performance data is located below in alphabetical order.

AMBER

Molecular Dynamics

Suite of programs to simulate molecular dynamics on biomolecule

VERSION

24-AT_24

ACCELERATED FEATURES

  • PMEMD Explicit Solvent and GB Implicit Solvent

SCALABILITY

Multi-GPU and Single Node

MORE INFORMATION

http://ambermd.org/GPUSupport.php

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x RTX Pro 6000 BSE4x RTX Pro 6000 BSE8x RTX Pro 6000 BSE
AMBER [PME-Cellulose_NPT_4fs]ns/dayDC-Cellulose_NPTyes11.716641,2932,656
AMBER [PME-Cellulose_NPT_4fs]NRFDC-Cellulose_NPTyes1x57x110x227x
AMBER [PME-Cellulose_NVE_4fs]ns/dayDC-Cellulose_NVEyes11.696711,3312,824
AMBER [PME-Cellulose_NVE_4fs]NRFDC-Cellulose_NVEyes1x57x114x242x
AMBER [PME-FactorIX_NPT_4fs]ns/dayDC-FactorIX_NPTyes93.363,7337,30615,599
AMBER [PME-FactorIX_NPT_4fs]NRFDC-FactorIX_NPTyes1x40x78x167x
AMBER [PME-FactorIX_NVE_4fs]ns/dayDC-FactorIX_NVEyes99.503,6707,62415,813
AMBER [PME-FactorIX_NVE_4fs]NRFDC-FactorIX_NVEyes1x37x77x159x
AMBER [PME-JAC_NPT_4fs]ns/dayDC-JAC_NPTyes377.0412,13124,01846,291
AMBER [PME-JAC_NPT_4fs]NRFDC-JAC_NPTyes1x32x64x123x
AMBER [PME-JAC_NVE_4fs]ns/dayDC-JAC_NVEyes397.0412,15424,71651,855
AMBER [PME-JAC_NVE_4fs]NRFDC-JAC_NVEyes1x31x62x131x
AMBER [PME-STMV_NPT_4fs]ns/dayDC-STMV_NPTyes3.69226452904
AMBER [PME-STMV_NPT_4fs]NRFDC-STMV_NPTyes1x61x122x245x
AMBER [FEP-GTI_Complex 1fs]ns/dayFEP-GTI_Complexyes25.073927841,568
AMBER [FEP-GTI_Complex 1fs]NRFFEP-GTI_Complexyes1x16x31x63x

AMBER is measured by running multiple independent instances using MPS


Chroma

Physics

Lattice Quantum Chromodynamics (LQCD)

VERSION

V2025.09

ACCELERATED FEATURES

  • Wilson-clover fermions, Krylov solvers, Domain-decomposition
ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x RTX Pro 6000 BSE4x RTX Pro 6000 BSE8x RTX Pro 6000 BSE
ChromaFinal Timestep Time (Sec)HMC Mediumno10,03719010364
ChromaNRFHMC Mediumyes1x54x99x159x

FUN3D

Engineering

Suite of tools actively developed at NASA for Aeronautics and Space Technology by modeling fluid flow

VERSION

14.2

ACCELERATED FEATURES

  • Full range of Mach number regimes for the Reynolds-averaged Navier Stokes (RANS) formulation

SCALABILITY

Multi-GPU and Single-Node

MORE INFORMATION

https://fun3d.larc.nasa.gov

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x RTX Pro 6000 BSE4x RTX Pro 6000 BSE8x RTX Pro 6000 BSE
Fun3D [waverider-5M]Loop Time (Sec)waverider-5Mno170472616
Fun3D [waverider-5M]NRFwaverider-5Myes1x6x10x16x
Fun3D [waverider-5M w/chemistry]Loop Time (Sec)waverider-5M w/chemistryno4381417443
Fun3D [waverider-5M w/chemistry]NRFwaverider-5M w/chemistryyes1x5x9x15x
Fun3D [waverider-20M]Loop Time (Sec)waverider-20Mno628-9553
Fun3D [waverider-20M]NRFwaverider-20Myes1x-9x16x
Fun3D [waverider-20M w/chemistry]Loop Time (Sec)waverider-20M w/chemistryno1,927-310164
Fun3D [waverider-20M w/chemistry]NRFwaverider-20M w/chemistryyes1x-9x17x

GROMACS

Molecular Dynamics

Simulation of biochemical molecules with complicated bond interactions

VERSION

2025.3

ACCELERATED FEATURES

  • Implicit (5x), Explicit (2x) Solvent
ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x RTX Pro 6000 BSE4x RTX Pro 6000 BSE8x RTX Pro 6000 BSE
GROMACS [ADH Dodec]ns/dayADH Dodecyes3332,2493,1606,298
GROMACS [ADH Dodec]NRFADH Dodecyes1x7x9x19x
GROMACS [Cellulose]ns/dayCelluloseyes79329397381
GROMACS [Cellulose]NRFCelluloseyes1x4x5x5x
GROMACS [STMV]ns/daySTMVyes20108166167
GROMACS [STMV]NRFSTMVyes1x6x11x11x

GTC

Physics

GTC is used for Gyrokinetic Particle Simulation of Turbulent Transport in Burning Plasmas

VERSION

V4.5 updated

ACCELERATED FEATURES

  • Push, shift, and collision

SCALABILITY

Multi-GPU and Multi-Node

MORE INFORMATION

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x RTX Pro 6000 BSE4x RTX Pro 6000 BSE8x RTX Pro 6000 BSE
GTCMpush/Secmpi#proc.inyes1461,3862,7455,414
GTCNRFmpi#proc.inyes1x10x20x40x

LAMMPS

Molecular Dynamics

Classical molecular dynamics package

VERSION

stable_22July2025_update1

ACCELERATED FEATURES

  • Lennard-Jones, Gay-Berne, Tersoff, many more potentials
ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x RTX Pro 6000 BSE4x RTX Pro 6000 BSE8x RTX Pro 6000 BSE
LAMMPS [ReaxFF/C]ATOM-Time Steps/sReaxFF/Cyes1.90E+06-6.96E+061.12E+07
LAMMPS [ReaxFF/C]NRFReaxFF/Cyes1x-5x9x
LAMMPS [SNAP]ATOM-Time Steps/sSNAPyes8.40E+05-3.78E+067.29E+06
LAMMPS [SNAP]NRFSNAPyes1x-6x10x
LAMMPS [Tersoff]ATOM-Time Steps/sTersoffyes7.67E+073.06E+085.89E+081.11E+09
LAMMPS [Tersoff]NRFTersoffyes1x4x9x17x

MILC

Physics

Lattice Quantum Chromodynamics (LQCD) codes simulate how elemental particles are formed and bound by the “strong force” to create larger particles like protons and neutrons

VERSION

develop_f803f4bf

ACCELERATED FEATURES

  • Staggered fermions, Krylov solvers, Gauge-link fattening

SCALABILITY

Multi-GPU and Multi-Node

MORE INFORMATION

https://ngc.nvidia.com/catalog/containers/hpc:milc

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x RTX Pro 6000 BSE4x RTX Pro 6000 BSE8x RTX Pro 6000 BSE
MILCTotal Time (sec)Apex Mediumno13,9181,194635385
MILCNRFApex Mediumyes1x10x19x32x

NAMD

Molecular Dynamics

Designed for high-performance simulation of large molecular systems

VERSION

3.1.a3

ACCELERATED FEATURES

  • Full electrostatics with PME and most simulation features

SCALABILITY

Up to 100M atom capable, multi-GPU, single node

MORE INFORMATION

http://www.ks.uiuc.edu/Research/namd/

https://ngc.nvidia.com/catalog/containers/hpc:namd

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x RTX Pro 6000 BSE4x RTX Pro 6000 BSE8x RTX Pro 6000 BSE
NAMD [apoa1_npt_cuda]ns/dayapoa1_npt_cudayes100.721,1482,2534,517
NAMD [apoa1_npt_cuda]NRFapoa1_npt_cudayes1x11x22x45x
NAMD [LaINDY ColVars]ns/dayLaINDY ColVarsyes42.90186372728
NAMD [LaINDY ColVars]NRFLaINDY ColVarsyes1x4x9x17x
NAMD [apoa1_nve_cuda]ns/dayapoa1_nve_cudayes108.799641,8653,697
NAMD [apoa1_nve_cuda]NRFapoa1_nve_cudayes1x9x17x34x
NAMD [stmv_npt_cuda]ns/daystmv_npt_cudayes10.5351102204
NAMD [stmv_npt_cuda]NRFstmv_npt_cudayes1x5x10x19x
NAMD [COVID-19 Spike Assembly]ns/dayCOVID-19 Spike Assemblyyes0.7958-
NAMD [COVID-19 Spike Assembly]NRFCOVID-19 Spike Assemblyyes1x6x10x-
NAMD [stmv_nve_cuda]ns/daystmv_nve_cudayes10.8774147295
NAMD [stmv_nve_cuda]NRFstmv_nve_cudayes1x7x14x27x

NAMD is measured by running multiple independent instances using MPS
Trifan A, Gorgun D, Salim M, et al. Intelligent resolution: Integrating Cryo-EM with AI-driven multi-resolution simulations to observe the severe acute respiratory syndrome coronavirus-2 replication-transcription machinery in action. The International Journal of High Performance Computing Applications. 2022;36(5-6):603-623. doi:10.1177/10943420221113513
D. B. Sauer, N. Trebesch, J. J. Marden, N. Cocco, J. Song, A. Koide, S. Koide, E. Tajkhorshid, and D.-N. Wang. "Structural basis for the reaction cycle of DASS dicarboxylate transporters." eLife. 9, e61350 (2020). https://doi.org/10.7554/eLife.61350


RTM

Geoscience

Reverse time migration (RTM) modeling is a critical component in the seismic processing workflow of oil and gas exploration

VERSION

nvidia_2025_11

ACCELERATED FEATURES

  • Batch algorithm

SCALABILITY

Multi-GPU and Multi-Node

MORE INFORMATION

http://www.tsunamidevelopment.com/assets/rtm.pdf

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x RTX Pro 6000 BSE4x RTX Pro 6000 BSE8x RTX Pro 6000 BSE
RTM [Isotropic Radius 4]Mcell/sIsotropic Radius 4yes21,047182,716363,899727,827
RTM [Isotropic Radius 4]NRFIsotropic Radius 4yes1x9x17x35x
RTM [TTI Radius 8 1-pass]Mcell/sTTI Radius 8 1-passyes7,21361,894122,258243,087
RTM [TTI Radius 8 1-pass]NRFTTI Radius 8 1-passyes1x9x17x34x
RTM [TTI RX 2Pass mgpu]Mcell/sTTI RX 2Pass mgpuyes7,21331,98663,489126,642
RTM [TTI RX 2Pass mgpu]NRFTTI RX 2Pass mgpuyes1x4x9x18x

SPECFEM3D

Geoscience

Simulates Seismic wave propagation

VERSION

4.1.1

ACCELERATED FEATURES

  • OpenCL and CUDA hardware accelerators, based on an automatic source-to-source transformation library

SCALABILITY

Multi-GPU and Single-Node

MORE INFORMATION

https://geodynamics.org/cig/software/specfem3d/

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X (CPU-Only)2x RTX Pro 6000 BSE4x RTX Pro 6000 BSE8x RTX Pro 6000 BSE
SPECFEM3DTotal Time (Sec)four_material_simple_modelno186402112
SPECFEM3DNRFfour_material_simple_modelyes1x4x10x18x


Detailed H100 application performance data is located below in alphabetical order.

AMBER

Molecular Dynamics

Suite of programs to simulate molecular dynamics on biomolecule

VERSION

24-AT_24

ACCELERATED FEATURES

  • PMEMD Explicit Solvent and GB Implicit Solvent

SCALABILITY

Multi-GPU and Single Node

MORE INFORMATION

http://ambermd.org/GPUSupport.php

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X
(CPU-Only)
2x H100 SXM4x H100 SXM8x H100 SXM
AMBER [PME-Cellulose_NPT_4fs]ns/dayDC-Cellulose_NPTyes11.716461,2882,588
AMBER [PME-Cellulose_NPT_4fs]NRFDC-Cellulose_NPTyes1x55x110x221x
AMBER [PME-Cellulose_NVE_4fs]ns/dayDC-Cellulose_NVEyes11.696551,3122,610
AMBER [PME-Cellulose_NVE_4fs]NRFDC-Cellulose_NVEyes1x56x112x223x
AMBER [PME-FactorIX_NPT_4fs]ns/dayDC-FactorIX_NPTyes93.362,7835,55811,044
AMBER [PME-FactorIX_NPT_4fs]NRFDC-FactorIX_NPTyes1x30x60x118x
AMBER [PME-FactorIX_NVE_4fs]ns/dayDC-FactorIX_NVEyes99.502,8415,69611,316
AMBER [PME-FactorIX_NVE_4fs]NRFDC-FactorIX_NVEyes1x29x57x114x
AMBER [PME-JAC_NPT_4fs]ns/dayDC-JAC_NPTyes377.049,45718,75235,752
AMBER [PME-JAC_NPT_4fs]NRFDC-JAC_NPTyes1x25x50x95x
AMBER [PME-JAC_NVE_4fs]ns/dayDC-JAC_NVEyes397.049,69119,38337,120
AMBER [PME-JAC_NVE_4fs]NRFDC-JAC_NVEyes1x24x49x93x
AMBER [PME-STMV_NPT_4fs]ns/dayDC-STMV_NPTyes3.69188376752
AMBER [PME-STMV_NPT_4fs]NRFDC-STMV_NPTyes1x51x102x204x
AMBER [FEP-GTI_Complex 1fs]ns/dayFEP-GTI_Complexyes25.073987961,592
AMBER [FEP-GTI_Complex 1fs]NRFFEP-GTI_Complexyes1x16x32x64x

AMBER is measured by running multiple independent instances using MPS


Chroma

Physics

Lattice Quantum Chromodynamics (LQCD)

VERSION

V2025.09

ACCELERATED FEATURES

  • Wilson-clover fermions, Krylov solvers, Domain-decomposition
ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X
(CPU-Only)
2x H100 SXM4x H100 SXM8x H100 SXM
ChromaFinal Timestep Time (Sec)HMC Mediumno10,0371086441
ChromaNRFHMC Mediumyes1x94x161x250x

FUN3D

Engineering

Suite of tools actively developed at NASA for Aeronautics and Space Technology by modeling fluid flow

VERSION

14.2

ACCELERATED FEATURES

  • Full range of Mach number regimes for the Reynolds-averaged Navier Stokes (RANS) formulation

SCALABILITY

Multi-GPU and Single-Node

MORE INFORMATION

https://fun3d.larc.nasa.gov

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X
(CPU-Only)
2x H100 SXM4x H100 SXM8x H100 SXM
Fun3D [waverider-5M]Loop Time (Sec)waverider-5Mno17020129
Fun3D [waverider-5M]NRFwaverider-5Myes1x14x23x30x
Fun3D [waverider-5M w/chemistry]Loop Time (Sec)waverider-5M w/chemistryno438583219
Fun3D [waverider-5M w/chemistry]NRFwaverider-5M w/chemistryyes1x11x20x33x
Fun3D [waverider-20M]Loop Time (Sec)waverider-20Mno628-3823
Fun3D [waverider-20M]NRFwaverider-20Myes1x-22x37x
Fun3D [waverider-20M w/chemistry]Loop Time (Sec)waverider-20M w/chemistryno1,927-12465
Fun3D [waverider-20M w/chemistry]NRFwaverider-20M w/chemistryyes1x-23x43x

GROMACS

Molecular Dynamics

Simulation of biochemical molecules with complicated bond interactions

VERSION

2025.3

ACCELERATED FEATURES

  • Implicit (5x), Explicit (2x) Solvent
ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X
(CPU-Only)
2x H100 SXM4x H100 SXM8x H100 SXM
GROMACS [ADH Dodec]ns/dayADH Dodecyes3331,5522,5935,224
GROMACS [ADH Dodec]NRFADH Dodecyes1x5x8x16x
GROMACS [Cellulose]ns/dayCelluloseyes79251371429
GROMACS [Cellulose]NRFCelluloseyes1x3x5x5x
GROMACS [STMV]ns/daySTMVyes2075130195
GROMACS [STMV]NRFSTMVyes1x4x9x13x

GTC

Physics

GTC is used for Gyrokinetic Particle Simulation of Turbulent Transport in Burning Plasmas

VERSION

V4.5 updated

ACCELERATED FEATURES

  • Push, shift, and collision

SCALABILITY

Multi-GPU and Multi-Node

MORE INFORMATION

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X
(CPU-Only)
2x H100 SXM4x H100 SXM8x H100 SXM
GTCMpush/Secmpi#proc.inyes1461,4422,8435,699
GTCNRFmpi#proc.inyes1x11x21x42x

LAMMPS

Molecular Dynamics

Classical molecular dynamics package

VERSION

stable_22July2025_update1

ACCELERATED FEATURES

  • Lennard-Jones, Gay-Berne, Tersoff, many more potentials
ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X
(CPU-Only)
2x H100 SXM4x H100 SXM8x H100 SXM
LAMMPS [LJ 2.5]ATOM-Time Steps/sLJ 2.5yes3.83E+082.43E+094.21E+096.80E+09
LAMMPS [LJ 2.5]NRFLJ 2.5yes1x6x12x19x
LAMMPS [EAM]ATOM-Time Steps/sEAMyes1.43E+081.02E+091.79E+092.85E+09
LAMMPS [EAM]NRFEAMyes1x7x13x21x
LAMMPS [ReaxFF/C]ATOM-Time Steps/sReaxFF/Cyes1.90E+061.92E+073.02E+074.36E+07
LAMMPS [ReaxFF/C]NRFReaxFF/Cyes1x15x24x34x
LAMMPS [Rhodopsin]ATOM-Time Steps/sRhodopsinyes1.50E+075.59E+079.05E+071.12E+08
LAMMPS [Rhodopsin]NRFRhodopsinyes1x4x9x11x
LAMMPS [SNAP]ATOM-Time Steps/sSNAPyes8.40E+051.25E+072.32E+074.68E+07
LAMMPS [SNAP]NRFSNAPyes1x17x32x64x
LAMMPS [Tersoff]ATOM-Time Steps/sTersoffyes7.67E+071.86E+093.39E+095.43E+09
LAMMPS [Tersoff]NRFTersoffyes1x28x52x83x

MILC

Physics

Lattice Quantum Chromodynamics (LQCD) codes simulate how elemental particles are formed and bound by the “strong force” to create larger particles like protons and neutrons

VERSION

develop_f803f4bf

ACCELERATED FEATURES

  • Staggered fermions, Krylov solvers, Gauge-link fattening

SCALABILITY

Multi-GPU and Multi-Node

MORE INFORMATION

https://ngc.nvidia.com/catalog/containers/hpc:milc

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X
(CPU-Only)
2x H100 SXM4x H100 SXM8x H100 SXM
MILCTotal Time (sec)Apex Mediumno13,918662379241
MILCNRFApex Mediumyes1x19x33x51x

NAMD

Molecular Dynamics

Designed for high-performance simulation of large molecular systems

VERSION

3.1.a3

ACCELERATED FEATURES

  • Full electrostatics with PME and most simulation features

SCALABILITY

Up to 100M atom capable, multi-GPU, single node

MORE INFORMATION

http://www.ks.uiuc.edu/Research/namd/

https://ngc.nvidia.com/catalog/containers/hpc:namd

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X
(CPU-Only)
2x H100 SXM4x H100 SXM8x H100 SXM
NAMD [apoa1_npt_cuda]ns/dayapoa1_npt_cudayes100.729821,8983,803
NAMD [apoa1_npt_cuda]NRFapoa1_npt_cudayes1x10x19x38x
NAMD [LaINDY ColVars]ns/dayLaINDY ColVarsyes42.90171341677
NAMD [LaINDY ColVars]NRFLaINDY ColVarsyes1x4x8x16x
NAMD [apoa1_nve_cuda]ns/dayapoa1_nve_cudayes108.797621,4932,976
NAMD [apoa1_nve_cuda]NRFapoa1_nve_cudayes1x7x14x27x
NAMD [stmv_npt_cuda]ns/daystmv_npt_cudayes10.534897193
NAMD [stmv_npt_cuda]NRFstmv_npt_cudayes1x5x9x18x
NAMD [COVID-19 Spike Assembly]ns/dayCOVID-19 Spike Assemblyyes0.7961117
NAMD [COVID-19 Spike Assembly]NRFCOVID-19 Spike Assemblyyes1x8x14x22x
NAMD [stmv_nve_cuda]ns/daystmv_nve_cudayes10.8761121242
NAMD [stmv_nve_cuda]NRFstmv_nve_cudayes1x6x11x22x

NAMD is measured by running multiple independent instances using MPS
Trifan A, Gorgun D, Salim M, et al. Intelligent resolution: Integrating Cryo-EM with AI-driven multi-resolution simulations to observe the severe acute respiratory syndrome coronavirus-2 replication-transcription machinery in action. The International Journal of High Performance Computing Applications. 2022;36(5-6):603-623. doi:10.1177/10943420221113513
D. B. Sauer, N. Trebesch, J. J. Marden, N. Cocco, J. Song, A. Koide, S. Koide, E. Tajkhorshid, and D.-N. Wang. "Structural basis for the reaction cycle of DASS dicarboxylate transporters." eLife. 9, e61350 (2020). https://doi.org/10.7554/eLife.61350


Quantum Espresso

Material Science (Quantum Chemistry)

An Open-source suite of computer codes for electronic structure calculations and materials modeling at the nanoscale

VERSION

V7.5

ACCELERATED FEATURES

  • linear algebra (matrix multiply)
  • explicit computational kernels
  • 3D FFTs

SCALABILITY

Multi-GPU and Multi-Node

MORE INFORMATION

http://www.quantum-espresso.org

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X
(CPU-Only)
2x H100 SXM4x H100 SXM8x H100 SXM
Quantum EspresssoTotal CPU Time (Sec)GRIR443no7841409955
Quantum EspresssoNRFGRIR443yes1x10x14x26x

RTM

Geoscience

Reverse time migration (RTM) modeling is a critical component in the seismic processing workflow of oil and gas exploration

VERSION

nvidia_2025_11

ACCELERATED FEATURES

  • Batch algorithm

SCALABILITY

Multi-GPU and Multi-Node

MORE INFORMATION

http://www.tsunamidevelopment.com/assets/rtm.pdf

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X
(CPU-Only)
2x H100 SXM4x H100 SXM8x H100 SXM
RTM [Isotropic Radius 4]Mcell/sIsotropic Radius 4yes21,047305,095609,3251,219,091
RTM [Isotropic Radius 4]NRFIsotropic Radius 4yes1x14x29x58x
RTM [TTI Radius 8 1-pass]Mcell/sTTI Radius 8 1-passyes7,21362,295123,860246,919
RTM [TTI Radius 8 1-pass]NRFTTI Radius 8 1-passyes1x9x17x34x
RTM [TTI RX 2Pass mgpu]Mcell/sTTI RX 2Pass mgpuyes7,21352,428104,577207,187
RTM [TTI RX 2Pass mgpu]NRFTTI RX 2Pass mgpuyes1x7x14x29x

SPECFEM3D

Geoscience

Simulates Seismic wave propagation

VERSION

4.1.1

ACCELERATED FEATURES

  • OpenCL and CUDA hardware accelerators, based on an automatic source-to-source transformation library

SCALABILITY

Multi-GPU and Single-Node

MORE INFORMATION

https://geodynamics.org/cig/software/specfem3d/

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X
(CPU-Only)
2x H100 SXM4x H100 SXM8x H100 SXM
SPECFEM3DTotal Time (Sec)four_material_simple_modelno186271510
SPECFEM3DNRFfour_material_simple_modelyes1x6x14x22x


Detailed L40S application performance data is located below in alphabetical order.

AMBER

Molecular Dynamics

Suite of programs to simulate molecular dynamics on biomolecule

VERSION

24-AT_24

ACCELERATED FEATURES

  • PMEMD Explicit Solvent and GB Implicit Solvent

SCALABILITY

Multi-GPU and Single Node

MORE INFORMATION

http://ambermd.org/GPUSupport.php

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X
(CPU-Only)
2x L40S4x L40S8x L40S
AMBER [PME-Cellulose_NPT_4fs]ns/dayDC-Cellulose_NPTyes11.713637381,538
AMBER [PME-Cellulose_NPT_4fs]NRFDC-Cellulose_NPTyes1x31x63x131x
AMBER [PME-Cellulose_NVE_4fs]ns/dayDC-Cellulose_NVEyes11.693707601,604
AMBER [PME-Cellulose_NVE_4fs]NRFDC-Cellulose_NVEyes1x32x65x137x
AMBER [PME-FactorIX_NPT_4fs]ns/dayDC-FactorIX_NPTyes93.361,9854,0968,788
AMBER [PME-FactorIX_NPT_4fs]NRFDC-FactorIX_NPTyes1x21x44x94x
AMBER [PME-FactorIX_NVE_4fs]ns/dayDC-FactorIX_NVEyes99.502,0684,1829,170
AMBER [PME-FactorIX_NVE_4fs]NRFDC-FactorIX_NVEyes1x21x42x92x
AMBER [PME-JAC_NPT_4fs]ns/dayDC-JAC_NPTyes377.048,56517,37737,046
AMBER [PME-JAC_NPT_4fs]NRFDC-JAC_NPTyes1x23x46x98x
AMBER [PME-JAC_NVE_4fs]ns/dayDC-JAC_NVEyes397.048,69718,452-
AMBER [PME-JAC_NVE_4fs]NRFDC-JAC_NVEyes1x22x46x-
AMBER [PME-STMV_NPT_4fs]ns/dayDC-STMV_NPTyes3.69150300600
AMBER [PME-STMV_NPT_4fs]NRFDC-STMV_NPTyes1x41x81x163x
AMBER [FEP-GTI_Complex 1fs]ns/dayFEP-GTI_Complexyes25.073867721,544
AMBER [FEP-GTI_Complex 1fs]NRFFEP-GTI_Complexyes1x15x31x62x

AMBER is measured by running multiple independent instances using MPS


Chroma

Physics

Lattice Quantum Chromodynamics (LQCD)

VERSION

V2025.09

ACCELERATED FEATURES

  • Wilson-clover fermions, Krylov solvers, Domain-decomposition
ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X
(CPU-Only)
2x L40S4x L40S8x L40S
ChromaFinal Timestep Time (Sec)HMC Mediumno10,03736819296
ChromaNRFHMC Mediumyes1x28x53x106x

FUN3D

Engineering

Suite of tools actively developed at NASA for Aeronautics and Space Technology by modeling fluid flow

VERSION

14.2

ACCELERATED FEATURES

  • Full range of Mach number regimes for the Reynolds-averaged Navier Stokes (RANS) formulation

SCALABILITY

Multi-GPU and Single-Node

MORE INFORMATION

https://fun3d.larc.nasa.gov

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X
(CPU-Only)
2x L40S4x L40S8x L40S
Fun3D [waverider-5M]Loop Time (Sec)waverider-5Mno170834325
Fun3D [waverider-5M]NRFwaverider-5Myes1x3x6x11x
Fun3D [waverider-5M w/chemistry]Loop Time (Sec)waverider-5M w/chemistryno43821010962
Fun3D [waverider-5M w/chemistry]NRFwaverider-5M w/chemistryyes1x2x6x10x
Fun3D [waverider-20M]Loop Time (Sec)waverider-20Mno628-17192
Fun3D [waverider-20M]NRFwaverider-20Myes1x-4x9x
Fun3D [waverider-20M w/chemistry]Loop Time (Sec)waverider-20M w/chemistryno1,927--246
Fun3D [waverider-20M w/chemistry]NRFwaverider-20M w/chemistryyes1x--11x

GROMACS

Molecular Dynamics

Simulation of biochemical molecules with complicated bond interactions

VERSION

2025.3

ACCELERATED FEATURES

  • Implicit (5x), Explicit (2x) Solvent
ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X
(CPU-Only)
2x L40S4x L40S8x L40S
GROMACS [ADH Dodec]ns/dayADH Dodecyes3331,3732,7815,430
GROMACS [ADH Dodec]NRFADH Dodecyes1x4x8x16x
GROMACS [Cellulose]ns/dayCelluloseyes79223223223
GROMACS [Cellulose]NRFCelluloseyes1x3x3x3x
GROMACS [STMV]ns/daySTMVyes2073113114
GROMACS [STMV]NRFSTMVyes1x4x7x7x

GTC

Physics

GTC is used for Gyrokinetic Particle Simulation of Turbulent Transport in Burning Plasmas

VERSION

V4.5 updated

ACCELERATED FEATURES

  • Push, shift, and collision

SCALABILITY

Multi-GPU and Multi-Node

MORE INFORMATION

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X
(CPU-Only)
2x L40S4x L40S8x L40S
GTCMpush/Secmpi#proc.inyes1468021,5953,154
GTCNRFmpi#proc.inyes1x6x12x23x

LAMMPS

Molecular Dynamics

Classical molecular dynamics package

VERSION

stable_22July2025_update1

ACCELERATED FEATURES

  • Lennard-Jones, Gay-Berne, Tersoff, many more potentials
ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X
(CPU-Only)
4x L40S8x L40S
LAMMPS [SNAP]ATOM-Time Steps/sSNAPyes8.40E+052.84E+065.67E+06
LAMMPS [SNAP]NRFSNAPyes1x4x9x
LAMMPS [Tersoff]ATOM-Time Steps/sTersoffyes7.67E+074.91E+089.48E+08
LAMMPS [Tersoff]NRFTersoffyes1x7x15x

MILC

Physics

Lattice Quantum Chromodynamics (LQCD) codes simulate how elemental particles are formed and bound by the “strong force” to create larger particles like protons and neutrons

VERSION

develop_f803f4bf

ACCELERATED FEATURES

  • Staggered fermions, Krylov solvers, Gauge-link fattening

SCALABILITY

Multi-GPU and Multi-Node

MORE INFORMATION

https://ngc.nvidia.com/catalog/containers/hpc:milc

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X
(CPU-Only)
2x L40S4x L40S8x L40S
MILCTotal Time (sec)Apex Mediumno13,9182,0841,082603
MILCNRFApex Mediumyes1x6x11x21x

NAMD

Molecular Dynamics

Designed for high-performance simulation of large molecular systems

VERSION

3.1.a3

ACCELERATED FEATURES

  • Full electrostatics with PME and most simulation features

SCALABILITY

Up to 100M atom capable, multi-GPU, single node

MORE INFORMATION

http://www.ks.uiuc.edu/Research/namd/

https://ngc.nvidia.com/catalog/containers/hpc:namd

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X
(CPU-Only)
2x L40S4x L40S8x L40S
NAMD [apoa1_npt_cuda]ns/dayapoa1_npt_cudayes100.727571,5062,982
NAMD [apoa1_npt_cuda]NRFapoa1_npt_cudayes1x8x15x30x
NAMD [LaINDY ColVars]ns/dayLaINDY ColVarsyes42.90146292582
NAMD [LaINDY ColVars]NRFLaINDY ColVarsyes1x3x7x14x
NAMD [apoa1_nve_cuda]ns/dayapoa1_nve_cudayes108.796031,2002,388
NAMD [apoa1_nve_cuda]NRFapoa1_nve_cudayes1x6x11x22x
NAMD [stmv_npt_cuda]ns/daystmv_npt_cudayes10.533468136
NAMD [stmv_npt_cuda]NRFstmv_npt_cudayes1x3x6x13x
NAMD [COVID-19 Spike Assembly]ns/dayCOVID-19 Spike Assemblyyes0.7935-
NAMD [COVID-19 Spike Assembly]NRFCOVID-19 Spike Assemblyyes1x4x6x-
NAMD [stmv_nve_cuda]ns/daystmv_nve_cudayes10.874692186
NAMD [stmv_nve_cuda]NRFstmv_nve_cudayes1x4x8x17x

NAMD is measured by running multiple independent instances using MPS
Trifan A, Gorgun D, Salim M, et al. Intelligent resolution: Integrating Cryo-EM with AI-driven multi-resolution simulations to observe the severe acute respiratory syndrome coronavirus-2 replication-transcription machinery in action. The International Journal of High Performance Computing Applications. 2022;36(5-6):603-623. doi:10.1177/10943420221113513
D. B. Sauer, N. Trebesch, J. J. Marden, N. Cocco, J. Song, A. Koide, S. Koide, E. Tajkhorshid, and D.-N. Wang. "Structural basis for the reaction cycle of DASS dicarboxylate transporters." eLife. 9, e61350 (2020). https://doi.org/10.7554/eLife.61350


RTM

Geoscience

Reverse time migration (RTM) modeling is a critical component in the seismic processing workflow of oil and gas exploration

VERSION

nvidia_2025_11

ACCELERATED FEATURES

  • Batch algorithm

SCALABILITY

Multi-GPU and Multi-Node

MORE INFORMATION

http://www.tsunamidevelopment.com/assets/rtm.pdf

ApplicationMetricTest ModulesBigger is betterAMD Dual Genoa 9684X
(CPU-Only)
2x L40S4x L40S8x L40S
RTM [Isotropic Radius 4]Mcell/sIsotropic Radius 4yes21,04784,416168,059336,127
RTM [Isotropic Radius 4]NRFIsotropic Radius 4yes1x4x8x16x
RTM [TTI Radius 8 1-pass]Mcell/sTTI Radius 8 1-passyes7,21328,94057,180114,174
RTM [TTI Radius 8 1-pass]NRFTTI Radius 8 1-passyes1x4x8x16x
RTM [TTI RX 2Pass mgpu]Mcell/sTTI RX 2Pass mgpuyes7,21315,59330,97661,875
RTM [TTI RX 2Pass mgpu]NRFTTI RX 2Pass mgpuyes1x2x4x9x