Today, we’re excited to announce the Public Preview of a CentOS 7-based Virtual Machine (VM) image optimized for high performance computing (HPC) workloads, with a focus on tightly-coupled MPI workloads.
Improved MPI performance translates directly to improved application scaling, expanding the set of HPC workloads that run efficiently on Google Cloud. However, building a VM image that includes these best practices requires systems expertise and knowledge of Google Cloud. Starting with an HPC-optimized image can make it easier to maintain an image.
The HPC VM image makes it easy and quick to instantiate VMs that are tuned to achieve optimal CPU and network performance on Google Cloud. The HPC VM image is available at no additional cost via theGoogle Cloud Marketplace .
Continue reading below for details about the HPC VM image and its benefits, or skip ahead to ourdocumentation and quickstart guide to start creating instances using the HPC VM image today!
Benefits of using the HPC VM image
The HPC VM image is pre-configured and regularly maintained, providing the following advantages to HPC customers on Google Cloud :
Using the HPC VM image is simple and easy, as it is a drop-in replacement for the standard CentOS 7 image.
Customer story : Scaling SDPB solver using CloudyCluster and HPC VM image
To expand the collaboration’s computation capabilities, Landry wanted to see how SDPB would scale on Google Cloud. Working withOmnibond CloudyCluster and leveraging the HPC VM image, Landry achieved comparable performance and scaling to an on-premises cluster at Yale, based on Intel Xeon Gold 6240 processors and Infiniband FDR.
Google Cloud’s C2-Standard-60 instance type is based on the second-generation Intel Xeon Scalable Processor. The C2 family of instances can utilize placement policies to reduce inter-node latency, ideal for tightly-coupled MPI workloads. CloudyCluster leverages the HPC VM image and placement policy for the C2 family out of the box, making it seamless for the researcher. These tests show the ability to scale low latency workloads across many instances in Google Cloud.
If you would like to try out the HPC VM image with Omnibond CloudyCluster, an updated version of Omnibond CloudyCluster using the HPC VM image is available in theGoogle Cloud Marketplace . This version also comes complete with NSF fundedOpen OnDemand led by Ohio Supercomputer Center, making it easy for system administrators to provide web access to HPC resources.
What’s included in the HPC VM image?
Tunings and Optimizations
The current release of the HPC VM image focuses on tunings for tightly coupled HPC workloads and implements the following best-practices for optimal MPI application performance :
The benefits of these tunings can vary from application to application and we recommend that you benchmark your applications to find the most efficient or cost-effective configuration.
Performance measurement using HPC benchmarks
We have compared the performance of the HPC VM image vs. the default CentOS 7 image across both the Intel MPI Benchmarks and real application benchmarks for Finite Element Analysis (ANSYS LS-DYNA), Computational Fluid Dynamics (ANSYS Fluent) and Weather Modeling (WRF).
The following versions of the HPC VM image and CentOS Image were used for the benchmarks in this section :
Intel MPI Benchmark (IMB) Ping-Pong
IMB Ping-Pong measures the ping-pong latency of transferring a fix-sized message between two ranks over a pair of VMs. On average, we saw that the HPC VM image reduces inter-node ping-pong latency by up to 50% compared to the default CentOS 7 Image (baseline).
Benchmark setup
Results
Intel MPI Benchmark (IMB) AllReduce
The IMB AllReduce benchmark measures the collective latency among multiple ranks across VMs. It reduces a vector of a fixed length with the MPI_SUM operation. We show 1 PPN (process-per-node) results to represent the case when we have a 1 MPI rank / node and 30 threads / rank and 30 PPN results where there are 30 MPI ranks / node and 1 thread / rank. We saw that the HPC VM image reduces AllReduce latency by up to 40% for 240 MPI ranks across 8 nodes (30 processes per node) compared to the default CentOS 7 image (baseline).
Benchmark setup
IMB-MPI1 AllReduce -iter 50000 -npmin <#vm
Results
HPC application benchmarks : LS-DYNA, Fluent and WRF
At an application level, the HPC VM image yielded up to a 25% performance improvement to the ANSYS LS-DYNA “3 cars” vehicle collision simulation benchmark when running on 240 MPI ranks across 8 Intel Xeon processor based C2 instances. With ANSYS Fluent and WRF, we observed up to 6% performance improvement using the HPC VM image in comparison with the default CentOS Image.
Benchmark setup
Results
What’s next? SchedMD Slurm support and additional Linux distributions
We are continuing to work with our HPC partners to integrate the HPC VM image with partner offerings by default. Starting next month, HPC customers who use Slurm will be able to start HPC-ready clusters that make use of the HPC VM image by default (preview version is available here ).
For customers who are looking for HPC Enterprise Linux options and support, SUSE is working with Google on a SUSE Enterprise HPC VM image that has been optimized for Google Cloud. If you’re interested in learning more about SUSE Enterprise HPC VM image, or have a requirement for additional integrations or Linux distributions, pleasecontact us .
Special thanks to Jiuxing Liu, Tanner Love, Jian Yang, Hongbo Lu and Pallavi Phene for their contributions.
By Pavan Kumar(Product Manager) and Jason Zhang(Software Engineering Manager)
Source : Google Cloud Blog
For enquiries, product placements, sponsorships, and collaborations, connect with us at . We'd love to hear from you!
Our humans need coffee too! Your support is highly appreciated, thank you!
Getting Vaccines Into Local Communities Safely And Effectively
Next Article
Introducing HPC VM Images—Pre-Tuned For Optimal Performance
You are what you eat. We’ve all been told this, but the truth is what we eat is…
Let’s say your new application has been a hit. Usage is growing across the world and you now…
Late last year, Google Cloudjoinedthe EOS community, a leading open-source platform for blockchain innovation and performance, and is…
Today’s fastest growing companies are building and scaling in the cloud. Turning to a modern cloud as a…
J-18808-Ljbffr
Google Cloud • , , Spain, España