Performance targets were exceeded in machine learning and molecular dynamics, which are critical to advancing scientific research and discovery
Stuttgart, Germany—May 31, 2022—Inspur information, a leading provider of IT infrastructure solutions, and MEGWARE, a provider of high-performance computing (HPC) solutions in Europe, have joined forces to strengthen the scientific research capabilities of the Friedrich-Alexander- Universität Erlangen-Nürnberg (FAU) through its Erlangen National Center for High-Performance Computing ([email protected]). The advanced GPU cluster, powered by Inspur GPU servers, is fully operational and has far exceeded its initial machine learning and molecular dynamics performance targets.
FAU is a leading scientific research institution in Europe and ranks second among the most innovative European universities according to Reuters. It is renowned for its science and engineering in fields such as materials science, chemistry, life sciences, computer science and biomedical engineering. Machine learning (ML) has become increasingly important for many research areas at FAU, especially in computer science. In addition to ML, molecular dynamics (MD) simulations have made possible the numerical simulation of many real and complex physical models at FAU, and the demand for simulating these models using HPC is growing exponentially.
To meet these huge parallel computing needs, [email protected] sought to build the largest computing cluster in the university’s history, greatly expanding its research and HPC capabilities. As part of the “NHR Alliance”, which is a federation of nine Tier 2 IT centers in Germany, the new [email protected] would also be open to researchers from other German universities. A European call for tenders by [email protected] led to the selection of Inspur Information and MEGWARE due to their combination of powerful GPU servers, system integration and optimization expertise.
The new Inspur-powered “Alex” GPU cluster is the core component of [email protected]HPC infrastructure to handle the rapidly growing demands on computing resources for ML and MD in scientific research. Alex is part of the TOP500 and Green500 of the most powerful and energy efficient HPC systems in the world. It is comprised of 32 Inspur NF5488A5 and 38 NF5468A5 GPU servers, providing a total of 256 NVIDIA A100 Tensor Core GPUs and 304 NVIDIA A40 Tensor Core GPUs for maximum GPU computing performance. In addition to the huge GPU resources available, there are 140 AMD EPYC 7713 processors and the total memory capacity is nearly 50TB. The cluster is interconnected via a high-speed InfiniBand HDR network, resulting in versatile computing of high-level with excellent MD and AI performance that runs a host of research-specific software with various hardware, while supporting massive ML datasets, molecular dynamics simulations and improving training efficiency.
As the fundamental component of the Alex GPU cluster, Inspur GPU servers deliver powerful performance:
- Inspur NF5488A5 is equipped with 8 NVIDIA A100 GPUs and 2 AMD EPYC 7713 64-core processors in a 4U chassis and uses an NVSwitch GPU interconnect. The design emphasizes performance while reducing operating and maintenance costs and eases installation.
- Inspur NF5468A5 is equipped with 8 NVIDIA A40 Tensor Core GPUs and 2 AMD EPYC 7713 CPUs in a 4U chassis. It uses a high-speed PCIe 4.0 interface for CPUs and GPUs without using a PCIe switch, which eliminates communication delays between CPUs and GPUs and improves computing performance.
Inspur and MEGWARE’s HPC solution has significantly enhanced FAU’s scientific research capabilities. Performance for model training and inference exceeded initial FAU expectations by 115% after Inspur provided hardware recommendations better optimized for FAU needs, including the use of Inspur’s flagship NF5488A5 servers and NF5468A5.
[email protected]Alex’s cluster running on Inspur GPU servers successfully runs applications such as ML (Tensorflow, PyTorch), chemical applications (Quantum Espresso and VASP) and scientific research software such as NAMD, LAMMPS, AMBER, GROMACS , etc. FAU and German universities are now able to perform scientific research that was otherwise impossible just a few years ago, and they are now at the forefront of scientific exploration.
Inspur owns the world’s leading GPU server product portfolio, delivering industry-leading performance, comprehensive products, and rapid time-to-market capabilities. Inspur GPU servers are widely used in image recognition, speech recognition, natural language processing and other fields. Inspur offers a rich selection of NVLink A100 GPU Servers and PCIe GPUs. Based on innovative designs and comprehensive performance optimization capabilities, Inspur has been one of the top performers in MLPerf, a leading AI benchmark suite, receiving 91 top single node performance rankings since MLPerf Inference 0.7. According to IDC, the global AI server market reached $6.66 billion in 1H2021, with Inspur accounting for 20.2% market share, maintaining Inspur’s position as the largest AI server vendor in the world.
About Inspur Information
Inspur Information is a leading provider of data center infrastructure, cloud computing and artificial intelligence solutions. It is the 2nd largest server manufacturer in the world. Through engineering and innovation, Inspur Information delivers industry-leading hardware design and broad product offerings to address important technology sectors such as open computing, cloud data centers and AI. Performance-optimized and purpose-built, our world-class solutions enable customers to tackle specific workloads and real-world challenges. To learn more, visit https://www.inspursystems.com.
# # #
For more information, contact:
Interprose for Inspur Information
+49 (0)151 210 289 97