Sponsored Post
Novosibirsk State University is one of the major research and educational centers in Russia and one of the largest universities in Siberia. When researchers at the University were looking to develop and optimize a software tool for numerical simulation of magnetohydrodynamics (MHD) problems with hydrogen ionization part of an astrophysical objects simulation (AstroPhi) projectthey needed to optimize the tools performance on Intel Xeon Phi processor-based hardware. The team turned to Intel Advisor and Intel Trace Analyzer and Collector. This resulted in a performance speed-up of 3X, cutting the standard time for calculating one problem from one week to just two days.
Mathematical modeling plays a key role in modern astrophysics. It is the universal tool for research of non-linear evolutionary processes in the universe. Modeling the complex astrophysical processes in high resolution takes the most powerful supercomputers. The Universitys AstroPhi project develops astrophysical code for massively parallel supercomputers with Intel Xeon Phi processors. This valuable project helps students learn to create numerical simulation code for massively parallel supercomputers. The students also learn about modern HPC hardware architecturespreparing them to develop tomorrows exascale supercomputers.
The use of Intel Advanced Vector Extensions for Intel Xeon Phi processors gave us the maximum code performance compared with other architectures available on the market, said Igor Kulikov, Assistant Professor, Novosibirsk State University.
Numerical Method
The team designed the project using a numerical method shown in the figure below. The benefits of this high-order method included:
The first three benefits are the key factors for realistic modeling of all the significant physical effects in astrophysical problems. The simplicity of the method, plus the small number of MPI send/receive operations, provides efficient parallelizationand potentially infinite scalability in terms of weak scalability.
Massively Parallel Architecture
The team co-designed the new solver for massively parallel architecture based on Intel Xeon Phi processors. Designed to help eliminate node bottlenecks and simplify code modernization, the bootable processors provided the power efficiency the team needed to handle the most demanding high-performance computing applications.
The team based the solver on Intel Advanced Vector Extensions 512 (Intel AVX-512) instructions, which deliver 512-bit SIMD support and enable programs to pack eight double-precision or 16 single-precision floating-point numbers, or eight 64-bit integers, or 16 32-bit integers within the 512-bit vectors. This enables processing of 2X the number of data elements that AVX/AVX2 can process with a single instruction, and 4X that of SSE.
The use of Intel Advanced Vector Extensions 512 for Intel Xeon Phi processors gave us the maximum code performance compared with other architectures available on the market, said Igor Kulikov, assistant professor at NSU.
Optimizing the Code
A key aspect of the AstroPhi project was optimizing the code for maximum performance on the Intel Xeon Phi processors. Before optimization, the team had some problems with vector dependencies and vector sizes. The goals for optimizing the code were to remove vector dependencies and optimize memory load operations, efficiently adapting vector and array sizes for the Intel Xeon Phi architecture. The team used Intel Advisor and Intel Trace Analyzer and Collector, two tools that are part of Intel Parallel Studio XE, for the optimization.
Intel Parallel Studio XE is a comprehensive software development suite that helps developers maximize application performance on todays and future processors by taking advantage of the ever-increasing processor core count and vector register width.
Intel Advisor is a software tool based on the fact that for modern processors, it is crucial to both vectorize (use AVX* or SIMD* instructions) and thread software to realize the full performance potential of the processor. Using this tool, the team was able to perform a roofline analysis highlighting poor-performing loops and showing performance headroom for each loop, identifying which can be improved and which are worth improving.
Intel Advisor made it easier to find the cause of bottlenecks and decide on next optimization steps, explained Igor Chernykh, assistant professor at NSU. It provided data to help us forecast the performance gain before we invested significant effort in implementation.
Intel Advisor sorted loops by potential gain, making compiler reports easier to read by showing messages on the source, and giving the project team tips for effective vectorization. It also provided key data like trip counts, data dependencies, and memory access patterns make vectorization safe and efficient.
Intel Trace Analyzer and Collector was another help in optimizing the code. This graphical tool helped the team understand MPI application behavior, quickly find bottlenecks, improve correctnessand, ultimately, maximize the tools performance on Intel architecture. It includes MPI communications profiling and analysis features that helped to improve weak and strong scaling.
Results
After all the improvements and optimizations, the team achieved 190 GFLOPS performance and 0.3 FLOP/byte arithmetic intensity, with 100 percent mask utilization and 573 GB/s memory bandwidth.
Using Intel Advisor and Intel Trace Analyzer and Collector, we were able to remove vector dependencies, optimize load operations, and adapt vector and array size for the Intel Xeon Phi architecture, explained Kulikov. This optimization gave the opportunity to run 3X more variants of astrophysical tests.
Download your free 30-day trial of Intel Parallel Studio XE
Visit link:
Case Study: More Efficient Numerical Simulation in Astrophysics - insideBIGDATA
- Rotational spectra of isotopic species of methyl cyanide, CH_3CN, in their ground vibrational states up to terahertz frequencies [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Cosmological parameter extraction and biases from type Ia supernova magnitude evolution [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Continuous monitoring of pulse period variations in Hercules X-1 using Swift/BAT [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Constraining the ortho-to-para ratio of H{_2} with anomalous H{_2}CO absorption [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- A photometric and spectroscopic study of the new dwarf spheroidal galaxy in Hercules - Metallicity, velocities, and a clean list of RGB members [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Luminosities and mass-loss rates of SMC and LMC AGB stars and red supergiants [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Electron beam – plasma system with the return current and directivity of its X-ray emission [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- The propagation of the shock wave from a strong explosion in a plane-parallel stratified medium: the Kompaneets approximation [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Analysis of hydrogen-rich magnetic white dwarfs detected in the Sloan Digital Sky Survey [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Letter: Centaurus A as TeV \gamma-ray and possible UHE cosmic-ray source [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Young pre-low-mass X-ray binaries in the propeller phase - Nature of the 6.7-h periodic X-ray source 1E 161348-5055 in RCW 103 [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Radiative rates and electron impact excitation rates for transitions in Cr VIII [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Solar granulation from photosphere to low chromosphere observed in Ba II 4554 Å line [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Does the HD 209458 planetary system pose a challenge to the stellar atmosphere models? [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Effect of asymmetry of the radio source distribution on the apparent proper motion kinematic analysis [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Destriping CMB temperature and polarization maps [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Search for cold debris disks around M-dwarfs. II [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Precise data on Leonid fireballs from all-sky photographic records [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- An X-ray view of 82 LINERs with Chandra and XMM-Newton data [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Radio observations of ZwCl 2341.1+0000: a double radio relic cluster [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Candidate free-floating super-Jupiters in the young \sigma Orionis open cluster [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- The metallicity gradient as a tracer of history and structure: the Magellanic Clouds and M33 galaxies [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- XMMSL1 J060636.2-694933: an XMM-Newton slew discovery and Swift/Magellan follow up of a new classical nova in the LMC [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- The inner rim structures of protoplanetary discs [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- The solar Ba{\sf II} 4554 Å line as a Doppler diagnostic: NLTE analysis in 3D hydrodynamical model [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Magnetic evolution of superactive regions - Complexity and potentially unstable magnetic discontinuities [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Low-mass protostars and dense cores in different evolutionary stages in IRAS 00213+6530 [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- PMAS optical integral field spectroscopy of luminous infrared galaxies - I. The atlas [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- First AGILE catalog of high-confidence gamma-ray sources [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Radiative hydrodynamics simulations of red supergiant stars - I. interpretation of interferometric observations [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Extrasolar planets and brown dwarfs around A–F type stars - VII. \theta Cygni radial velocity variations: planets or stellar phenomenon? [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Cosmic rays and the magnetic field in the nearby starburst galaxy NGC 253 - II. The magnetic field structure [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Physical structure and water line spectrum predictions of the intermediate mass protostar OMC2-FIR4 [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- The bright galaxy population of five medium redshift clusters - II. Quantitative galaxy morphology [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Dust in brown dwarfs and extra-solar planets - II. Cloud formation for cosmologically evolving abundances [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- The quiet Sun magnetic field observed with ZIMPOL on THEMIS - I. The probability density function [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Complexity in the sunspot cycle [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Properties and nature of Be stars - 26. Long-term and orbital changes of \zeta Tauri [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- The massive Wolf-Rayet binary LSS 1964 (=WR 29) - II. The V light curve [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Supernova progenitor stars in the initial range of 23 to 33 solar masses and their relation with the SNR Cassiopeia A [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- The Hertzsprung-Russell Diagram of Star Clusters [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Table of the 10 Brightest stars within 10 Parsecs of the Sun [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- The Hertzsprung-Russell Diagram of the Nearest Stars [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Magnitude and Color in Astronomy [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Stellar Types [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Brown Dwarfs [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Spotting the Minimum [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- The Structure and Evolution of Brown Dwarfs [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- No Bang from the Big Bang Machine [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- The Sizes of the Stars and the Planets [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- An Implausible Light Thrust [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- the Masses of Degenerate Objects [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Degeneracy Pressure [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Introduction to Degenerate Objects [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- The Radii of Degenerate Objects [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- The Inevitability of Black Holes [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Scientific Pig-Out [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- The Neutrino Cooling of Degenerate Dwarfs [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- The Neutrino Cooling of Neutron Stars [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Overview of Supernovae [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Energetics of Thermonuclear Supernovae [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Thermonuclear Supernovae [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Nuclear Reactions in Thermonuclear Supernovae [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Core-Collapse Supernovae [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Neutrinos and SN 1987A [Last Updated On: November 8th, 2009] [Originally Added On: November 8th, 2009]
- Revealing the sub-AU asymmetries of the inner dust rim in the disk around the Herbig Ae star R Coronae Austrinae [Last Updated On: December 13th, 2009] [Originally Added On: December 13th, 2009]
- Probing the dust properties of galaxies up to submillimetre wavelengths - I. The spectral energy distribution of dwarf galaxies using LABOCA [Last Updated On: December 13th, 2009] [Originally Added On: December 13th, 2009]
- On the physical origin of the second solar spectrum of the Sc II line at 4247 Å [Last Updated On: December 13th, 2009] [Originally Added On: December 13th, 2009]
- On detecting the large separation in the autocorrelation of stellar oscillation times series [Last Updated On: December 13th, 2009] [Originally Added On: December 13th, 2009]
- Imaging the spotty surface of Betelgeuse in the H band [Last Updated On: December 13th, 2009] [Originally Added On: December 13th, 2009]
- Chandra observation of Cepheus A: the diffuse emission of HH 168 resolved [Last Updated On: December 13th, 2009] [Originally Added On: December 13th, 2009]
- A planetary eclipse map of CoRoT-2a - Comprehensive lightcurve modeling combining rotational-modulation and transits [Last Updated On: December 13th, 2009] [Originally Added On: December 13th, 2009]
- The chemical composition of carbon stars. The R-type stars [Last Updated On: December 13th, 2009] [Originally Added On: December 13th, 2009]
- Flow instabilities of magnetic flux tubes - IV. Flux storage in the solar overshoot region [Last Updated On: December 13th, 2009] [Originally Added On: December 13th, 2009]
- Fragmentation of a dynamically condensing radiative layer [Last Updated On: December 13th, 2009] [Originally Added On: December 13th, 2009]
- Temporal variations of the CaXIX spectra in solar flares [Last Updated On: December 13th, 2009] [Originally Added On: December 13th, 2009]
- Deuterium chemistry in the Orion Bar PDR - “Warm” chemistry starring CH_{2}D^+ [Last Updated On: December 13th, 2009] [Originally Added On: December 13th, 2009]
- Metal abundances in the cool cores of galaxy clusters [Last Updated On: December 13th, 2009] [Originally Added On: December 13th, 2009]
- The nature of the X-ray binary IGR J19294+1816 from INTEGRAL, RXTE, and Swift observations [Last Updated On: December 13th, 2009] [Originally Added On: December 13th, 2009]
- Relating basic properties of bright early-type dwarf galaxies to their location in Abell 901/902 [Last Updated On: December 13th, 2009] [Originally Added On: December 13th, 2009]