Abstract:
Numerical weather model is a basic method and tool of weather forecasting. As the development of technology, the model resolution has been improved greatly, it, however, brings an exponentially-increasing computation cost. The timeliness for the weather forecasting puts forward more advanced requests to the program designing and the performance of computing platform.In this paper, we take GRAPES_GLOBAL as an example, to explore the feasibility of hybrid programming and optimization on the Sunway TaihuLight (new domestic high-performance computing system). By analyzing the program structure and calculation principle, we find hotspot subroutines which are in fluencing the parallel ef ficiency and extensibility, then we design an ef ficient communication strategy for each subroutine in the long-wave radiation by adjusting the structure of program and adding the coprocessor acceleration indicator statements. The implement of the hybrid programming of MPI parallel computation on CPU and OpenACC parallel computation on Many-Core shows that an acceleration ratio of hotspot subroutines is 3-10 times faster than before. This experimental results may con firm that the methods can inherit the most of the original MPI parallel computing codes and reduce the developing costs signi ficantly.