ABSTRACT
Forest dynamic simulation plays an important role in studying the long-term development of forest structure and composition. Seeds dispersal, the initial step of forest dynamic process, forms the vital component of forest dynamic simulation. However, it is time-consuming to perform the realistic and spatially-explicit seeds dispersal process. We propose a CUDA-accelerated parallel algorithm, by partitioning the seeds dispersal problem into fine sub-problems and using CUDA programming model to map these sub-problems to parallel processing threads. Meanwhile, we implement the parallel algorithm with CUDA optimization strategies to make full use of the GPU computing capacity. The Experimental results show a good speedup compared to the naive implementation on CPU.
- Clark, J. S., Miles, S., Ruth, K., Eric, M., and Janneke, H. 1999. Seed dispersal near and far: Patterns across temparate and tropical forests. Ecology 80, 1475--1494.Google ScholarCross Ref
- Han, B., and Zhou, B. 2009. A performance model for general-purpose computation on gpu. Journal of Computer-Aided Design & Computer Graphics 21, 1219--1226.Google Scholar
- Harald, B. 2001. A review of forest gap models. Climate Change 51, 259--305.Google ScholarCross Ref
- Lepage, P. T., Canham, C. D., and Bartemucci, P. 2000. Seed abundance versus substrate limitation of seedling recruitment in northern temperate forests of british columbia. Canadian Journal of Forest Research 30, 415--427.Google ScholarCross Ref
- Lin, J., Tang, M., and Tong, R. 2010. Gpu accelerated biological sequence alignment. Journal of Computer-Aided Design & Computer Graphics 22, 420--426.Google Scholar
- Nickolls, J., Buck, I., Garland, M., and Skadron, K. 2008. Scalable parallel programming with cuda. Queue - GPU Computing 6, 2 (Mar./Apr.), 40--53. Google ScholarDigital Library
- Sang, W., Ma, K., Chen, L., and Zheng, Y. 1999. A brief review on forest dynamics models. Chinese Bulletin of Botany 16, 3, 193--200.Google Scholar
- Wu, L., Zhong, C., Zheng, Y., Liu, S., Zhuo, C., and Chen, X. 2010. Accelerating lattice-boltzmann method with multi-gpus. Journal of Computer-Aided Design & Computer Graphics 22, 1932--1939.Google Scholar
Index Terms
Design and implementation of seeds dispersion on graphic processor unit
Recommendations
Accelerating the discontinuous Galerkin method for seismic wave propagation simulations using the graphic processing unit (GPU)-single-GPU implementation
We have successfully ported an arbitrary high-order discontinuous Galerkin (ADER-DG) method for solving the three-dimensional elastic seismic wave equation on unstructured tetrahedral meshes to an Nvidia Tesla C2075 GPU using the Nvidia CUDA programming ...
Performance Comparison of CUDA and OpenACC Based on Optimizations
HPCCT '18: Proceedings of the 2018 2nd High Performance Computing and Cluster Technologies ConferenceBased on various optimizations, this paper presents a performance comparison between CUDA and OpenACC using 19 kernels in 10 benchmarks. The performance analysis focuses on programming models, optimization technologies and underlying compilers. It ...
Out-of-core implementation for accelerator kernels on heterogeneous clouds
Cloud environments today are increasingly featuring hybrid nodes containing multicore CPU processors and a diverse mix of accelerators such as Graphics Processing Units (GPUs), Intel Xeon Phi co-processors, and Field-Programmable Gate Arrays (FPGAs) to ...
Comments